SlideShare a Scribd company logo
Prepared by:
Doaa Gamal
Lecturer Assistant
Faculty of Engineering – Suez Canal University
1
Outline
 Introduction
 Applications
 History of time and pitch modification
 Time-domain techniques
 Frequency-domain techniques
 Parametric techniques
 conclusion
2
Introduction
 Timescale modification: slow down or speed up a given
signal, possibly in a time-varying manner, without
altering the signal’s spectral content (and in particular
its pitch when the signal is periodic).
 pitch-scale modification: the aim is to modify the
pitch of the signal, possibly in a time-varying manner,
without altering the signal’s time-evolution (and in
particular, its duration).
3
Introduction
 time-scaling or pitch-scaling is not easy because time
and frequency characteristics of a signal, being related
by the Fourier transform, are not independent.
 the simplest method of time scaling a sound is to just
replay it at a different rate. When using magnetic
tapes, for example, the tape speed may be varied, but
this incurs a simultaneous change in the pitch of the
signal.
4
applications
 Speech Synthesizers
 Post-synchronization
 Data compression
 Reading for the blind:
 Foreign language learning
 Voice transformation
5
History of time and pitch modification
Signal
type
method technique
Analog tape recorder machine Time-domain
Digital Digital tape recorder Time-domain
Digital Periodicity-driven
methods
Time-domain
Digital STFT Frequency-domain
Digital Linear prediction models
& sinusoidal models
parametric models
6
time and pitch modification
techniques
Non-
parametric
Frequency-
domain
techniques
Time-domain
techniques
Parametric
Time-domain techniques
Pitch independent methods
 requires very few calculations
 very well to real-time
implementation.
 prone to artifacts because no
precaution is taken at the
splicing points, other than to
guarantee continuity.
Time-domain techniques
Periodicity-driven methods
The most popular method using pitch information is
TD-PSOLA
modification factors (between 0.5 and 2).
TD-PSOLA analysis-synthesis
process without modification
1 2
3
TD-PSOLA analysis-synthesis
process without modification
The output speech waveform of PSOLA analysis-synthesis is
perceptually indistinguishable from the original waveform.
4
pitch-scaling (lowering) using
TD-PSOLA
12
time-scaling (lengthening) using
TD-PSOLA
13
Computation of synthesis pitch-
marks for pitch modification
14
Computation of synthesis pitch-marks
for pitch modification (raising)
Computation of synthesis pitch-
marks for duration modification
16
Computation of synthesis pitch-marks for
time-scale modification (lengthening)
From the synthesis pitch-marks to
the modified waveform
The simple way is
 calculate the nearest analysis pitch-mark to the virtual
pitch-mark is found
 The frames which corresponds to the nearest analysis
pitch-marks are centered on the synthesis pitch-marks.
 The overlapping regions are added together.
18
From the synthesis pitch-marks to
the modified waveform
19
• In more sophisticated systems, the mapping involves
linear interpolation between the two successive
short-time analysis signals lying the closest to the
virtual pitch-mark
 The perceptual quality of the prosody modified speech
using PSOLA methods depends on the accuracy of the
pitch markers estimation. As estimating epochs from
speech provide more accurate pitch marker locations
LP-PSOLA & FD-PSOLA
 The Frequency-Domain PSOLA (FD-PSOLA) and the
Linear-Predictive PSOLA (LP-PSOLA) approaches are
theoretically more appropriate than the time-domain
PSOLA method for pitch-scale modifications because
they provide independent control over the spectral
envelope of the synthesis signal.
Frequency-domain techniques
 Frequency-domain algorithms operate with a short-
time spectrum of the signal (phase-vocoder)
1. Calculate shift-time Fourier transform (STFT) of a
signal
2. Modify phases of each frequency channel.
3. Synthesize a signal using inverse STFT with a different
time stride
21
Parametric techniques
 linear prediction models
 sinusoidal models
 the Harmonic plus Noise Model, HNM
 wideband models
 STRAIGHT
conclusion
 Time-domain approaches are computationally cheap
and perform good for small modification factors.
 Good for real-time implementations
 possible to incorporate such systems in consumer
products such as telephone answering systems.
 suffering from echos.
 In particular, time or pitch-scale modifications by
large factors cannot be carried out by time-domain
methods and usually require the use of the more
elaborate frequency-domain techniques.
23
conclusion
 Frequency-domain techniques are capable of
providing very high quality output. However, they still
suffer from some distortion, mainly due to the effects
of “phase dispersion.”
 computationally intensive.
24
conclusion
 Parametric techniques tend to outperform non-
parametric methods when the adequation between the
signal to be modified and the underlying model is
good. When this is not the case however, the methods
break down and the results are unreliable.
 Parametric techniques usually are more costly in terms
of computations, because they require an explicit
preliminary analysis stage for the estimation of the
model parameters.
25
Pitch and time scale modifications
Pitch and time scale modifications

More Related Content

What's hot

Butterworth filter design
Butterworth filter designButterworth filter design
Butterworth filter designSushant Shankar
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basics
sivakumar m
 
Switching and Traffic of Mobile Communication
Switching and Traffic of Mobile CommunicationSwitching and Traffic of Mobile Communication
Switching and Traffic of Mobile Communication
Md. Saddam Hossain Noyon
 
dsp-processor-ppt.ppt
dsp-processor-ppt.pptdsp-processor-ppt.ppt
dsp-processor-ppt.ppt
Krishnavenimanickam2
 
Analog to Digital conversion
Analog to Digital conversionAnalog to Digital conversion
Analog to Digital conversion
Chetan Pandey
 
Quadrature amplitude modulation
Quadrature amplitude modulationQuadrature amplitude modulation
Quadrature amplitude modulation
AanchalKumari4
 
Ec 2401 wireless communication unit 3
Ec 2401 wireless communication   unit 3Ec 2401 wireless communication   unit 3
Ec 2401 wireless communication unit 3
JAIGANESH SEKAR
 
Real-Time Signal Processing: Implementation and Application
Real-Time Signal Processing:  Implementation and ApplicationReal-Time Signal Processing:  Implementation and Application
Real-Time Signal Processing: Implementation and Application
sathish sak
 
Digitization of Audio.ppt
Digitization of Audio.pptDigitization of Audio.ppt
Digitization of Audio.pptVideoguy
 
Digital Signal Processing
Digital Signal ProcessingDigital Signal Processing
Digital Signal Processing
Sandip Ladi
 
Real-Time Jitter Measurements
Real-Time Jitter Measurements Real-Time Jitter Measurements
Real-Time Jitter Measurements
Rohde & Schwarz North America
 
NYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISINYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISI
FAIZAN SHAFI
 
819 Static Channel Allocation
819 Static Channel Allocation819 Static Channel Allocation
819 Static Channel Allocation
techbed
 
MPEG/Audio Compression
MPEG/Audio CompressionMPEG/Audio Compression
MPEG/Audio Compression
Daniel Brewster
 
Combating fading channels (1) (3)
Combating fading channels (1) (3)Combating fading channels (1) (3)
Combating fading channels (1) (3)
liril sharma
 
Equalization
EqualizationEqualization
Equalization
@zenafaris91
 
Video compression
Video compressionVideo compression
Video compression
nnmaurya
 
DIGITAL SIGNAL PROCESSING BASED ON MATLAB
DIGITAL SIGNAL PROCESSING BASED ON MATLABDIGITAL SIGNAL PROCESSING BASED ON MATLAB
DIGITAL SIGNAL PROCESSING BASED ON MATLAB
Prashant Srivastav
 
Digital Signal Processing
Digital Signal Processing Digital Signal Processing
Digital Signal Processing
Sri Rakesh
 
carrier synchronization
carrier synchronizationcarrier synchronization
carrier synchronization
bharadwajareddy
 

What's hot (20)

Butterworth filter design
Butterworth filter designButterworth filter design
Butterworth filter design
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basics
 
Switching and Traffic of Mobile Communication
Switching and Traffic of Mobile CommunicationSwitching and Traffic of Mobile Communication
Switching and Traffic of Mobile Communication
 
dsp-processor-ppt.ppt
dsp-processor-ppt.pptdsp-processor-ppt.ppt
dsp-processor-ppt.ppt
 
Analog to Digital conversion
Analog to Digital conversionAnalog to Digital conversion
Analog to Digital conversion
 
Quadrature amplitude modulation
Quadrature amplitude modulationQuadrature amplitude modulation
Quadrature amplitude modulation
 
Ec 2401 wireless communication unit 3
Ec 2401 wireless communication   unit 3Ec 2401 wireless communication   unit 3
Ec 2401 wireless communication unit 3
 
Real-Time Signal Processing: Implementation and Application
Real-Time Signal Processing:  Implementation and ApplicationReal-Time Signal Processing:  Implementation and Application
Real-Time Signal Processing: Implementation and Application
 
Digitization of Audio.ppt
Digitization of Audio.pptDigitization of Audio.ppt
Digitization of Audio.ppt
 
Digital Signal Processing
Digital Signal ProcessingDigital Signal Processing
Digital Signal Processing
 
Real-Time Jitter Measurements
Real-Time Jitter Measurements Real-Time Jitter Measurements
Real-Time Jitter Measurements
 
NYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISINYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISI
 
819 Static Channel Allocation
819 Static Channel Allocation819 Static Channel Allocation
819 Static Channel Allocation
 
MPEG/Audio Compression
MPEG/Audio CompressionMPEG/Audio Compression
MPEG/Audio Compression
 
Combating fading channels (1) (3)
Combating fading channels (1) (3)Combating fading channels (1) (3)
Combating fading channels (1) (3)
 
Equalization
EqualizationEqualization
Equalization
 
Video compression
Video compressionVideo compression
Video compression
 
DIGITAL SIGNAL PROCESSING BASED ON MATLAB
DIGITAL SIGNAL PROCESSING BASED ON MATLABDIGITAL SIGNAL PROCESSING BASED ON MATLAB
DIGITAL SIGNAL PROCESSING BASED ON MATLAB
 
Digital Signal Processing
Digital Signal Processing Digital Signal Processing
Digital Signal Processing
 
carrier synchronization
carrier synchronizationcarrier synchronization
carrier synchronization
 

Viewers also liked

MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP)
MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP) MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP)
MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP)
Electronics & Communication Staff SCU Suez Canal University
 
Introduction to Network Coding
Introduction to Network CodingIntroduction to Network Coding
Memristor overview
Memristor overviewMemristor overview
Introduction to optimization Problems
Introduction to optimization ProblemsIntroduction to optimization Problems

Viewers also liked (7)

MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP)
MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP) MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP)
MIMO Evolution: Coordinated Multi Point Transmission / reception (COMP)
 
kalman filtering "From Basics to unscented Kaman filter"
 kalman filtering "From Basics to unscented Kaman filter" kalman filtering "From Basics to unscented Kaman filter"
kalman filtering "From Basics to unscented Kaman filter"
 
Introduction to game theory
Introduction to game theoryIntroduction to game theory
Introduction to game theory
 
Introduction to Network Coding
Introduction to Network CodingIntroduction to Network Coding
Introduction to Network Coding
 
Indoor positioning
Indoor positioningIndoor positioning
Indoor positioning
 
Memristor overview
Memristor overviewMemristor overview
Memristor overview
 
Introduction to optimization Problems
Introduction to optimization ProblemsIntroduction to optimization Problems
Introduction to optimization Problems
 

Similar to Pitch and time scale modifications

IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
IRJET Journal
 
Unit-4 Pulse analog Modulation.ppt
Unit-4  Pulse analog Modulation.pptUnit-4  Pulse analog Modulation.ppt
Unit-4 Pulse analog Modulation.ppt
MATRUSRI ENGINEERING COLLEGE
 
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
IJMTST Journal
 
A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...
A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...
A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...
AM Publications
 
BEC604 -COMMUNICATION ENG II.pdf
BEC604 -COMMUNICATION ENG II.pdfBEC604 -COMMUNICATION ENG II.pdf
BEC604 -COMMUNICATION ENG II.pdf
Sunil Manjani
 
Fundamentals of coherent systems
Fundamentals of coherent systemsFundamentals of coherent systems
Fundamentals of coherent systems
farha zeba
 
An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...
An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...
An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...
ijsrd.com
 
communication concepts on sampling process
communication concepts on sampling processcommunication concepts on sampling process
communication concepts on sampling process
NatarajVijapur
 
ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI...
 ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI... ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI...
ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI...
Michael George
 
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
IJCSEA Journal
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
IJERA Editor
 
Kr2418101813
Kr2418101813Kr2418101813
Kr2418101813
IJERA Editor
 
Spatial multiplexing ofdmoqam systems with time reversal technique
Spatial multiplexing ofdmoqam systems with time reversal techniqueSpatial multiplexing ofdmoqam systems with time reversal technique
Spatial multiplexing ofdmoqam systems with time reversal technique
ijwmn
 
OFDM synchronization system using wavelet transform for symbol rate detection
OFDM synchronization system using wavelet transform for symbol rate detectionOFDM synchronization system using wavelet transform for symbol rate detection
OFDM synchronization system using wavelet transform for symbol rate detection
TELKOMNIKA JOURNAL
 
Timing synchronization F Ling_v1
Timing synchronization F Ling_v1Timing synchronization F Ling_v1
Timing synchronization F Ling_v1
Fuyun Ling
 
D0542130
D0542130D0542130
D0542130
IOSR Journals
 
A Review: Significant Research on Time And Frequency Synchronization In MIMO ...
A Review: Significant Research on Time And Frequency Synchronization In MIMO ...A Review: Significant Research on Time And Frequency Synchronization In MIMO ...
A Review: Significant Research on Time And Frequency Synchronization In MIMO ...
IJERA Editor
 
Aa04606162167
Aa04606162167Aa04606162167
Aa04606162167
IJERA Editor
 

Similar to Pitch and time scale modifications (20)

IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
 
Unit-4 Pulse analog Modulation.ppt
Unit-4  Pulse analog Modulation.pptUnit-4  Pulse analog Modulation.ppt
Unit-4 Pulse analog Modulation.ppt
 
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
 
A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...
A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...
A BER Performance Analysis of Shift Keying Technique with MMSE/MLSE estimatio...
 
BEC604 -COMMUNICATION ENG II.pdf
BEC604 -COMMUNICATION ENG II.pdfBEC604 -COMMUNICATION ENG II.pdf
BEC604 -COMMUNICATION ENG II.pdf
 
Fundamentals of coherent systems
Fundamentals of coherent systemsFundamentals of coherent systems
Fundamentals of coherent systems
 
An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...
An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...
An Adaptive Approach to Switching Coded Modulation in OFDM System Under AWGN ...
 
communication concepts on sampling process
communication concepts on sampling processcommunication concepts on sampling process
communication concepts on sampling process
 
ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI...
 ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI... ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI...
ESTIMATION OF SYMBOL TIMING AND CARRIER FREQUENCY OFFSET USING SYNCHRONIZATI...
 
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
 
Kr2418101813
Kr2418101813Kr2418101813
Kr2418101813
 
Spatial multiplexing ofdmoqam systems with time reversal technique
Spatial multiplexing ofdmoqam systems with time reversal techniqueSpatial multiplexing ofdmoqam systems with time reversal technique
Spatial multiplexing ofdmoqam systems with time reversal technique
 
OFDM synchronization system using wavelet transform for symbol rate detection
OFDM synchronization system using wavelet transform for symbol rate detectionOFDM synchronization system using wavelet transform for symbol rate detection
OFDM synchronization system using wavelet transform for symbol rate detection
 
Timing synchronization F Ling_v1
Timing synchronization F Ling_v1Timing synchronization F Ling_v1
Timing synchronization F Ling_v1
 
Asee05
Asee05Asee05
Asee05
 
D0542130
D0542130D0542130
D0542130
 
A Review: Significant Research on Time And Frequency Synchronization In MIMO ...
A Review: Significant Research on Time And Frequency Synchronization In MIMO ...A Review: Significant Research on Time And Frequency Synchronization In MIMO ...
A Review: Significant Research on Time And Frequency Synchronization In MIMO ...
 
Aa04606162167
Aa04606162167Aa04606162167
Aa04606162167
 

Recently uploaded

Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
Jayaprasanna4
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 

Recently uploaded (20)

Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
 

Pitch and time scale modifications

  • 1. Prepared by: Doaa Gamal Lecturer Assistant Faculty of Engineering – Suez Canal University 1
  • 2. Outline  Introduction  Applications  History of time and pitch modification  Time-domain techniques  Frequency-domain techniques  Parametric techniques  conclusion 2
  • 3. Introduction  Timescale modification: slow down or speed up a given signal, possibly in a time-varying manner, without altering the signal’s spectral content (and in particular its pitch when the signal is periodic).  pitch-scale modification: the aim is to modify the pitch of the signal, possibly in a time-varying manner, without altering the signal’s time-evolution (and in particular, its duration). 3
  • 4. Introduction  time-scaling or pitch-scaling is not easy because time and frequency characteristics of a signal, being related by the Fourier transform, are not independent.  the simplest method of time scaling a sound is to just replay it at a different rate. When using magnetic tapes, for example, the tape speed may be varied, but this incurs a simultaneous change in the pitch of the signal. 4
  • 5. applications  Speech Synthesizers  Post-synchronization  Data compression  Reading for the blind:  Foreign language learning  Voice transformation 5
  • 6. History of time and pitch modification Signal type method technique Analog tape recorder machine Time-domain Digital Digital tape recorder Time-domain Digital Periodicity-driven methods Time-domain Digital STFT Frequency-domain Digital Linear prediction models & sinusoidal models parametric models 6
  • 7. time and pitch modification techniques Non- parametric Frequency- domain techniques Time-domain techniques Parametric
  • 8. Time-domain techniques Pitch independent methods  requires very few calculations  very well to real-time implementation.  prone to artifacts because no precaution is taken at the splicing points, other than to guarantee continuity.
  • 9. Time-domain techniques Periodicity-driven methods The most popular method using pitch information is TD-PSOLA modification factors (between 0.5 and 2).
  • 11. TD-PSOLA analysis-synthesis process without modification The output speech waveform of PSOLA analysis-synthesis is perceptually indistinguishable from the original waveform. 4
  • 14. Computation of synthesis pitch- marks for pitch modification 14
  • 15. Computation of synthesis pitch-marks for pitch modification (raising)
  • 16. Computation of synthesis pitch- marks for duration modification 16
  • 17. Computation of synthesis pitch-marks for time-scale modification (lengthening)
  • 18. From the synthesis pitch-marks to the modified waveform The simple way is  calculate the nearest analysis pitch-mark to the virtual pitch-mark is found  The frames which corresponds to the nearest analysis pitch-marks are centered on the synthesis pitch-marks.  The overlapping regions are added together. 18
  • 19. From the synthesis pitch-marks to the modified waveform 19 • In more sophisticated systems, the mapping involves linear interpolation between the two successive short-time analysis signals lying the closest to the virtual pitch-mark  The perceptual quality of the prosody modified speech using PSOLA methods depends on the accuracy of the pitch markers estimation. As estimating epochs from speech provide more accurate pitch marker locations
  • 20. LP-PSOLA & FD-PSOLA  The Frequency-Domain PSOLA (FD-PSOLA) and the Linear-Predictive PSOLA (LP-PSOLA) approaches are theoretically more appropriate than the time-domain PSOLA method for pitch-scale modifications because they provide independent control over the spectral envelope of the synthesis signal.
  • 21. Frequency-domain techniques  Frequency-domain algorithms operate with a short- time spectrum of the signal (phase-vocoder) 1. Calculate shift-time Fourier transform (STFT) of a signal 2. Modify phases of each frequency channel. 3. Synthesize a signal using inverse STFT with a different time stride 21
  • 22. Parametric techniques  linear prediction models  sinusoidal models  the Harmonic plus Noise Model, HNM  wideband models  STRAIGHT
  • 23. conclusion  Time-domain approaches are computationally cheap and perform good for small modification factors.  Good for real-time implementations  possible to incorporate such systems in consumer products such as telephone answering systems.  suffering from echos.  In particular, time or pitch-scale modifications by large factors cannot be carried out by time-domain methods and usually require the use of the more elaborate frequency-domain techniques. 23
  • 24. conclusion  Frequency-domain techniques are capable of providing very high quality output. However, they still suffer from some distortion, mainly due to the effects of “phase dispersion.”  computationally intensive. 24
  • 25. conclusion  Parametric techniques tend to outperform non- parametric methods when the adequation between the signal to be modified and the underlying model is good. When this is not the case however, the methods break down and the results are unreliable.  Parametric techniques usually are more costly in terms of computations, because they require an explicit preliminary analysis stage for the estimation of the model parameters. 25