SlideShare a Scribd company logo
1 of 30
Download to read offline
Introduction to Audio Content Analysis
module 7.3.3: fundamental frequency detection in monophonic signals
alexander lerch
overview intro mono f0 time domain frequency domain summary
introduction
overview
corresponding textbook section
section 7.3.3
lecture content
• established approaches to monophonic pitch tracking in
▶ time domain
▶ frequency domain
learning objectives
• define the task of monophonic pitch tracking
• summarize the principles of time-domain ˆ
f0-trackers and describe one approach in
detail
• summarize the principles of frequency-domain ˆ
f0-trackers and describe one approach
in detail
module 7.3.3: fundamental frequency detection in monophonic signals 1 / 15
overview intro mono f0 time domain frequency domain summary
introduction
overview
corresponding textbook section
section 7.3.3
lecture content
• established approaches to monophonic pitch tracking in
▶ time domain
▶ frequency domain
learning objectives
• define the task of monophonic pitch tracking
• summarize the principles of time-domain ˆ
f0-trackers and describe one approach in
detail
• summarize the principles of frequency-domain ˆ
f0-trackers and describe one approach
in detail
module 7.3.3: fundamental frequency detection in monophonic signals 1 / 15
overview intro mono f0 time domain frequency domain summary
fundamental frequency
introduction
remember
Fourier series: every (quasi-)periodic sound is a combination of sinusoidals with integer
frequency ratios
x(t) ≈ x(t + T̂0)
x(t) ≈
∞
X
k=−∞
a(k)ejω0kt
ˆ
f0: musically, perceptually most “relevant” frequency
module 7.3.3: fundamental frequency detection in monophonic signals 2 / 15
overview intro mono f0 time domain frequency domain summary
fundamental frequency
introduction
remember
Fourier series: every (quasi-)periodic sound is a combination of sinusoidals with integer
frequency ratios
x(t) ≈ x(t + T̂0)
x(t) ≈
∞
X
k=−∞
a(k)ejω0kt
ˆ
f0: musically, perceptually most “relevant” frequency
module 7.3.3: fundamental frequency detection in monophonic signals 2 / 15
overview intro mono f0 time domain frequency domain summary
pitch detection
task definition
detect the fundamental frequency ˆ
f0
monophonic: only one fundamental frequency at a time
related subtasks:
• voice activity: detect when there is no voice/no fundamental frequency
• note segmentation
▶ note start time and stop time
▶ average note frequency
▶ average note velocity
▶ vibrato detection
• frequency to pitch mapping
module 7.3.3: fundamental frequency detection in monophonic signals 3 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
zero crossing rate
ZCR per block (bad)
T̂0(n) =
2 · ie(n) − is(n)

fS ·
ie(n)
P
i=is(n)
|sign [x(i)] − sign [x(i − 1)]|
average period length
T̂0(n) =
2
Z − 1
Z−2
X
j=0
∆tZC(j).
variants:
• create distance histogram
and choose maximum
• also use distances between
local extrema
module 7.3.3: fundamental frequency detection in monophonic signals 4 / 15
matlab
source:
plotF0Zcr.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
zero crossing rate
ZCR per block (bad)
T̂0(n) =
2 · ie(n) − is(n)

fS ·
ie(n)
P
i=is(n)
|sign [x(i)] − sign [x(i − 1)]|
average period length
T̂0(n) =
2
Z − 1
Z−2
X
j=0
∆tZC(j).
variants:
• create distance histogram
and choose maximum
• also use distances between
local extrema
module 7.3.3: fundamental frequency detection in monophonic signals 4 / 15
matlab
source:
plotF0Zcr.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
zero crossing rate
ZCR per block (bad)
T̂0(n) =
2 · ie(n) − is(n)

fS ·
ie(n)
P
i=is(n)
|sign [x(i)] − sign [x(i − 1)]|
average period length
T̂0(n) =
2
Z − 1
Z−2
X
j=0
∆tZC(j).
variants:
• create distance histogram
and choose maximum
• also use distances between
local extrema
module 7.3.3: fundamental frequency detection in monophonic signals 4 / 15
matlab
source:
plotF0Zcr.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auto correlation function
find lag of ACF maximum
rxx (η, n) =
ie(n)−η
X
i=is(n)
x(i) · x(i + η)
T̂0(n) = argmax rxx (η, n)

variants:
• center clipping
x
χ(x)
−cL +cL
x
χ′
(x)
−cL +cL
−1
+1
module 7.3.3: fundamental frequency detection in monophonic signals 5 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auto correlation function
find lag of ACF maximum
rxx (η, n) =
ie(n)−η
X
i=is(n)
x(i) · x(i + η)
T̂0(n) = argmax rxx (η, n)

variants:
• center clipping
x
χ(x)
−cL +cL
x
χ′
(x)
−cL +cL
−1
+1
module 7.3.3: fundamental frequency detection in monophonic signals 5 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auto correlation function
module 7.3.3: fundamental frequency detection in monophonic signals 5 / 15
matlab
source:
plotF0Acf.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
average magnitude difference function
find lag of AMDF minimum
AMDFxx (η, n) =
1
ie(n) − is(n) + 1
ie(n)−η
X
i=is(n)
|x(i) − x(i + η)|
variants:
• AMDF-weighted ACF
r′
xx (η, n) =
rxx (η, n)
AMDFxx (η, n) + 1
module 7.3.3: fundamental frequency detection in monophonic signals 6 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
average magnitude difference function
find lag of AMDF minimum
AMDFxx (η, n) =
1
ie(n) − is(n) + 1
ie(n)−η
X
i=is(n)
|x(i) − x(i + η)|
variants:
• AMDF-weighted ACF
r′
xx (η, n) =
rxx (η, n)
AMDFxx (η, n) + 1
module 7.3.3: fundamental frequency detection in monophonic signals 6 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
average magnitude difference function
module 7.3.3: fundamental frequency detection in monophonic signals 6 / 15
matlab
source:
plotF0Amdf.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
harmonic product spectrum 1/2
XHPS(k, n) =
O
Y
j=1
|X(j · k, n)|2
ˆ
f0(n) = argmax XHPS(k, n)

first published in the 1960s by Noll
1
A. M. Noll, “Pitch Determination of Human Speech by the Harmonic Product Spectrum, the Harmonic Sum Spectrum, and a Maximum
Likelihood Estimate,” in Proceedings of the Symposium on Computer Processing in Communications, vol. 19, Brooklyn: Polytechnic Press of the
University of Brooklyn, 1969, pp. 779–797.
module 7.3.3: fundamental frequency detection in monophonic signals 7 / 15
matlab
source:
plotF0HpsMethod.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
harmonic product spectrum 2/2
module 7.3.3: fundamental frequency detection in monophonic signals 8 / 15
matlab
source:
plotF0Hps.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
harmonic sum spectrum
sum instead product sum
XHSS(k, n) =
O
X
j=1
|X(j · k, n)|2
• advantage
▶ robust against missing harmonics
• disadvantage
▶ less pronounced peak
module 7.3.3: fundamental frequency detection in monophonic signals 9 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
ACF of magnitude spectrum
rXX (η, n) =
K/2−1
X
k=−K/2
|X(k, n)| · |X(k + η, n)|
⇒ detect maximum location
module 7.3.3: fundamental frequency detection in monophonic signals 10 / 15
matlab
source:
plotF0AcfOfFft.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
cepstral pitch detection
1 compute cepstrum
2 detect periodicities
module 7.3.3: fundamental frequency detection in monophonic signals 11 / 15
matlab
source:
plotF0Cepstrum.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
spectral maximum likelihood
create template matrix with (smoothed) delta pulses for all possible frequencies
compute the cross correlation (lag = 0) between spectrum and all templates
pick the result with the highest correlation ⇒ frequency estimate
module 7.3.3: fundamental frequency detection in monophonic signals 12 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
spectral maximum likelihood
create template matrix with (smoothed) delta pulses for all possible frequencies
compute the cross correlation (lag = 0) between spectrum and all templates
pick the result with the highest correlation ⇒ frequency estimate
module 7.3.3: fundamental frequency detection in monophonic signals 12 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
spectral maximum likelihood
create template matrix with (smoothed) delta pulses for all possible frequencies
compute the cross correlation (lag = 0) between spectrum and all templates
pick the result with the highest correlation ⇒ frequency estimate
module 7.3.3: fundamental frequency detection in monophonic signals 12 / 15
matlab
source:
plotF0Template.m
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auditory-motivated pitch tracking 1/2
1 filterbank of band pass filters (e.g., mel scale)
2 HWR
3 smoothing
4 within-band periodicity estimate (e.g. ACF)
5 combination of bands
module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auditory-motivated pitch tracking 1/2
1 filterbank of band pass filters (e.g., mel scale)
2 HWR
3 smoothing
4 within-band periodicity estimate (e.g. ACF)
5 combination of bands
module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auditory-motivated pitch tracking 1/2
1 filterbank of band pass filters (e.g., mel scale)
2 HWR
3 smoothing
4 within-band periodicity estimate (e.g. ACF)
5 combination of bands
module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auditory-motivated pitch tracking 1/2
1 filterbank of band pass filters (e.g., mel scale)
2 HWR
3 smoothing
4 within-band periodicity estimate (e.g. ACF)
5 combination of bands
module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auditory-motivated pitch tracking 1/2
1 filterbank of band pass filters (e.g., mel scale)
2 HWR
3 smoothing
4 within-band periodicity estimate (e.g. ACF)
5 combination of bands
module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
overview intro mono f0 time domain frequency domain summary
monophonic fundamental frequency detection
auditory-motivated pitch tracking 2/2
1 filterbank output
2 half wave
rectification
3 smoothed output
module 7.3.3: fundamental frequency detection in monophonic signals 14 / 15
matlab
source:
plotF0Auditory.m
overview intro mono f0 time domain frequency domain summary
summary
lecture content
basic commonality
• all approaches look for periodicity
▶ waveform similarity in time domain
▶ equidistant harmonics/peaks in freq domain
state-of-the-art
• despite the age of the presented methods, tweaked versions of the presented
approaches are still often considered state-of-the-art
• combinations of different approaches can be robust
module 7.3.3: fundamental frequency detection in monophonic signals 15 / 15

More Related Content

Similar to 07-03-03-ACA-Tonal-Monof0.pdf

An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...Editor IJCATR
 
Feature Extraction of Musical Instrument Tones using FFT and Segment Averaging
Feature Extraction of Musical Instrument Tones using FFT and Segment AveragingFeature Extraction of Musical Instrument Tones using FFT and Segment Averaging
Feature Extraction of Musical Instrument Tones using FFT and Segment AveragingTELKOMNIKA JOURNAL
 
03-01-ACA-Input-Signals.pdf
03-01-ACA-Input-Signals.pdf03-01-ACA-Input-Signals.pdf
03-01-ACA-Input-Signals.pdfAlexanderLerch4
 
Dereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domainsDereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domainsTakuya Yoshioka
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdfAlexanderLerch4
 
Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...
Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...
Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...CSCJournals
 
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...Rohde & Schwarz North America
 
Single Electron Spin Detection Slides For Uno Interview
Single Electron Spin Detection Slides For Uno InterviewSingle Electron Spin Detection Slides For Uno Interview
Single Electron Spin Detection Slides For Uno Interviewchenhm
 
Radar 2009 a 11 waveforms and pulse compression
Radar 2009 a 11 waveforms and pulse compressionRadar 2009 a 11 waveforms and pulse compression
Radar 2009 a 11 waveforms and pulse compressionForward2025
 
(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...
(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...
(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...François Rigaud
 
Introduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detectionIntroduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detectionNAVER Engineering
 
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time SignalsDSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time SignalsAmr E. Mohamed
 
Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...
Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...
Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...CSCJournals
 
DSP_2018_FOEHU - Lec 06 - FIR Filter Design
DSP_2018_FOEHU - Lec 06 - FIR Filter DesignDSP_2018_FOEHU - Lec 06 - FIR Filter Design
DSP_2018_FOEHU - Lec 06 - FIR Filter DesignAmr E. Mohamed
 
Frequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral componentsFrequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral componentsCSCJournals
 
D ESIGN A ND I MPLEMENTATION OF D IGITAL F ILTER B ANK T O R EDUCE N O...
D ESIGN  A ND  I MPLEMENTATION OF  D IGITAL F ILTER  B ANK  T O  R EDUCE  N O...D ESIGN  A ND  I MPLEMENTATION OF  D IGITAL F ILTER  B ANK  T O  R EDUCE  N O...
D ESIGN A ND I MPLEMENTATION OF D IGITAL F ILTER B ANK T O R EDUCE N O...sipij
 
Acquisition of Long Pseudo Code in Dsss Signal
Acquisition of Long Pseudo Code in Dsss SignalAcquisition of Long Pseudo Code in Dsss Signal
Acquisition of Long Pseudo Code in Dsss SignalIJMER
 

Similar to 07-03-03-ACA-Tonal-Monof0.pdf (20)

An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
 
Feature Extraction of Musical Instrument Tones using FFT and Segment Averaging
Feature Extraction of Musical Instrument Tones using FFT and Segment AveragingFeature Extraction of Musical Instrument Tones using FFT and Segment Averaging
Feature Extraction of Musical Instrument Tones using FFT and Segment Averaging
 
03-01-ACA-Input-Signals.pdf
03-01-ACA-Input-Signals.pdf03-01-ACA-Input-Signals.pdf
03-01-ACA-Input-Signals.pdf
 
Dereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domainsDereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domains
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf
 
Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...
Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...
Construction of The Sampled Signal Up To Any Frequency While Keeping The Samp...
 
FFT
FFTFFT
FFT
 
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
 
Unit-4 Pulse analog Modulation.ppt
Unit-4  Pulse analog Modulation.pptUnit-4  Pulse analog Modulation.ppt
Unit-4 Pulse analog Modulation.ppt
 
Single Electron Spin Detection Slides For Uno Interview
Single Electron Spin Detection Slides For Uno InterviewSingle Electron Spin Detection Slides For Uno Interview
Single Electron Spin Detection Slides For Uno Interview
 
Radar 2009 a 11 waveforms and pulse compression
Radar 2009 a 11 waveforms and pulse compressionRadar 2009 a 11 waveforms and pulse compression
Radar 2009 a 11 waveforms and pulse compression
 
(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...
(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...
(2012) Rigaud, Falaize, David, Daudet - Does Inharmonicity Improve an NMF-Bas...
 
Introduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detectionIntroduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detection
 
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time SignalsDSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
DSP_2018_FOEHU - Lec 02 - Sampling of Continuous Time Signals
 
Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...
Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...
Frequency and Power Estimator for Digital Receivers in Doppler Shift Environm...
 
Asee gsw 2000
Asee gsw 2000Asee gsw 2000
Asee gsw 2000
 
DSP_2018_FOEHU - Lec 06 - FIR Filter Design
DSP_2018_FOEHU - Lec 06 - FIR Filter DesignDSP_2018_FOEHU - Lec 06 - FIR Filter Design
DSP_2018_FOEHU - Lec 06 - FIR Filter Design
 
Frequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral componentsFrequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral components
 
D ESIGN A ND I MPLEMENTATION OF D IGITAL F ILTER B ANK T O R EDUCE N O...
D ESIGN  A ND  I MPLEMENTATION OF  D IGITAL F ILTER  B ANK  T O  R EDUCE  N O...D ESIGN  A ND  I MPLEMENTATION OF  D IGITAL F ILTER  B ANK  T O  R EDUCE  N O...
D ESIGN A ND I MPLEMENTATION OF D IGITAL F ILTER B ANK T O R EDUCE N O...
 
Acquisition of Long Pseudo Code in Dsss Signal
Acquisition of Long Pseudo Code in Dsss SignalAcquisition of Long Pseudo Code in Dsss Signal
Acquisition of Long Pseudo Code in Dsss Signal
 

More from AlexanderLerch4

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdfAlexanderLerch4
 
09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdfAlexanderLerch4
 
09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdfAlexanderLerch4
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdfAlexanderLerch4
 
15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdfAlexanderLerch4
 
12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdfAlexanderLerch4
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdfAlexanderLerch4
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdfAlexanderLerch4
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdfAlexanderLerch4
 
03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdfAlexanderLerch4
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdfAlexanderLerch4
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdfAlexanderLerch4
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdfAlexanderLerch4
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdfAlexanderLerch4
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdfAlexanderLerch4
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdfAlexanderLerch4
 

More from AlexanderLerch4 (20)

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf
 
12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf
 
10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf
 
13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf
 
09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf
 
09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf09-01-ACA-Temporal-Intro.pdf
09-01-ACA-Temporal-Intro.pdf
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf
 
15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf
 
12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf
 
08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf
 
03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf
 

Recently uploaded

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

07-03-03-ACA-Tonal-Monof0.pdf

  • 1. Introduction to Audio Content Analysis module 7.3.3: fundamental frequency detection in monophonic signals alexander lerch
  • 2. overview intro mono f0 time domain frequency domain summary introduction overview corresponding textbook section section 7.3.3 lecture content • established approaches to monophonic pitch tracking in ▶ time domain ▶ frequency domain learning objectives • define the task of monophonic pitch tracking • summarize the principles of time-domain ˆ f0-trackers and describe one approach in detail • summarize the principles of frequency-domain ˆ f0-trackers and describe one approach in detail module 7.3.3: fundamental frequency detection in monophonic signals 1 / 15
  • 3. overview intro mono f0 time domain frequency domain summary introduction overview corresponding textbook section section 7.3.3 lecture content • established approaches to monophonic pitch tracking in ▶ time domain ▶ frequency domain learning objectives • define the task of monophonic pitch tracking • summarize the principles of time-domain ˆ f0-trackers and describe one approach in detail • summarize the principles of frequency-domain ˆ f0-trackers and describe one approach in detail module 7.3.3: fundamental frequency detection in monophonic signals 1 / 15
  • 4. overview intro mono f0 time domain frequency domain summary fundamental frequency introduction remember Fourier series: every (quasi-)periodic sound is a combination of sinusoidals with integer frequency ratios x(t) ≈ x(t + T̂0) x(t) ≈ ∞ X k=−∞ a(k)ejω0kt ˆ f0: musically, perceptually most “relevant” frequency module 7.3.3: fundamental frequency detection in monophonic signals 2 / 15
  • 5. overview intro mono f0 time domain frequency domain summary fundamental frequency introduction remember Fourier series: every (quasi-)periodic sound is a combination of sinusoidals with integer frequency ratios x(t) ≈ x(t + T̂0) x(t) ≈ ∞ X k=−∞ a(k)ejω0kt ˆ f0: musically, perceptually most “relevant” frequency module 7.3.3: fundamental frequency detection in monophonic signals 2 / 15
  • 6. overview intro mono f0 time domain frequency domain summary pitch detection task definition detect the fundamental frequency ˆ f0 monophonic: only one fundamental frequency at a time related subtasks: • voice activity: detect when there is no voice/no fundamental frequency • note segmentation ▶ note start time and stop time ▶ average note frequency ▶ average note velocity ▶ vibrato detection • frequency to pitch mapping module 7.3.3: fundamental frequency detection in monophonic signals 3 / 15
  • 7. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection zero crossing rate ZCR per block (bad) T̂0(n) = 2 · ie(n) − is(n) fS · ie(n) P i=is(n) |sign [x(i)] − sign [x(i − 1)]| average period length T̂0(n) = 2 Z − 1 Z−2 X j=0 ∆tZC(j). variants: • create distance histogram and choose maximum • also use distances between local extrema module 7.3.3: fundamental frequency detection in monophonic signals 4 / 15 matlab source: plotF0Zcr.m
  • 8. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection zero crossing rate ZCR per block (bad) T̂0(n) = 2 · ie(n) − is(n) fS · ie(n) P i=is(n) |sign [x(i)] − sign [x(i − 1)]| average period length T̂0(n) = 2 Z − 1 Z−2 X j=0 ∆tZC(j). variants: • create distance histogram and choose maximum • also use distances between local extrema module 7.3.3: fundamental frequency detection in monophonic signals 4 / 15 matlab source: plotF0Zcr.m
  • 9. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection zero crossing rate ZCR per block (bad) T̂0(n) = 2 · ie(n) − is(n) fS · ie(n) P i=is(n) |sign [x(i)] − sign [x(i − 1)]| average period length T̂0(n) = 2 Z − 1 Z−2 X j=0 ∆tZC(j). variants: • create distance histogram and choose maximum • also use distances between local extrema module 7.3.3: fundamental frequency detection in monophonic signals 4 / 15 matlab source: plotF0Zcr.m
  • 10. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auto correlation function find lag of ACF maximum rxx (η, n) = ie(n)−η X i=is(n) x(i) · x(i + η) T̂0(n) = argmax rxx (η, n) variants: • center clipping x χ(x) −cL +cL x χ′ (x) −cL +cL −1 +1 module 7.3.3: fundamental frequency detection in monophonic signals 5 / 15
  • 11. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auto correlation function find lag of ACF maximum rxx (η, n) = ie(n)−η X i=is(n) x(i) · x(i + η) T̂0(n) = argmax rxx (η, n) variants: • center clipping x χ(x) −cL +cL x χ′ (x) −cL +cL −1 +1 module 7.3.3: fundamental frequency detection in monophonic signals 5 / 15
  • 12. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auto correlation function module 7.3.3: fundamental frequency detection in monophonic signals 5 / 15 matlab source: plotF0Acf.m
  • 13. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection average magnitude difference function find lag of AMDF minimum AMDFxx (η, n) = 1 ie(n) − is(n) + 1 ie(n)−η X i=is(n) |x(i) − x(i + η)| variants: • AMDF-weighted ACF r′ xx (η, n) = rxx (η, n) AMDFxx (η, n) + 1 module 7.3.3: fundamental frequency detection in monophonic signals 6 / 15
  • 14. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection average magnitude difference function find lag of AMDF minimum AMDFxx (η, n) = 1 ie(n) − is(n) + 1 ie(n)−η X i=is(n) |x(i) − x(i + η)| variants: • AMDF-weighted ACF r′ xx (η, n) = rxx (η, n) AMDFxx (η, n) + 1 module 7.3.3: fundamental frequency detection in monophonic signals 6 / 15
  • 15. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection average magnitude difference function module 7.3.3: fundamental frequency detection in monophonic signals 6 / 15 matlab source: plotF0Amdf.m
  • 16. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection harmonic product spectrum 1/2 XHPS(k, n) = O Y j=1 |X(j · k, n)|2 ˆ f0(n) = argmax XHPS(k, n) first published in the 1960s by Noll 1 A. M. Noll, “Pitch Determination of Human Speech by the Harmonic Product Spectrum, the Harmonic Sum Spectrum, and a Maximum Likelihood Estimate,” in Proceedings of the Symposium on Computer Processing in Communications, vol. 19, Brooklyn: Polytechnic Press of the University of Brooklyn, 1969, pp. 779–797. module 7.3.3: fundamental frequency detection in monophonic signals 7 / 15 matlab source: plotF0HpsMethod.m
  • 17. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection harmonic product spectrum 2/2 module 7.3.3: fundamental frequency detection in monophonic signals 8 / 15 matlab source: plotF0Hps.m
  • 18. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection harmonic sum spectrum sum instead product sum XHSS(k, n) = O X j=1 |X(j · k, n)|2 • advantage ▶ robust against missing harmonics • disadvantage ▶ less pronounced peak module 7.3.3: fundamental frequency detection in monophonic signals 9 / 15
  • 19. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection ACF of magnitude spectrum rXX (η, n) = K/2−1 X k=−K/2 |X(k, n)| · |X(k + η, n)| ⇒ detect maximum location module 7.3.3: fundamental frequency detection in monophonic signals 10 / 15 matlab source: plotF0AcfOfFft.m
  • 20. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection cepstral pitch detection 1 compute cepstrum 2 detect periodicities module 7.3.3: fundamental frequency detection in monophonic signals 11 / 15 matlab source: plotF0Cepstrum.m
  • 21. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection spectral maximum likelihood create template matrix with (smoothed) delta pulses for all possible frequencies compute the cross correlation (lag = 0) between spectrum and all templates pick the result with the highest correlation ⇒ frequency estimate module 7.3.3: fundamental frequency detection in monophonic signals 12 / 15
  • 22. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection spectral maximum likelihood create template matrix with (smoothed) delta pulses for all possible frequencies compute the cross correlation (lag = 0) between spectrum and all templates pick the result with the highest correlation ⇒ frequency estimate module 7.3.3: fundamental frequency detection in monophonic signals 12 / 15
  • 23. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection spectral maximum likelihood create template matrix with (smoothed) delta pulses for all possible frequencies compute the cross correlation (lag = 0) between spectrum and all templates pick the result with the highest correlation ⇒ frequency estimate module 7.3.3: fundamental frequency detection in monophonic signals 12 / 15 matlab source: plotF0Template.m
  • 24. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auditory-motivated pitch tracking 1/2 1 filterbank of band pass filters (e.g., mel scale) 2 HWR 3 smoothing 4 within-band periodicity estimate (e.g. ACF) 5 combination of bands module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
  • 25. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auditory-motivated pitch tracking 1/2 1 filterbank of band pass filters (e.g., mel scale) 2 HWR 3 smoothing 4 within-band periodicity estimate (e.g. ACF) 5 combination of bands module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
  • 26. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auditory-motivated pitch tracking 1/2 1 filterbank of band pass filters (e.g., mel scale) 2 HWR 3 smoothing 4 within-band periodicity estimate (e.g. ACF) 5 combination of bands module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
  • 27. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auditory-motivated pitch tracking 1/2 1 filterbank of band pass filters (e.g., mel scale) 2 HWR 3 smoothing 4 within-band periodicity estimate (e.g. ACF) 5 combination of bands module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
  • 28. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auditory-motivated pitch tracking 1/2 1 filterbank of band pass filters (e.g., mel scale) 2 HWR 3 smoothing 4 within-band periodicity estimate (e.g. ACF) 5 combination of bands module 7.3.3: fundamental frequency detection in monophonic signals 13 / 15
  • 29. overview intro mono f0 time domain frequency domain summary monophonic fundamental frequency detection auditory-motivated pitch tracking 2/2 1 filterbank output 2 half wave rectification 3 smoothed output module 7.3.3: fundamental frequency detection in monophonic signals 14 / 15 matlab source: plotF0Auditory.m
  • 30. overview intro mono f0 time domain frequency domain summary summary lecture content basic commonality • all approaches look for periodicity ▶ waveform similarity in time domain ▶ equidistant harmonics/peaks in freq domain state-of-the-art • despite the age of the presented methods, tweaked versions of the presented approaches are still often considered state-of-the-art • combinations of different approaches can be robust module 7.3.3: fundamental frequency detection in monophonic signals 15 / 15