SlideShare a Scribd company logo
Introduction to Audio Content Analysis
module 7.1: human perception of pitch
alexander lerch
overview intro perception summary
introduction
overview
corresponding textbook section
section 7.1
lecture content
• pitch as perceptual phenomenon
• non-linear relation of frequency and pitch
• frequency content of a simple pitched sound
• dimensions of pitch perception
learning objectives
• describe basic properties of models for pitch
• explain the two dimensions of pitch perception
module 7.1: human perception of pitch 1 / 8
overview intro perception summary
introduction
overview
corresponding textbook section
section 7.1
lecture content
• pitch as perceptual phenomenon
• non-linear relation of frequency and pitch
• frequency content of a simple pitched sound
• dimensions of pitch perception
learning objectives
• describe basic properties of models for pitch
• explain the two dimensions of pitch perception
module 7.1: human perception of pitch 1 / 8
overview intro perception summary
tonal analysis
introduction
pitch & pitch-based properties belong to the most important parameters
describing music
• melody
• harmony
• tonality
• tuning & intonation
related ACA tasks
• fundamental frequency detection
• key detection
• chord detection
• tuning frequency & temperament estimation
module 7.1: human perception of pitch 2 / 8
overview intro perception summary
tonal analysis
introduction
pitch & pitch-based properties belong to the most important parameters
describing music
• melody
• harmony
• tonality
• tuning & intonation
related ACA tasks
• fundamental frequency detection
• key detection
• chord detection
• tuning frequency & temperament estimation
module 7.1: human perception of pitch 2 / 8
overview intro perception summary
pitch perception
pitch definition
definition (American Standards Association)
pitch is that attribute of auditory sensation in terms of which sounds may be ordered
on a musical scale1
temporal variations in pitch give rise to a sense of melody
closely related to frequency, but subjective
⇒ assigning a pitch value to a sound means specifying the frequency of a pure
tone having the same subjective pitch as the sound
1
ASA, “Acoustical Terminology,” American Standards Association (ASA), Standard, 1960.
module 7.1: human perception of pitch 3 / 8
overview intro perception summary
pitch perception
pitch definition
definition (American Standards Association)
pitch is that attribute of auditory sensation in terms of which sounds may be ordered
on a musical scale1
temporal variations in pitch give rise to a sense of melody
closely related to frequency, but subjective
⇒ assigning a pitch value to a sound means specifying the frequency of a pure
tone having the same subjective pitch as the sound
1
ASA, “Acoustical Terminology,” American Standards Association (ASA), Standard, 1960.
module 7.1: human perception of pitch 3 / 8
overview intro perception summary
pitch perception
pitch definition
definition (American Standards Association)
pitch is that attribute of auditory sensation in terms of which sounds may be ordered
on a musical scale1
temporal variations in pitch give rise to a sense of melody
closely related to frequency, but subjective
⇒ assigning a pitch value to a sound means specifying the frequency of a pure
tone having the same subjective pitch as the sound
1
ASA, “Acoustical Terminology,” American Standards Association (ASA), Standard, 1960.
module 7.1: human perception of pitch 3 / 8
overview intro perception summary
pitch perception
fundamental frequency
fundamental frequency is relevant for pitch perception (f0, 2f0, 3f0, . . .)
higher fundamental frequency ⇒ higher pitch (mono-dimensional)
module 7.1: human perception of pitch 4 / 8
matlab
source:
plotHarmonics.m
overview intro perception summary
pitch perception
missing fundamental
basilar membrane location does not explain the pitch perception of complex
tones
⇒ virtual pitch, residue pitch
example 1: missing fundamental
• f0 = 120 Hz, 33 harmonics, with(out) bandpass 300-2400 Hz
example 2: missing fundamental
• speech f0 ≈ 100 Hz, with(out) bandpass 300-4000 Hz
module 7.1: human perception of pitch 5 / 8
overview intro perception summary
pitch perception
missing fundamental
basilar membrane location does not explain the pitch perception of complex
tones
⇒ virtual pitch, residue pitch
example 1: missing fundamental
• f0 = 120 Hz, 33 harmonics, with(out) bandpass 300-2400 Hz
example 2: missing fundamental
• speech f0 ≈ 100 Hz, with(out) bandpass 300-4000 Hz
module 7.1: human perception of pitch 5 / 8
overview intro perception summary
pitch perception
missing fundamental
basilar membrane location does not explain the pitch perception of complex
tones
⇒ virtual pitch, residue pitch
example 1: missing fundamental
• f0 = 120 Hz, 33 harmonics, with(out) bandpass 300-2400 Hz
example 2: missing fundamental
• speech f0 ≈ 100 Hz, with(out) bandpass 300-4000 Hz
module 7.1: human perception of pitch 5 / 8
overview intro perception summary
pitch perception
frequency & pitch
non-linear pitch frequency relation:
perceptual pitch distance ̸= frequency distance
⇒ models for psycho-acoustic/physiological data
• Mel scale (equal pitch distance)
• Bark scale (critical band width)
• physiological frequency location (basilar membrane)
module 7.1: human perception of pitch 6 / 8
matlab
source:
plotMelBark.m
overview intro perception summary
pitch perception
frequency & pitch
Fant: mF(f ) = 1000 · log2 1 + f
1000 Hz

O’Shaughnessy: mS(f ) = 2595 · log10 1 + f
700 Hz

mS(f ) = 1127 · log 1 + f
700 Hz

module 7.1: human perception of pitch 6 / 8
matlab
source:
plotMelBark.m
overview intro perception summary
pitch perception
frequency  pitch
Fant: mF(f ) = 1000 · log2 1 + f
1000 Hz

O’Shaughnessy: mS(f ) = 2595 · log10 1 + f
700 Hz

mS(f ) = 1127 · log 1 + f
700 Hz

module 7.1: human perception of pitch 6 / 8
matlab
source:
plotMelBark.m
overview intro perception summary
pitch perception
frequency  pitch
Schröder: zS(f ) = 7 · arcsinh f
650 Hz

Terhardt: zT(f ) = 13.3 · arctan 0.75 · f
1000 Hz

Zwicker: zZ (f ) = 13 · atan 0.76 · f
1000 Hz

+ 3.5 · atan f
7500 Hz

module 7.1: human perception of pitch 6 / 8
matlab
source:
plotMelBark.m
overview intro perception summary
pitch perception
frequency  pitch
ERB: e(f ) = 9.26 log 1 + f
228.7

Cochlear Map: x(f ) = 1
2.1 log10
f
165.4 + 1

module 7.1: human perception of pitch 6 / 8
matlab
source:
plotMelBark.m
overview intro perception summary
pitch perception
pitch dimensions
2 dimensions of musical pitch
tone height: monotonic relationship to frequency (increasing frequency ⇒
increasing pitch)
tone chroma: two tones separated by octave sound similar (same pitch class)
module 7.1: human perception of pitch 7 / 8
overview intro perception summary
pitch perception
pitch dimensions
2 dimensions of musical pitch
tone height: monotonic relationship to frequency (increasing frequency ⇒
increasing pitch)
tone chroma: two tones separated by octave sound similar (same pitch class)
module 7.1: human perception of pitch 7 / 8
matlab
source:
plotPitchHelix.m
overview intro perception summary
summary
lecture content
pitch
• subjective phenomenon
• non-linear monotonic relationship to frequency (tone height increases with
fundamental frequency)
• pitch grouping based on powers of two: tone chroma perception
module 7.1: human perception of pitch 8 / 8

More Related Content

Similar to 07-01-ACA-Tonal-Perception.pdf

6 loudness
6 loudness6 loudness
6 loudness
vishal48
 
seminar slides
 seminar slides  seminar slides
seminar slides
Sindhu Csprasanth
 
Build Your Own VR Display Course - SIGGRAPH 2017: Part 4
Build Your Own VR Display Course - SIGGRAPH 2017: Part 4Build Your Own VR Display Course - SIGGRAPH 2017: Part 4
Build Your Own VR Display Course - SIGGRAPH 2017: Part 4
StanfordComputationalImaging
 
207 intro lecture2010
207 intro lecture2010207 intro lecture2010
207 intro lecture2010
guest1db8cc6
 
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive InstrumentsA Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
IJMTST Journal
 
Ft and FFT
Ft and FFTFt and FFT
Unit 9 Summary 2009
Unit 9 Summary 2009Unit 9 Summary 2009
Unit 9 Summary 2009
mjgvalcarce
 
Lec 2
Lec 2Lec 2
Lec 2
quchani
 
Paper on Modeling and Subtractive synthesis of virtual violin
Paper on Modeling and Subtractive synthesis of virtual violinPaper on Modeling and Subtractive synthesis of virtual violin
Paper on Modeling and Subtractive synthesis of virtual violin
Vinoth Punniyamoorthy
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
CSCJournals
 
7 PITCH AND TIMBRE.ppt
7 PITCH AND TIMBRE.ppt7 PITCH AND TIMBRE.ppt
7 PITCH AND TIMBRE.ppt
JocenAblona
 
Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...
Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...
Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...
Lushanthan Sivaneasharajah
 
Characterization of sound
Characterization of soundCharacterization of sound
Characterization of sound
R.Narasimha Swamy
 
The Overtone Spectrum
The Overtone SpectrumThe Overtone Spectrum
The Overtone Spectrum
Alexis Baskind
 
Introduction to microphones
Introduction to microphonesIntroduction to microphones
Introduction to microphones
DJShirlee
 
My Presentation @Ryerson University
My Presentation @Ryerson UniversityMy Presentation @Ryerson University
My Presentation @Ryerson University
Jagdish Bhatt
 
The_impact_of_sound_control_room_acoustics_on_the_.pdf
The_impact_of_sound_control_room_acoustics_on_the_.pdfThe_impact_of_sound_control_room_acoustics_on_the_.pdf
The_impact_of_sound_control_room_acoustics_on_the_.pdf
EricSifuna1
 
Spectrograms
SpectrogramsSpectrograms
Spectrograms
ola khaza'leh
 
Phase noise
Phase noisePhase noise
Phase noise
Surendra Bachina
 
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
kevig
 

Similar to 07-01-ACA-Tonal-Perception.pdf (20)

6 loudness
6 loudness6 loudness
6 loudness
 
seminar slides
 seminar slides  seminar slides
seminar slides
 
Build Your Own VR Display Course - SIGGRAPH 2017: Part 4
Build Your Own VR Display Course - SIGGRAPH 2017: Part 4Build Your Own VR Display Course - SIGGRAPH 2017: Part 4
Build Your Own VR Display Course - SIGGRAPH 2017: Part 4
 
207 intro lecture2010
207 intro lecture2010207 intro lecture2010
207 intro lecture2010
 
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive InstrumentsA Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
 
Ft and FFT
Ft and FFTFt and FFT
Ft and FFT
 
Unit 9 Summary 2009
Unit 9 Summary 2009Unit 9 Summary 2009
Unit 9 Summary 2009
 
Lec 2
Lec 2Lec 2
Lec 2
 
Paper on Modeling and Subtractive synthesis of virtual violin
Paper on Modeling and Subtractive synthesis of virtual violinPaper on Modeling and Subtractive synthesis of virtual violin
Paper on Modeling and Subtractive synthesis of virtual violin
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
 
7 PITCH AND TIMBRE.ppt
7 PITCH AND TIMBRE.ppt7 PITCH AND TIMBRE.ppt
7 PITCH AND TIMBRE.ppt
 
Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...
Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...
Application of Fisher Linear Discriminant Analysis to Speech/Music Classifica...
 
Characterization of sound
Characterization of soundCharacterization of sound
Characterization of sound
 
The Overtone Spectrum
The Overtone SpectrumThe Overtone Spectrum
The Overtone Spectrum
 
Introduction to microphones
Introduction to microphonesIntroduction to microphones
Introduction to microphones
 
My Presentation @Ryerson University
My Presentation @Ryerson UniversityMy Presentation @Ryerson University
My Presentation @Ryerson University
 
The_impact_of_sound_control_room_acoustics_on_the_.pdf
The_impact_of_sound_control_room_acoustics_on_the_.pdfThe_impact_of_sound_control_room_acoustics_on_the_.pdf
The_impact_of_sound_control_room_acoustics_on_the_.pdf
 
Spectrograms
SpectrogramsSpectrograms
Spectrograms
 
Phase noise
Phase noisePhase noise
Phase noise
 
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
 

More from AlexanderLerch4

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf
AlexanderLerch4
 
12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf
AlexanderLerch4
 
10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf
AlexanderLerch4
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf
AlexanderLerch4
 
13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf
AlexanderLerch4
 
09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf
AlexanderLerch4
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf
AlexanderLerch4
 
15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf
AlexanderLerch4
 
12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf
AlexanderLerch4
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf
AlexanderLerch4
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf
AlexanderLerch4
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf
AlexanderLerch4
 
08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf
AlexanderLerch4
 
03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf
AlexanderLerch4
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf
AlexanderLerch4
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf
AlexanderLerch4
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf
AlexanderLerch4
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf
AlexanderLerch4
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf
AlexanderLerch4
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf
AlexanderLerch4
 

More from AlexanderLerch4 (20)

0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf0A-02-ACA-Fundamentals-Convolution.pdf
0A-02-ACA-Fundamentals-Convolution.pdf
 
12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf12-02-ACA-Genre.pdf
12-02-ACA-Genre.pdf
 
10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf10-02-ACA-Alignment.pdf
10-02-ACA-Alignment.pdf
 
09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf09-03-ACA-Temporal-Onsets.pdf
09-03-ACA-Temporal-Onsets.pdf
 
13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf13-00-ACA-Mood.pdf
13-00-ACA-Mood.pdf
 
09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf09-07-ACA-Temporal-Structure.pdf
09-07-ACA-Temporal-Structure.pdf
 
07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf07-06-ACA-Tonal-Chord.pdf
07-06-ACA-Tonal-Chord.pdf
 
15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf15-00-ACA-Music-Performance-Analysis.pdf
15-00-ACA-Music-Performance-Analysis.pdf
 
12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf12-01-ACA-Similarity.pdf
12-01-ACA-Similarity.pdf
 
09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf09-04-ACA-Temporal-BeatHisto.pdf
09-04-ACA-Temporal-BeatHisto.pdf
 
09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf09-05-ACA-Temporal-Tempo.pdf
09-05-ACA-Temporal-Tempo.pdf
 
11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf11-00-ACA-Fingerprinting.pdf
11-00-ACA-Fingerprinting.pdf
 
08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf08-00-ACA-Intensity.pdf
08-00-ACA-Intensity.pdf
 
03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf03-05-ACA-Input-Features.pdf
03-05-ACA-Input-Features.pdf
 
05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf05-00-ACA-Data-Intro.pdf
05-00-ACA-Data-Intro.pdf
 
07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf07-03-05-ACA-Tonal-F0Eval.pdf
07-03-05-ACA-Tonal-F0Eval.pdf
 
03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf03-03-01-ACA-Input-TF-Fourier.pdf
03-03-01-ACA-Input-TF-Fourier.pdf
 
03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf03-06-ACA-Input-FeatureLearning.pdf
03-06-ACA-Input-FeatureLearning.pdf
 
03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf03-07-01-ACA-Input-Post-Processing.pdf
03-07-01-ACA-Input-Post-Processing.pdf
 
07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf07-02-ACA-Tonal-Music.pdf
07-02-ACA-Tonal-Music.pdf
 

Recently uploaded

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 

Recently uploaded (20)

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 

07-01-ACA-Tonal-Perception.pdf

  • 1. Introduction to Audio Content Analysis module 7.1: human perception of pitch alexander lerch
  • 2. overview intro perception summary introduction overview corresponding textbook section section 7.1 lecture content • pitch as perceptual phenomenon • non-linear relation of frequency and pitch • frequency content of a simple pitched sound • dimensions of pitch perception learning objectives • describe basic properties of models for pitch • explain the two dimensions of pitch perception module 7.1: human perception of pitch 1 / 8
  • 3. overview intro perception summary introduction overview corresponding textbook section section 7.1 lecture content • pitch as perceptual phenomenon • non-linear relation of frequency and pitch • frequency content of a simple pitched sound • dimensions of pitch perception learning objectives • describe basic properties of models for pitch • explain the two dimensions of pitch perception module 7.1: human perception of pitch 1 / 8
  • 4. overview intro perception summary tonal analysis introduction pitch & pitch-based properties belong to the most important parameters describing music • melody • harmony • tonality • tuning & intonation related ACA tasks • fundamental frequency detection • key detection • chord detection • tuning frequency & temperament estimation module 7.1: human perception of pitch 2 / 8
  • 5. overview intro perception summary tonal analysis introduction pitch & pitch-based properties belong to the most important parameters describing music • melody • harmony • tonality • tuning & intonation related ACA tasks • fundamental frequency detection • key detection • chord detection • tuning frequency & temperament estimation module 7.1: human perception of pitch 2 / 8
  • 6. overview intro perception summary pitch perception pitch definition definition (American Standards Association) pitch is that attribute of auditory sensation in terms of which sounds may be ordered on a musical scale1 temporal variations in pitch give rise to a sense of melody closely related to frequency, but subjective ⇒ assigning a pitch value to a sound means specifying the frequency of a pure tone having the same subjective pitch as the sound 1 ASA, “Acoustical Terminology,” American Standards Association (ASA), Standard, 1960. module 7.1: human perception of pitch 3 / 8
  • 7. overview intro perception summary pitch perception pitch definition definition (American Standards Association) pitch is that attribute of auditory sensation in terms of which sounds may be ordered on a musical scale1 temporal variations in pitch give rise to a sense of melody closely related to frequency, but subjective ⇒ assigning a pitch value to a sound means specifying the frequency of a pure tone having the same subjective pitch as the sound 1 ASA, “Acoustical Terminology,” American Standards Association (ASA), Standard, 1960. module 7.1: human perception of pitch 3 / 8
  • 8. overview intro perception summary pitch perception pitch definition definition (American Standards Association) pitch is that attribute of auditory sensation in terms of which sounds may be ordered on a musical scale1 temporal variations in pitch give rise to a sense of melody closely related to frequency, but subjective ⇒ assigning a pitch value to a sound means specifying the frequency of a pure tone having the same subjective pitch as the sound 1 ASA, “Acoustical Terminology,” American Standards Association (ASA), Standard, 1960. module 7.1: human perception of pitch 3 / 8
  • 9. overview intro perception summary pitch perception fundamental frequency fundamental frequency is relevant for pitch perception (f0, 2f0, 3f0, . . .) higher fundamental frequency ⇒ higher pitch (mono-dimensional) module 7.1: human perception of pitch 4 / 8 matlab source: plotHarmonics.m
  • 10. overview intro perception summary pitch perception missing fundamental basilar membrane location does not explain the pitch perception of complex tones ⇒ virtual pitch, residue pitch example 1: missing fundamental • f0 = 120 Hz, 33 harmonics, with(out) bandpass 300-2400 Hz example 2: missing fundamental • speech f0 ≈ 100 Hz, with(out) bandpass 300-4000 Hz module 7.1: human perception of pitch 5 / 8
  • 11. overview intro perception summary pitch perception missing fundamental basilar membrane location does not explain the pitch perception of complex tones ⇒ virtual pitch, residue pitch example 1: missing fundamental • f0 = 120 Hz, 33 harmonics, with(out) bandpass 300-2400 Hz example 2: missing fundamental • speech f0 ≈ 100 Hz, with(out) bandpass 300-4000 Hz module 7.1: human perception of pitch 5 / 8
  • 12. overview intro perception summary pitch perception missing fundamental basilar membrane location does not explain the pitch perception of complex tones ⇒ virtual pitch, residue pitch example 1: missing fundamental • f0 = 120 Hz, 33 harmonics, with(out) bandpass 300-2400 Hz example 2: missing fundamental • speech f0 ≈ 100 Hz, with(out) bandpass 300-4000 Hz module 7.1: human perception of pitch 5 / 8
  • 13. overview intro perception summary pitch perception frequency & pitch non-linear pitch frequency relation: perceptual pitch distance ̸= frequency distance ⇒ models for psycho-acoustic/physiological data • Mel scale (equal pitch distance) • Bark scale (critical band width) • physiological frequency location (basilar membrane) module 7.1: human perception of pitch 6 / 8 matlab source: plotMelBark.m
  • 14. overview intro perception summary pitch perception frequency & pitch Fant: mF(f ) = 1000 · log2 1 + f 1000 Hz O’Shaughnessy: mS(f ) = 2595 · log10 1 + f 700 Hz mS(f ) = 1127 · log 1 + f 700 Hz module 7.1: human perception of pitch 6 / 8 matlab source: plotMelBark.m
  • 15. overview intro perception summary pitch perception frequency pitch Fant: mF(f ) = 1000 · log2 1 + f 1000 Hz O’Shaughnessy: mS(f ) = 2595 · log10 1 + f 700 Hz mS(f ) = 1127 · log 1 + f 700 Hz module 7.1: human perception of pitch 6 / 8 matlab source: plotMelBark.m
  • 16. overview intro perception summary pitch perception frequency pitch Schröder: zS(f ) = 7 · arcsinh f 650 Hz Terhardt: zT(f ) = 13.3 · arctan 0.75 · f 1000 Hz Zwicker: zZ (f ) = 13 · atan 0.76 · f 1000 Hz + 3.5 · atan f 7500 Hz module 7.1: human perception of pitch 6 / 8 matlab source: plotMelBark.m
  • 17. overview intro perception summary pitch perception frequency pitch ERB: e(f ) = 9.26 log 1 + f 228.7 Cochlear Map: x(f ) = 1 2.1 log10 f 165.4 + 1 module 7.1: human perception of pitch 6 / 8 matlab source: plotMelBark.m
  • 18. overview intro perception summary pitch perception pitch dimensions 2 dimensions of musical pitch tone height: monotonic relationship to frequency (increasing frequency ⇒ increasing pitch) tone chroma: two tones separated by octave sound similar (same pitch class) module 7.1: human perception of pitch 7 / 8
  • 19. overview intro perception summary pitch perception pitch dimensions 2 dimensions of musical pitch tone height: monotonic relationship to frequency (increasing frequency ⇒ increasing pitch) tone chroma: two tones separated by octave sound similar (same pitch class) module 7.1: human perception of pitch 7 / 8 matlab source: plotPitchHelix.m
  • 20. overview intro perception summary summary lecture content pitch • subjective phenomenon • non-linear monotonic relationship to frequency (tone height increases with fundamental frequency) • pitch grouping based on powers of two: tone chroma perception module 7.1: human perception of pitch 8 / 8