SlideShare a Scribd company logo
Spectrograms
An essential tool for instrumental phonetics
Voice Spectra and Spectrograms
• Recall that power spectra allow us to see the component frequencies and amplitudes
that make up a complex wave.
• Spectrograms are a way of viewing hundreds of sequential power spectra, so we can
see how the spectrum of a speech signal changes over time.
– This is done by turning the spectrum sideways, reducing each bar on the spectrum to a
dot, and using the color of the dot to symbolize the height of the bar (i.e., the amplitude or
intensity of the component frequency).
2
Building a Spectrogram
1. Power spectrum
amplitudes are
represented as different
shades of grey (taller =
darker)
2. Spectrum is rotated and
collapsed to 1-dimension.
3. Hundreds of spectra are
arrayed side by side,
allowing us to see
changes in formant
frequencies over time.
3
Reading a Spectrogram
• A spectrogram plots three variables:
– The horizontal axis is time, usually
measured in milliseconds.
– The vertical axis is frequency (Hz).
– The third variable, intensity, is represented
by the color of each point that is plotted.
• The dark bands show the formants of the
voiced sounds.
– The first three formants vary depending on
the shape of the vocal tract and are
important for understanding the speech
signal.
frequency
time (ms)
4
Narrowband Spectrograms
• When calculating the frequency components that make up the speech signal, it is necessary
to take a chunk of time to figure out the frequency of the wave.
• Narrowband spectrograms show calculations based on longer chunks of time, which allows
more accurate calculation of the frequencies making up the signal and can reveal individual
harmonics of the signal.
– Because the chunks of time are longer, narrowband spectrograms are less precise about when a
given speech event happened (e.g., a release burst or the onset of voicing). This lack of precision in
the temporal dimension is called time smear.
– Because of their detailed frequency information, narrowband spectrograms are useful for
investigating changes in pitch, tone, and intonation.
5
Narrowband Spectrograms
• [ɑbɑ]
– Frequency range: 0 – 5000 Hz; duration: ≈0.75 s
– Window size: 0.05 s
Individual
harmonics
6
Wideband Spectrograms
• Wideband spectrograms show calculations based on short chunks of time, which allows
accurate measurement of event timing at the expense of detailed frequency information.
Wideband spectrograms are vertically striated; each vertical striation shows an individual
vibration of the vocal folds.
– Remember that formants are broad peaks of intensity that range over several neighboring
frequencies. Since the wideband spectrograms “smear together” neighboring frequencies, this
actually helps us to see the formants more clearly.
– Vowel quality, place of articulation, and other segmental features are all aspects of the filter and thus
cause broad-frequency changes to the signal. Thus they are best viewed using wideband
spectrograms.
– Because of their accurate timing calculations, wideband spectrograms are also useful for measuring
VOT, vowel duration, and other time-based characteristics of speech.
7
Wideband Spectrograms
• [ɑbɑ]
– Frequency range: 0 – 5000 Hz; duration: ≈0.75 s
– Window size: 0.01 s
Formants
vertical striations
8
Reading a Spectrogram: Voicing
• Voiced sounds have a dark band very close to the bottom of the spectrogram,
corresponding to the fundamental frequency of the speaker’s voice.
– Many “voiced” consonants are allophonically devoiced, so the absence of a voicing band
does not guarantee that the sound is a voiceless phoneme.
9
Reading a Spectrogram: Voicing
• [ɑzɑ]
– Frequency range: 0 – 5000 Hz; duration: ≈0.75 s
Voicing band
10
Reading a Spectrogram: Voicing
• [ɑsɑ]
– Frequency range: 0 – 5000 Hz; duration: ≈0.75 s
No voicing band
11
Reading a Spectrogram: Approximants
• Vowels and approximants show vertical striations in wideband spectrograms, and
clear dark formant bands.
– Vowels are identified on a spectrogram primarily by the values of their F1 and F2 formants.
– Approximants involve constriction comparable to that of a high vowel, and so tend to have
fairly low F1 values. Approximants can often be distinguished by the movement of their F2
and F3 formants.
12
Reading a Spectrogram: Approximants
• [ɑlɑ]
– Frequency range: 0 – 5000 Hz; duration: ≈0.75 s
Formants
low F1 value for the approximant [ l]
13
Reading a Spectrogram: Oral Stops
• Oral stops involve complete stoppage of airflow and thus show up as a “gap” in the
spectrogram.
– The formant transitions of the voiced sounds before or after a stop are the best way to
determine the stop’s place of articulation.
– The release burst of a stop may be continuous with a following voiced sound, or may stand
alone at the end of a word as a brief vertical band of high energy. The distribution of that
energy can also provide clues to the place of articulation of the stop.
14
Reading a Spectrogram: Oral Stops
• [ɑpɑ]
– Frequency range: 0 – 5000 Hz; duration: ≈0.75 s
period of closure aspiration noise
release burst
15
Reading a Spectrogram: Nasal Stops
• Nasal stops allow continuation of airflow and show formant structure much like
vowels.
– Like oral stops, nasal place of articulation is most easily determined by the formant
transitions of the surrounding vowels.
– Unlike vowels, nasals typically show “zeroes” (white areas) between the lower formants.
– Nasals can be distinguished from vowels by a much lower amplitude in the waveform (not
the spectrogram).
16
Reading a Spectrogram: Nasal Stops
• [ɑnɑ]
– Frequency range: 0 – 5000 Hz; duration: ≈0.75 s
“zeroes”
17
Reading a Spectrogram: Nasal Stops
• [ɑnɑ]
lower amplitude
18
Reading a Spectrogram: Fricatives
• Fricatives are characterized by fairly high energy (darker color) especially in the
higher frequencies, and (usually) a lack of formant bands.
– [h] often does show formant bands. This is because [h] generates turbulence in the same
place that voicing originates (the glottis), so the turbulence of [h] is subject to the same
filter that vowels are (namely, the entire vocal tract). The spectrum of [h] can vary widely
depending on the following vowel.
19
Reading a Spectrogram: Fricatives
• [ɑʃɑ]
– Frequency range: 0 – 10,000 Hz; duration: ≈0.75 s
high energy in the higher frequencies
20
Reading a Spectrogram: Fricatives
• [ɑhɑ]
– Frequency range: 0 – 10,000 Hz; duration: ≈0.75 s
formant bands visible through the [h]
21
Reading a Spectrogram: More info
• For more detailed information on spectrogram reading, see:
– Ladefoged and Johnson ch. 8, esp. p. 193-212
– Rob Hagiwara’s “Mystery Spectrogram” website:
http://home.cc.umanitoba.ca/~robh/howto.html#intro
22
Reading a Spectrogram: Vowels
from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 194.
23
Formant Transitions:
Cues to Place of Articulation
from Johnson, K. (2003). Acoustic & Auditory Phonetics (2nd ed.). Malden, MA: Blackwell.
24
Reading a Spectrogram: Place
• F2 transition is higher the farther back the place of articulation:
Labiodental Dental Alveolar Post-alveolar
25
from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 201.
Reading a Spectrogram:
Distinguishing Fricatives
• Look at the spread and density of frication
• Quiet: Frication light, spread out, present in lower frequencies
• Loud (e.g. sibilants): Dense, concentrated in high frequencies.
26
from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 202.
Reading a Spectrogram:
Distinguishing Approximants
• Approximants flow smoothly into vowels
27
from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 202.
• [l] – dip in F2, slight rise in F3
• [ɹ] – central F2, scoop in F3
• [w] – like high back vowel [u/ʊ]
• [j] – like high front vowel [i/ɪ]
Spectrogram Examples: [ɹ]
• Scoop shape to F3 • F2 central (between front & back vowels)
28
1800 Hz
1500 Hz
500 Hz
[ m ɛ ɹ i ] [ k ɑ ɹ ]

More Related Content

What's hot

Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaning
fawzia
 
SPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLPSPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLP
HimaniBansal15
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
Ahmed Moawad
 
Acoustics of Speech: The Voice Mechanism
Acoustics of Speech: The Voice MechanismAcoustics of Speech: The Voice Mechanism
Acoustics of Speech: The Voice Mechanism
Farhat Surve
 
PHONATION.pptx
PHONATION.pptxPHONATION.pptx
PHONATION.pptx
Uday Dutta
 
Periodic and aperiodic sounds (2)
Periodic and aperiodic sounds (2)Periodic and aperiodic sounds (2)
Periodic and aperiodic sounds (2)
Ahmed Qadoury Abed
 
1. fluency definition.Dys and dis fluency difference.Definition and introduct...
1. fluency definition.Dys and dis fluency difference.Definition and introduct...1. fluency definition.Dys and dis fluency difference.Definition and introduct...
1. fluency definition.Dys and dis fluency difference.Definition and introduct...
Soorya Sunil
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
Hemaraja Nayaka S
 
Dimensions of fluency final
Dimensions of fluency finalDimensions of fluency final
Dimensions of fluency final
MSSumantra
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
Hemaraja Nayaka S
 
Acoustic phonetics
Acoustic phoneticsAcoustic phonetics
Acoustic phonetics
Junaid Amjed
 
Process of speech production.pptx
Process of speech production.pptxProcess of speech production.pptx
Process of speech production.pptx
LearnwithAnshita
 
Resonators
ResonatorsResonators
Resonators
Ahmed Qadoury Abed
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technology
Kalluri Madhuri
 
1. models of word recognition
1. models of word recognition1. models of word recognition
1. models of word recognition
Hemaraja Nayaka S
 
Distinctive features.pptx
Distinctive features.pptxDistinctive features.pptx
Distinctive features.pptx
SunilKumarRavi3
 
Discourse analysis
Discourse analysisDiscourse analysis
Discourse analysis
Vale Caicedo
 
Sp616 adult lexical processing for students
Sp616 adult lexical processing for studentsSp616 adult lexical processing for students
Sp616 adult lexical processing for students
Lynette Chan
 
Neuroanatomy of language functions
Neuroanatomy of language functionsNeuroanatomy of language functions
Neuroanatomy of language functions
Hemaraja Nayaka S
 
Coarticulation
CoarticulationCoarticulation
Coarticulation
zahraa Aamir
 

What's hot (20)

Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaning
 
SPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLPSPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLP
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Acoustics of Speech: The Voice Mechanism
Acoustics of Speech: The Voice MechanismAcoustics of Speech: The Voice Mechanism
Acoustics of Speech: The Voice Mechanism
 
PHONATION.pptx
PHONATION.pptxPHONATION.pptx
PHONATION.pptx
 
Periodic and aperiodic sounds (2)
Periodic and aperiodic sounds (2)Periodic and aperiodic sounds (2)
Periodic and aperiodic sounds (2)
 
1. fluency definition.Dys and dis fluency difference.Definition and introduct...
1. fluency definition.Dys and dis fluency difference.Definition and introduct...1. fluency definition.Dys and dis fluency difference.Definition and introduct...
1. fluency definition.Dys and dis fluency difference.Definition and introduct...
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
 
Dimensions of fluency final
Dimensions of fluency finalDimensions of fluency final
Dimensions of fluency final
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
 
Acoustic phonetics
Acoustic phoneticsAcoustic phonetics
Acoustic phonetics
 
Process of speech production.pptx
Process of speech production.pptxProcess of speech production.pptx
Process of speech production.pptx
 
Resonators
ResonatorsResonators
Resonators
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technology
 
1. models of word recognition
1. models of word recognition1. models of word recognition
1. models of word recognition
 
Distinctive features.pptx
Distinctive features.pptxDistinctive features.pptx
Distinctive features.pptx
 
Discourse analysis
Discourse analysisDiscourse analysis
Discourse analysis
 
Sp616 adult lexical processing for students
Sp616 adult lexical processing for studentsSp616 adult lexical processing for students
Sp616 adult lexical processing for students
 
Neuroanatomy of language functions
Neuroanatomy of language functionsNeuroanatomy of language functions
Neuroanatomy of language functions
 
Coarticulation
CoarticulationCoarticulation
Coarticulation
 

Viewers also liked

Investigation Periodic Faults in Yarn
Investigation Periodic Faults in YarnInvestigation Periodic Faults in Yarn
Investigation Periodic Faults in Yarn
Engr Rizwan
 
Prezentace mya
Prezentace myaPrezentace mya
Prezentace mya
Jan Bajus
 
Qqm4 en
Qqm4 enQqm4 en
Qqm4 en
Jan Bajus
 
Hairiness
HairinessHairiness
Hairiness
Harsha157
 
LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE ...
LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE                          ...LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE                          ...
LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE ...
Praveen Rams
 
Phonetics
PhoneticsPhonetics
Unit3presentation
Unit3presentationUnit3presentation
Unit3presentation
poags25
 
Spinning With Technical Acess
Spinning With Technical AcessSpinning With Technical Acess
Spinning With Technical Acess
eshan bansal
 
Importance, Effect & Testing of Yarn Evenness
Importance, Effect & Testing of Yarn EvennessImportance, Effect & Testing of Yarn Evenness
Importance, Effect & Testing of Yarn Evenness
Amirul Eahsan
 

Viewers also liked (9)

Investigation Periodic Faults in Yarn
Investigation Periodic Faults in YarnInvestigation Periodic Faults in Yarn
Investigation Periodic Faults in Yarn
 
Prezentace mya
Prezentace myaPrezentace mya
Prezentace mya
 
Qqm4 en
Qqm4 enQqm4 en
Qqm4 en
 
Hairiness
HairinessHairiness
Hairiness
 
LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE ...
LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE                          ...LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE                          ...
LATEST DEVELOPMENT IN YARN EVENNESS TESTING MACHINE ...
 
Phonetics
PhoneticsPhonetics
Phonetics
 
Unit3presentation
Unit3presentationUnit3presentation
Unit3presentation
 
Spinning With Technical Acess
Spinning With Technical AcessSpinning With Technical Acess
Spinning With Technical Acess
 
Importance, Effect & Testing of Yarn Evenness
Importance, Effect & Testing of Yarn EvennessImportance, Effect & Testing of Yarn Evenness
Importance, Effect & Testing of Yarn Evenness
 

Similar to Spectrograms

Spectrogram.pptx
Spectrogram.pptxSpectrogram.pptx
Spectrogram.pptx
Subramanian Mani
 
Unit 9 Summary 2009
Unit 9 Summary 2009Unit 9 Summary 2009
Unit 9 Summary 2009
mjgvalcarce
 
Physics basic
Physics basicPhysics basic
Physics basic
Syed Amir Gilani
 
Medical Equipment Section2
Medical Equipment Section2Medical Equipment Section2
Medical Equipment Section2
cairo university
 
Lec 2
Lec 2Lec 2
Lec 2
quchani
 
acoustics on human ear
acoustics on human earacoustics on human ear
acoustics on human ear
Abhishek kumar
 
Resonance
ResonanceResonance
USphysics.pptx
USphysics.pptxUSphysics.pptx
USphysics.pptx
MdSohailAhmed7
 
Module No. 46
Module No. 46Module No. 46
ULTRASOUND IMAGING PRINCIPLES
ULTRASOUND IMAGING PRINCIPLESULTRASOUND IMAGING PRINCIPLES
ULTRASOUND IMAGING PRINCIPLES
INDIA ULTRASOUND
 
Cube model Theory of acoustic phonetics
Cube model Theory of acoustic phonetics Cube model Theory of acoustic phonetics
Cube model Theory of acoustic phonetics
KarloHammer
 
Visualizing sound
Visualizing soundVisualizing sound
Visualizing sound
Rodrigo De la Prida
 
5 most important synthesis modules
5 most important synthesis modules5 most important synthesis modules
5 most important synthesis modules
philipsugumaran
 
Preliminary Physics - World communicates 2
Preliminary Physics - World communicates 2Preliminary Physics - World communicates 2
Preliminary Physics - World communicates 2
Silvia Choi
 
Physics P1.5
Physics  P1.5Physics  P1.5
Physics P1.5
Jaskirat Kanwal
 
Chapter 2 The Nature of Sound
Chapter 2 The Nature of SoundChapter 2 The Nature of Sound
Chapter 2 The Nature of Sound
markgr82nz
 
Year 9 Physics Term 2 Lesson Presentation Week 1.pptx
Year 9 Physics Term 2 Lesson Presentation Week 1.pptxYear 9 Physics Term 2 Lesson Presentation Week 1.pptx
Year 9 Physics Term 2 Lesson Presentation Week 1.pptx
PayMyEvery
 
production of ultrasound and physical characteristics-
production of ultrasound and physical characteristics-production of ultrasound and physical characteristics-
production of ultrasound and physical characteristics-
Lushinga Mourice
 
sound basic & microphone
sound basic & microphonesound basic & microphone
sound basic & microphone
Nishith Kumar
 
Characterization of sound
Characterization of soundCharacterization of sound
Characterization of sound
R.Narasimha Swamy
 

Similar to Spectrograms (20)

Spectrogram.pptx
Spectrogram.pptxSpectrogram.pptx
Spectrogram.pptx
 
Unit 9 Summary 2009
Unit 9 Summary 2009Unit 9 Summary 2009
Unit 9 Summary 2009
 
Physics basic
Physics basicPhysics basic
Physics basic
 
Medical Equipment Section2
Medical Equipment Section2Medical Equipment Section2
Medical Equipment Section2
 
Lec 2
Lec 2Lec 2
Lec 2
 
acoustics on human ear
acoustics on human earacoustics on human ear
acoustics on human ear
 
Resonance
ResonanceResonance
Resonance
 
USphysics.pptx
USphysics.pptxUSphysics.pptx
USphysics.pptx
 
Module No. 46
Module No. 46Module No. 46
Module No. 46
 
ULTRASOUND IMAGING PRINCIPLES
ULTRASOUND IMAGING PRINCIPLESULTRASOUND IMAGING PRINCIPLES
ULTRASOUND IMAGING PRINCIPLES
 
Cube model Theory of acoustic phonetics
Cube model Theory of acoustic phonetics Cube model Theory of acoustic phonetics
Cube model Theory of acoustic phonetics
 
Visualizing sound
Visualizing soundVisualizing sound
Visualizing sound
 
5 most important synthesis modules
5 most important synthesis modules5 most important synthesis modules
5 most important synthesis modules
 
Preliminary Physics - World communicates 2
Preliminary Physics - World communicates 2Preliminary Physics - World communicates 2
Preliminary Physics - World communicates 2
 
Physics P1.5
Physics  P1.5Physics  P1.5
Physics P1.5
 
Chapter 2 The Nature of Sound
Chapter 2 The Nature of SoundChapter 2 The Nature of Sound
Chapter 2 The Nature of Sound
 
Year 9 Physics Term 2 Lesson Presentation Week 1.pptx
Year 9 Physics Term 2 Lesson Presentation Week 1.pptxYear 9 Physics Term 2 Lesson Presentation Week 1.pptx
Year 9 Physics Term 2 Lesson Presentation Week 1.pptx
 
production of ultrasound and physical characteristics-
production of ultrasound and physical characteristics-production of ultrasound and physical characteristics-
production of ultrasound and physical characteristics-
 
sound basic & microphone
sound basic & microphonesound basic & microphone
sound basic & microphone
 
Characterization of sound
Characterization of soundCharacterization of sound
Characterization of sound
 

Recently uploaded

S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
taiba qazi
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
Bisnar Chase Personal Injury Attorneys
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
Israel Genealogy Research Association
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
amberjdewit93
 
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
mulvey2
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 
Assessment and Planning in Educational technology.pptx
Assessment and Planning in Educational technology.pptxAssessment and Planning in Educational technology.pptx
Assessment and Planning in Educational technology.pptx
Kavitha Krishnan
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
Priyankaranawat4
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
ak6969907
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
Celine George
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
Life upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for studentLife upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for student
NgcHiNguyn25
 

Recently uploaded (20)

S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
 
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 
Assessment and Planning in Educational technology.pptx
Assessment and Planning in Educational technology.pptxAssessment and Planning in Educational technology.pptx
Assessment and Planning in Educational technology.pptx
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
Life upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for studentLife upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for student
 

Spectrograms

  • 1. Spectrograms An essential tool for instrumental phonetics
  • 2. Voice Spectra and Spectrograms • Recall that power spectra allow us to see the component frequencies and amplitudes that make up a complex wave. • Spectrograms are a way of viewing hundreds of sequential power spectra, so we can see how the spectrum of a speech signal changes over time. – This is done by turning the spectrum sideways, reducing each bar on the spectrum to a dot, and using the color of the dot to symbolize the height of the bar (i.e., the amplitude or intensity of the component frequency). 2
  • 3. Building a Spectrogram 1. Power spectrum amplitudes are represented as different shades of grey (taller = darker) 2. Spectrum is rotated and collapsed to 1-dimension. 3. Hundreds of spectra are arrayed side by side, allowing us to see changes in formant frequencies over time. 3
  • 4. Reading a Spectrogram • A spectrogram plots three variables: – The horizontal axis is time, usually measured in milliseconds. – The vertical axis is frequency (Hz). – The third variable, intensity, is represented by the color of each point that is plotted. • The dark bands show the formants of the voiced sounds. – The first three formants vary depending on the shape of the vocal tract and are important for understanding the speech signal. frequency time (ms) 4
  • 5. Narrowband Spectrograms • When calculating the frequency components that make up the speech signal, it is necessary to take a chunk of time to figure out the frequency of the wave. • Narrowband spectrograms show calculations based on longer chunks of time, which allows more accurate calculation of the frequencies making up the signal and can reveal individual harmonics of the signal. – Because the chunks of time are longer, narrowband spectrograms are less precise about when a given speech event happened (e.g., a release burst or the onset of voicing). This lack of precision in the temporal dimension is called time smear. – Because of their detailed frequency information, narrowband spectrograms are useful for investigating changes in pitch, tone, and intonation. 5
  • 6. Narrowband Spectrograms • [ɑbɑ] – Frequency range: 0 – 5000 Hz; duration: ≈0.75 s – Window size: 0.05 s Individual harmonics 6
  • 7. Wideband Spectrograms • Wideband spectrograms show calculations based on short chunks of time, which allows accurate measurement of event timing at the expense of detailed frequency information. Wideband spectrograms are vertically striated; each vertical striation shows an individual vibration of the vocal folds. – Remember that formants are broad peaks of intensity that range over several neighboring frequencies. Since the wideband spectrograms “smear together” neighboring frequencies, this actually helps us to see the formants more clearly. – Vowel quality, place of articulation, and other segmental features are all aspects of the filter and thus cause broad-frequency changes to the signal. Thus they are best viewed using wideband spectrograms. – Because of their accurate timing calculations, wideband spectrograms are also useful for measuring VOT, vowel duration, and other time-based characteristics of speech. 7
  • 8. Wideband Spectrograms • [ɑbɑ] – Frequency range: 0 – 5000 Hz; duration: ≈0.75 s – Window size: 0.01 s Formants vertical striations 8
  • 9. Reading a Spectrogram: Voicing • Voiced sounds have a dark band very close to the bottom of the spectrogram, corresponding to the fundamental frequency of the speaker’s voice. – Many “voiced” consonants are allophonically devoiced, so the absence of a voicing band does not guarantee that the sound is a voiceless phoneme. 9
  • 10. Reading a Spectrogram: Voicing • [ɑzɑ] – Frequency range: 0 – 5000 Hz; duration: ≈0.75 s Voicing band 10
  • 11. Reading a Spectrogram: Voicing • [ɑsɑ] – Frequency range: 0 – 5000 Hz; duration: ≈0.75 s No voicing band 11
  • 12. Reading a Spectrogram: Approximants • Vowels and approximants show vertical striations in wideband spectrograms, and clear dark formant bands. – Vowels are identified on a spectrogram primarily by the values of their F1 and F2 formants. – Approximants involve constriction comparable to that of a high vowel, and so tend to have fairly low F1 values. Approximants can often be distinguished by the movement of their F2 and F3 formants. 12
  • 13. Reading a Spectrogram: Approximants • [ɑlɑ] – Frequency range: 0 – 5000 Hz; duration: ≈0.75 s Formants low F1 value for the approximant [ l] 13
  • 14. Reading a Spectrogram: Oral Stops • Oral stops involve complete stoppage of airflow and thus show up as a “gap” in the spectrogram. – The formant transitions of the voiced sounds before or after a stop are the best way to determine the stop’s place of articulation. – The release burst of a stop may be continuous with a following voiced sound, or may stand alone at the end of a word as a brief vertical band of high energy. The distribution of that energy can also provide clues to the place of articulation of the stop. 14
  • 15. Reading a Spectrogram: Oral Stops • [ɑpɑ] – Frequency range: 0 – 5000 Hz; duration: ≈0.75 s period of closure aspiration noise release burst 15
  • 16. Reading a Spectrogram: Nasal Stops • Nasal stops allow continuation of airflow and show formant structure much like vowels. – Like oral stops, nasal place of articulation is most easily determined by the formant transitions of the surrounding vowels. – Unlike vowels, nasals typically show “zeroes” (white areas) between the lower formants. – Nasals can be distinguished from vowels by a much lower amplitude in the waveform (not the spectrogram). 16
  • 17. Reading a Spectrogram: Nasal Stops • [ɑnɑ] – Frequency range: 0 – 5000 Hz; duration: ≈0.75 s “zeroes” 17
  • 18. Reading a Spectrogram: Nasal Stops • [ɑnɑ] lower amplitude 18
  • 19. Reading a Spectrogram: Fricatives • Fricatives are characterized by fairly high energy (darker color) especially in the higher frequencies, and (usually) a lack of formant bands. – [h] often does show formant bands. This is because [h] generates turbulence in the same place that voicing originates (the glottis), so the turbulence of [h] is subject to the same filter that vowels are (namely, the entire vocal tract). The spectrum of [h] can vary widely depending on the following vowel. 19
  • 20. Reading a Spectrogram: Fricatives • [ɑʃɑ] – Frequency range: 0 – 10,000 Hz; duration: ≈0.75 s high energy in the higher frequencies 20
  • 21. Reading a Spectrogram: Fricatives • [ɑhɑ] – Frequency range: 0 – 10,000 Hz; duration: ≈0.75 s formant bands visible through the [h] 21
  • 22. Reading a Spectrogram: More info • For more detailed information on spectrogram reading, see: – Ladefoged and Johnson ch. 8, esp. p. 193-212 – Rob Hagiwara’s “Mystery Spectrogram” website: http://home.cc.umanitoba.ca/~robh/howto.html#intro 22
  • 23. Reading a Spectrogram: Vowels from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 194. 23
  • 24. Formant Transitions: Cues to Place of Articulation from Johnson, K. (2003). Acoustic & Auditory Phonetics (2nd ed.). Malden, MA: Blackwell. 24
  • 25. Reading a Spectrogram: Place • F2 transition is higher the farther back the place of articulation: Labiodental Dental Alveolar Post-alveolar 25 from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 201.
  • 26. Reading a Spectrogram: Distinguishing Fricatives • Look at the spread and density of frication • Quiet: Frication light, spread out, present in lower frequencies • Loud (e.g. sibilants): Dense, concentrated in high frequencies. 26 from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 202.
  • 27. Reading a Spectrogram: Distinguishing Approximants • Approximants flow smoothly into vowels 27 from Ladefoged, P. & Johnson, K. (2011). A Course in Phonetics (6th ed.), p. 202. • [l] – dip in F2, slight rise in F3 • [ɹ] – central F2, scoop in F3 • [w] – like high back vowel [u/ʊ] • [j] – like high front vowel [i/ɪ]
  • 28. Spectrogram Examples: [ɹ] • Scoop shape to F3 • F2 central (between front & back vowels) 28 1800 Hz 1500 Hz 500 Hz [ m ɛ ɹ i ] [ k ɑ ɹ ]

Editor's Notes

  1. Good for viewing pitch changes, more qualitatively (it’s hard to measure because time smear makes things inaccurate). To change the window size in a Praat: from the Edit window for a sound file: Spectrum > Spectrogram settings… Window length
  2. Shorter window = wider band
  3. More often used, more accurate for measuring formants, place cues, duration, phonation, etc. To change the window size in a Praat: from the Edit window for a sound file: Spectrum > Spectrogram settings… Window length
  4. Frequency range to 5,000 Hz includes important filter information (F1, F2, F3), but notice that frication noise extends well above this range (up to 10,000 Hz). If you’re measuring fricatives, extend the range: In a Praat Edit window: Spectrum > Spectrogram settings… View range
  5. Note that a voicing band may extend partially through a voiceless phoneme, or a voiced phoneme may become devoiced partway thorough. Languages differ on how much of this coarticulation they allow before allophones are perceived as belonging to other phonemes. (Ex: English allows long periods of devoiced [z] word-finally, e.g. in plurals.) Go back a couple slides to [aba] and notice that the voicing band doesn’t continue all the way through the stop closure – you can’t voice all the way through the closure because air needs to flow past your vocal folds to make them vibrate.
  6. Lack of release burst common word-finally.
  7. Release burst = puff of air released from air built up behind the point of closure. Notice how the release burst is short, dark, and has sharp edges, while aspiration is spread out. Aspiration = voiceless portion after consonant release but before voicing begins (before vocal folds begin vibrating). This is similar to the voiceless glottal fricative [h]. You may see faint hints of formants because the vocal tract is in position for the following vowel. So, [h] can also be considered a voiceless vowel. The duration between the release burst and beginning (onset) of voicing = Voice Onset Time (VOT)
  8. Clear closure edges, voicing, formant structure, low F1, lighter formants, zeroes. Lighter formants: Nasal passages are additional tubes (side resonators) off the main resonating tube, dampening the energy in the formant readings. Nasal zeroes (lighter spots between formants): The side resonators (nasal passages) suck energy out of the signal (anti-resonance). Different formant locations than for oral vowels: Nasal tract is longer than oral tract, making your whole vocal tract longer. The longer the total tract (including pockets in the oral cavity), the lower the first zero – bilabial [m] = longest, therefore lowest zero, then alveolar [n], then velar, etc.
  9. Nasal passages (side resonators) absorb (attenuate) much of the energy.
  10. Frication = aperiodic turbulence (air bouncing around quickly and randomly at the point of constriction).
  11. Voiced fricatives will have less frication energy (lighter) because air is only flowing to the point of constriction half of the time – when the vocal folds are open. If you’re looking for/measuring fricatives, extend the range, so you can see the frication noise above 5,000 Hz: In a Praat Edit window: Spectrum > Spectrogram settings… View range
  12. Formants visible because vocal tract is in position for a neighboring vowel; frication comes from glottis (between vocal folds) – the source is below the whole vocal tract, meaning the filter (vocal tract) is the same as for vowels, where the source is also at the glottis. So, [h] can be considered a voiceless vowel.
  13. Also notice: formant structure for [m] (lighter, lower formants) plus nasal zero; long, heavy aspiration for [k]