This document provides an overview of physical sounds and speech sounds. It discusses the basics of sounds including acoustics, sound waves, frequency, amplitude, pure tones, and harmonics. It also examines speech sounds and the articulators involved in shaping sounds, including the lungs, vocal tract, larynx, pharynx, oral cavity and nasal cavity. Places of articulation are identified. The document also discusses Korean phonology and provides examples of Korean consonants and vowels. Different approaches for grouping sounds are introduced.
2. Physical sounds Speech sounds References
Today’s kick-off
Respect your instructor, your TA, and your fellow students.
2 / 96
3. Physical sounds Speech sounds References
Today’s kick-off
Respect your instructor, your TA, and your fellow students.
The spacing in Korean (see NTU COOL - Discussion)
3 / 96
4. Physical sounds Speech sounds References
Today’s kick-off
Respect your instructor, your TA, and your fellow students.
The spacing in Korean (see NTU COOL - Discussion)
In-class exercise
4 / 96
6. Physical sounds Speech sounds References
Basics of sounds
• The physical characteristics of sound of speech is termed acoustics.
6 / 96
7. Physical sounds Speech sounds References
Basics of sounds
• The physical characteristics of sound of speech is termed acoustics.
Sound doesn’t have mass or weight.
Sound: An audible disturbance in a medium caused by a
vibrating source.
7 / 96
8. Physical sounds Speech sounds References
Basics of sounds
• The physical characteristics of sound of speech is termed acoustics.
Sound doesn’t have mass or weight.
Sound: An audible disturbance in a medium caused by a
vibrating source.
Essential constituents of sound:
1 A source of energy
2 A vibrating source
3 A medium of transmission
8 / 96
9. Physical sounds Speech sounds References
Sound waves
Figure adapted from Goldstein (2010), Fig. 11.1
Sound travels like waves and diffuse in
all directions. ⇒ Sound waves
Pressure moves from high pressure to
low pressure. ⇒ Equalization of
pressure
Air particles are re-organized due to
pressure changes:
Increase in pressure: compression
(condensation)
Decrease in pressure: rarefaction
9 / 96
10. Physical sounds Speech sounds References
Sound waves
Boyle’s Law: P1V1 = P2V2
– In a closed system, pressure and
volume are inversely related for a
given temperature.
In the digital world, sound waves are
represented by samples of amplitude at
a fixed rate.
The difference between two particular
states in the same sound wave is called
phase.
Normal speech signals range from 600
µPa ∼ 20 Pa (though human ears can pick
up as low as 20 µPa).
10 / 96
11. Physical sounds Speech sounds References
Sound waves
Types of waves
Longitudinal wave: the direction in
which the molecules oscillate is
the same as the direction of the wave
itself.
Transverse wave: the direction of
movement of the individual molecules is
perpendicular to the direction in which
the wave itself travels.
11 / 96
12. Physical sounds Speech sounds References
Sound waves
Types of waves
Longitudinal wave: the direction in
which the molecules oscillate is
the same as the direction of the wave
itself.
Transverse wave: the direction of
movement of the individual molecules is
perpendicular to the direction in which
the wave itself travels.
In general, transverse waves travel
faster than longitudinal waves.
However, the medium plays a more
determinative role.
12 / 96
13. Physical sounds Speech sounds References
Pure tone and sine waves
Pure tone
A pure tone consists of only ONE
frequency (i.e., a vibration that repeats itself
in exactly the same way at an unchanging rate).
A tuning fork creates a sound wave
that consists of one single frequency. ⇒
pure tone
Elasticity: the restoring force that
causes an elastic medium to bounce
back when stretched or displaced.
Inertia: the tendency for motion or
lack of motion to continue.
13 / 96
14. Physical sounds Speech sounds References
Frequency vs. amplitude
Frequency: rate of occurrence. The unit is Hertz (Hz).
Frequency = number of periods
1(s) = 1
period duration(s)
14 / 96
15. Physical sounds Speech sounds References
Frequency vs. amplitude
Frequency: rate of occurrence. The unit is Hertz (Hz).
Frequency = number of periods
1(s) = 1
period duration(s)
The relation between distance, time, and speed:
distance(m)
time(s) = speed (m/s)
The speed of sound is around 344 m/s at around 21 ◦
C.
15 / 96
16. Physical sounds Speech sounds References
Frequency vs. amplitude
Frequency: rate of occurrence. The unit is Hertz (Hz).
Frequency = number of periods
1(s) = 1
period duration(s)
The relation between distance, time, and speed:
distance(m)
time(s) = speed (m/s)
The speed of sound is around 344 m/s at around 21 ◦
C.
Amplitude: difference between (air) pressure maxima and
minima.
⇒ perceived as louder!
16 / 96
21. Physical sounds Speech sounds References
Sound waves
Figure adapted from Goldstein (2010), Fig. 11.2
A sine wave ⇒ pure tone
Pressure changes can also be
indicated by darkening and
lightening
21 / 96
22. Physical sounds Speech sounds References
Sound waves
Figure adapted from Goldstein (2010), Fig. 11.2
A sine wave ⇒ pure tone
Pressure changes can also be
indicated by darkening and
lightening
Figure adapted from Goldstein (2007), Fig. 11.3
Larger amplitude (i.e., larger air
pressure) is associated with the
perception of greater loudness.
22 / 96
23. Physical sounds Speech sounds References
Periodic and aperiodic sounds
Periodic (top): sinusoidal pressure
changes; sine wave
Complex (middle): multiple
overlapping sound waves
⇒ A loud 100 Hz
⇒ A quiet 1000 Hz
Aperiodic (bottom): Random or
non-overlap sound waves
23 / 96
24. Physical sounds Speech sounds References
Complex tones
Figure adapted from Goldstein (2010), Fig. 11.5
Except pure tones, sounds we hear
and produce are complex tones.
Fourier transform tells us that a
complex wave can be decomposed
into a fundamental frequency
and harmonics.
Fast Fourier Transform (FFT)
(https://www.youtube.com/
watch?v=zKKGA30bHG0)
24 / 96
25. Physical sounds Speech sounds References
Sound waves
What can we see on waveforms?
25 / 96
26. Physical sounds Speech sounds References
Sound waves
What can we see on waveforms?
1 Fundamental frequency (F0): each vibration of the vocal cords.
26 / 96
27. Physical sounds Speech sounds References
Sound waves
What can we see on waveforms?
1 Fundamental frequency (F0): each vibration of the vocal cords.
2 Amplitude: relative loudness
27 / 96
28. Physical sounds Speech sounds References
Sound waves
What can we see on waveforms?
1 Fundamental frequency (F0): each vibration of the vocal cords.
2 Amplitude: relative loudness
3 Silence
28 / 96
29. Physical sounds Speech sounds References
Sound waves
What can we see on waveforms?
1 Fundamental frequency (F0): each vibration of the vocal cords.
2 Amplitude: relative loudness
3 Silence
What can we not see on waveforms?
29 / 96
30. Physical sounds Speech sounds References
Sound waves
What can we see on waveforms?
1 Fundamental frequency (F0): each vibration of the vocal cords.
2 Amplitude: relative loudness
3 Silence
What can we not see on waveforms?
1 Which sound it is
30 / 96
31. Physical sounds Speech sounds References
Sound waves
What can we see on waveforms?
1 Fundamental frequency (F0): each vibration of the vocal cords.
2 Amplitude: relative loudness
3 Silence
What can we not see on waveforms?
1 Which sound it is
2 BUT we can guess if the waveforms correspond to a vowel or a
consonant.
31 / 96
32. Physical sounds Speech sounds References
Spectrogram
Figure adapted from Ladefoged and Johnson (2011)
Definition: A graphic representation of sounds in terms of their
component frequencies, in which time is shown on the horizontal
axis, frequency on the vertical axis, and the intensity of each
frequency at each moment in time by the darkness of the mark.
(Ladefoged and Johnson, 2011:310)
32 / 96
33. Physical sounds Speech sounds References
Formants
Figure adapted from (Ladefoged and Johnson, 2011)
(Spectral) Peaks of pressure are
characterized as Formants.
Transitions into and out of the
vowels.
Different vowels can be characterized
by different formant profiles.
33 / 96
34. Physical sounds Speech sounds References
Timbre
Figure adapted from Goldstein (2007), Fig. 11.10
Musical instruments have different
fundamental frequencies, harmonics,
correspondent sound pressure level
(SPL), and frequency ranges.
⇒ Different timbres
34 / 96
35. Physical sounds Speech sounds References
Octave
Figure adapted from Goldstein (2007), Fig. 11.8
An octave: a pitch with another half or double its frequency
There are 12 semitones within an octave.
Fn × 12
√
2 = Fn+1
35 / 96
39. Physical sounds Speech sounds References
Sounds
The Gods must be crazy (1980): https://youtu.be/dfg4yowq-iE
Seediq Bale (2011): https://youtu.be/3L PNQgQ49w
39 / 96
40. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
40 / 96
41. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
41 / 96
42. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
42 / 96
43. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
1 the lungs
43 / 96
44. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
1 the lungs
2 the vocal tract
44 / 96
45. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
1 the lungs
2 the vocal tract
larynx
45 / 96
46. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
1 the lungs
2 the vocal tract
larynx
pharynx
46 / 96
47. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
1 the lungs
2 the vocal tract
larynx
pharynx
oral cavity
47 / 96
48. Physical sounds Speech sounds References
Sounds
What are the neuromuscular structures (i.e., articulators) that help us
“shape” and “produce” sounds?
1 the lungs
2 the vocal tract
larynx
pharynx
oral cavity
nasal cavity
48 / 96
49. Physical sounds Speech sounds References
Sounds
1 Lungs
What kinds of aerodynamic
mechanisms are involved?
49 / 96
50. Physical sounds Speech sounds References
Sounds
1 Lungs
What kinds of aerodynamic
mechanisms are involved?
Egressive vs. ingressive
Norwegian example:
https://en.wikipedia.org/wiki/File:
Ja ingressive.ogg
⇒ Pulmonic egressive vs.
ingressive
50 / 96
51. Physical sounds Speech sounds References
Sounds
In the San video, how many articulators in the vocal tract can you
identify?
51 / 96
52. Physical sounds Speech sounds References
Sounds
In the San video, how many articulators in the vocal tract can you
identify?
2 Vocal tract
Larynx/Glottis
Pharynx
Palate (hard vs. soft)
Tongue
Teeth
Lips
⇒ Places of articulation
52 / 96
53. Physical sounds Speech sounds References
Did you notice?
Now you have a (rough) idea about different places of
articulation. But, have you noticed?
53 / 96
54. Physical sounds Speech sounds References
Did you notice?
Now you have a (rough) idea about different places of
articulation. But, have you noticed?
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
54 / 96
55. Physical sounds Speech sounds References
Did you notice?
Now you have a (rough) idea about different places of
articulation. But, have you noticed?
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
labial
alveolar
velar
alveo-palatal
palatal
alveolar
55 / 96
64. Physical sounds Speech sounds References
More about Korean
Can you guess the sounds for the following Korean consonants?
64 / 96
65. Physical sounds Speech sounds References
Another approach of grouping
The order of ㄅㄆㄇㄈ reveals a way of grouping sounds. But is
there any other approach?
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
65 / 96
66. Physical sounds Speech sounds References
Another approach of grouping
The order of ㄅㄆㄇㄈ reveals a way of grouping sounds. But is
there any other approach?
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
66 / 96
67. Physical sounds Speech sounds References
Another approach of grouping
The order of ㄅㄆㄇㄈ reveals a way of grouping sounds. But is
there any other approach?
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
67 / 96
68. Physical sounds Speech sounds References
Another approach of grouping
The order of ㄅㄆㄇㄈ reveals a way of grouping sounds. But is
there any other approach?
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
68 / 96
69. Physical sounds Speech sounds References
Another approach of grouping
The order of ㄅㄆㄇㄈ reveals a way of grouping sounds. But is
there any other approach?
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
69 / 96
70. Physical sounds Speech sounds References
Manner of articulation
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
1 Stop: Airstream is blocked completely in the oral
cavity.
70 / 96
71. Physical sounds Speech sounds References
Manner of articulation
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
1 Stop: Airstream is blocked completely in the oral
cavity.
2 Nasal: Airstream passes through the nasal cavity
(by lowering the velum).
71 / 96
72. Physical sounds Speech sounds References
Manner of articulation
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
1 Stop: Airstream is blocked completely in the oral
cavity.
2 Nasal: Airstream passes through the nasal cavity
(by lowering the velum).
3 Fricative: Airstream is obstructed that it causes
frication.
72 / 96
73. Physical sounds Speech sounds References
Manner of articulation
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
1 Stop: Airstream is blocked completely in the oral
cavity.
2 Nasal: Airstream passes through the nasal cavity
(by lowering the velum).
3 Fricative: Airstream is obstructed that it causes
frication.
4 Affricate: Sounds with complete obstruction
followed by a slight release; affricate = stop +
fricative.
73 / 96
74. Physical sounds Speech sounds References
Manner of articulation
ㄅㄆㄇㄈ
ㄉㄊㄋㄌ
ㄍㄎㄏ
ㄐㄑㄒ
ㄓㄔㄕㄖ
ㄗㄘㄙ
1 Stop: Airstream is blocked completely in the oral
cavity.
2 Nasal: Airstream passes through the nasal cavity
(by lowering the velum).
3 Fricative: Airstream is obstructed that it causes
frication.
4 Affricate: Sounds with complete obstruction
followed by a slight release; affricate = stop +
fricative.
5 Liquid: One type of approximants that involves
turbulence (e.g., [l]).
74 / 96
75. Physical sounds Speech sounds References
Other ways of grouping
OK. Now we know the places and manners of articulation. But,
[b] and [g] are still qualitatively different, even though they both
share the same place and manner of articulation. Likewise, [k]
and [g]. What quality can tell one from the other?
75 / 96
76. Physical sounds Speech sounds References
Other ways of grouping
OK. Now we know the places and manners of articulation. But,
[b] and [g] are still qualitatively different, even though they both
share the same place and manner of articulation. Likewise, [k]
and [g]. What quality can tell one from the other?
⇒ Voicing: the vibration of the vocal folds.
76 / 96
77. Physical sounds Speech sounds References
Other ways of grouping
OK. Now we know the places and manners of articulation. But,
[b] and [g] are still qualitatively different, even though they both
share the same place and manner of articulation. Likewise, [k]
and [g]. What quality can tell one from the other?
⇒ Voicing: the vibration of the vocal folds.
Speech signals also reveal contrasts between voiced and voiceless
sounds.
77 / 96
78. Physical sounds Speech sounds References
Other ways of grouping
OK. Now we know the places and manners of articulation. But,
[b] and [g] are still qualitatively different, even though they both
share the same place and manner of articulation. Likewise, [k]
and [g]. What quality can tell one from the other?
⇒ Voicing: the vibration of the vocal folds.
Speech signals also reveal contrasts between voiced and voiceless
sounds.
⇒ Voice onset time
78 / 96
79. Physical sounds Speech sounds References
Voice Onset Time (VOT)
The latency between the release of the closure and the voicing
onset.
VOT = voicing onset – release time
⇒ VOT can be negative!
Which is [ba], which is [pa]?
Time (s)
0 1.142
0
8000
Frequency
(Hz)
79 / 96
80. Physical sounds Speech sounds References
Voice Onset Time (VOT)
The latency between the release of the closure and the voicing
onset.
VOT = voicing onset – release time
⇒ VOT can be negative!
Which is [ba], which is [pa]?
Time (s)
0 1.142
0
8000
Frequency
(Hz)
Time (s)
0 1.142
0
8000
Frequency
(Hz)
• http://clas.mq.edu.au/speech/phonetics/phonetics/airstream laryngeal/vot.html
80 / 96
81. Physical sounds Speech sounds References
Other ways of grouping
Next, let’s consider these two words: cool and school.
Did you notice that the [k] sound in these two words sound
slightly different? Different how?
81 / 96
82. Physical sounds Speech sounds References
Other ways of grouping
Next, let’s consider these two words: cool and school.
Did you notice that the [k] sound in these two words sound
slightly different? Different how?
⇒ Aspiration
Did you notice? What’s the difference between ㄅ and ㄆ ?
You can test aspiration with a piece of paper or tissue.
82 / 96
83. Physical sounds Speech sounds References
More about aspiration
Bernoulli’s Principle
Air speed ↑, air pressure ↓
The speed of flow (either fluid or air)
increases, the pressure perpendicular to the
flow decreases.
Try “raspberry!”
83 / 96
84. Physical sounds Speech sounds References
Consonant
The International Phonetic Association built an alphabetical
system to represent all the documented sounds and organized
them based on their properties.
This system is called International Phonetic Alphabet
(IPA).
84 / 96
85. Physical sounds Speech sounds References
Consonant
The International Phonetic Association built an alphabetical
system to represent all the documented sounds and organized
them based on their properties.
This system is called International Phonetic Alphabet
(IPA).
85 / 96
91. Physical sounds Speech sounds References
Time out
YOUR CONTRIBUTION
Post on NTU COOL Discussion: two jokes or commercials that play
with sounds.
91 / 96
92. Physical sounds Speech sounds References
Clicks
Remember the San video from last week? There were some
“click” sounds. But where are those sounds in the IPA chart?
92 / 96
93. Physical sounds Speech sounds References
Clicks
Remember the San video from last week? There were some
“click” sounds. But where are those sounds in the IPA chart?
93 / 96