SlideShare a Scribd company logo
1 of 20
Download to read offline
On the difference
among /pi/, /ti/ and /ki/


                        2011/03/18
                    IHARA Takehiro
This slide is a brief translation of
this Japanese blog article.
The main problem is the difference
among /p/, /t/ and /k/. In this field, many old
papers says that the difference lays on time
change of the frequency of the second
formant. However, it is hard to find these
time changes. I suspect this old hypothesis.
The method for synthesis of these
consonants was explored for exploring the
difference. Then, the methods for synthesis
of /pi/, /ti/ and /ki/ were revealed (peculiar
to vowel /i/).
The problem definition is that the goal is
synthesis /pi/, /ti/ and /ki/ from vowel /i/ by
signal processing. Since this problem
argues only articulation position,
substitution to /bi/, /di/ and /gi/ is regarded
as success.
This slide presents the synthesis methods and the
opinions to the synthesized utterances by one sixties
male and one sixties female. The reason of few testees
is that I do not belong to any laboratory nor company.
The testees listened only my voice (because they do
not have vitality to listen many utterances), though I
used the corpus " "Spoken Language" and the DSR
Projects Speech Corpus (PASL-DSR)" [1] for this
research. I thank the researchers at National Institute
of Informatics Speech Resources Consortium.
The spectrogram of /i/ is shown as Fig.1,
which is processed. The sampling
frequency was 16 kHz. The maximum value
of frequency (vertical axis) is 8 kHz. The
width of time (horizontal axis) is
approximately 0.3 seconds. Red represents
large power and blue represents small
power.
Figure 1. /i/.
The method to synthesize /ki/ is to make
narrow band colored noise at the second
formant shown by the black arrow as Fig.2.
This narrowness is important for this
method, but the center frequency of the
noise is not important.
Figure 2. Synthesized /ki/.
The two testees said that this synthesized
/ki/ was more intelligible than /pi/ and /ti/
that are shown below.
The method for synthesis of /pi/ has two or
three steps. First, slide the part shown by
the black rectangle toward lower frequency.
Second, add noise at the part shown by the
blue rectangle. Although many utterance /i/
changed to /pi/ by these two steps, /i/ of my
voice did not. Third, replace the power at
the part shown by the green rectangle to
zero amplitude, which made my /i/ to /pi/.
Figure 3. Synthesized /pi/.
This process for /pi/ almost agrees a
conventional hypothesis about time change
of the center frequency of the second
formant.
While the male testee said that this /pi/ does
not sound intelligible, the female testee said
that it sounds intelligible so much.
The method for synthesis of /ti/ is not clear,
because resynthesized /i/s by several
methods sounded like /ti/. A method is to
make the amplitude at the part shown by the
green rectangle to constant value and to
make the phase at this part to random value.
Figure 4. Synthesized /ti/.
While the male testee said that this /ti/
sounds intelligible so much, the female
testee said that it sounds like sometimes /ti/
and sometimes /pi/.
Although I made /po/, /to/ and /ko/, the
male testee said that they sound like /o/
without consonants. This article does not
mention them. however these synthesized
utterance are distributed with the other
utterances and scripts.
The Octave scripts and the synthesized
utterances are distributed at SkyDrive. You
may use them. I permit you to modify them
and to redistribute them. I will be happy if
you develop this research, write papers, and
publish them with your name.
Reference
• [1] S. Itahashi, "Creating Speech Corpora
  for Speech Science and Technology,"
  IEICE Trans. Vol.E74, No.7, pp.1906-1910,
  1991.

More Related Content

Similar to how to make some consonants

Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...kevig
 
Characterization of Arabic sibilant consonants
Characterization of Arabic sibilant consonants Characterization of Arabic sibilant consonants
Characterization of Arabic sibilant consonants IJECEIAES
 
Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...ijnlc
 
English Vowel Sounds - Phonetics and Phonology
English Vowel Sounds - Phonetics and PhonologyEnglish Vowel Sounds - Phonetics and Phonology
English Vowel Sounds - Phonetics and Phonologymanuelmedinavuad
 
Sonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposalSonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposalCemal Ardil
 
Analysis Synthesis Comparison
Analysis Synthesis ComparisonAnalysis Synthesis Comparison
Analysis Synthesis ComparisonJim Webb
 
Writing the discussion chapter for quantitative research.pdf
Writing the discussion chapter for quantitative research.pdfWriting the discussion chapter for quantitative research.pdf
Writing the discussion chapter for quantitative research.pdfMartin McMorrow
 
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS cscpconf
 
Phonetics and phonology
Phonetics and phonology   Phonetics and phonology
Phonetics and phonology Cintia Ferreyra
 
Why Siren sounds the way it dioes?
Why Siren sounds the way it dioes?Why Siren sounds the way it dioes?
Why Siren sounds the way it dioes?Deepak Sharma
 
Real Time Drum Augmentation with Physical Modeling
Real Time Drum Augmentation with Physical ModelingReal Time Drum Augmentation with Physical Modeling
Real Time Drum Augmentation with Physical ModelingBen Eyes
 
Phonetics
PhoneticsPhonetics
Phoneticslizdvan
 

Similar to how to make some consonants (20)

Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
 
Vowels
VowelsVowels
Vowels
 
Characterization of Arabic sibilant consonants
Characterization of Arabic sibilant consonants Characterization of Arabic sibilant consonants
Characterization of Arabic sibilant consonants
 
Linguistics
LinguisticsLinguistics
Linguistics
 
Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...
 
English Vowel Sounds - Phonetics and Phonology
English Vowel Sounds - Phonetics and PhonologyEnglish Vowel Sounds - Phonetics and Phonology
English Vowel Sounds - Phonetics and Phonology
 
Sonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposalSonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposal
 
Analysis Synthesis Comparison
Analysis Synthesis ComparisonAnalysis Synthesis Comparison
Analysis Synthesis Comparison
 
Writing the discussion chapter for quantitative research.pdf
Writing the discussion chapter for quantitative research.pdfWriting the discussion chapter for quantitative research.pdf
Writing the discussion chapter for quantitative research.pdf
 
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
 
B110512
B110512B110512
B110512
 
Phonetics and phonology
Phonetics and phonology   Phonetics and phonology
Phonetics and phonology
 
Why Siren sounds the way it dioes?
Why Siren sounds the way it dioes?Why Siren sounds the way it dioes?
Why Siren sounds the way it dioes?
 
Real Time Drum Augmentation with Physical Modeling
Real Time Drum Augmentation with Physical ModelingReal Time Drum Augmentation with Physical Modeling
Real Time Drum Augmentation with Physical Modeling
 
Phonetics
PhoneticsPhonetics
Phonetics
 
Confirmation Talk
Confirmation TalkConfirmation Talk
Confirmation Talk
 
Confirmation Talk
Confirmation TalkConfirmation Talk
Confirmation Talk
 
Phonology 1 11
Phonology 1 11Phonology 1 11
Phonology 1 11
 
Phonetics
PhoneticsPhonetics
Phonetics
 
Phonemes
PhonemesPhonemes
Phonemes
 

Recently uploaded

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 

Recently uploaded (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

how to make some consonants

  • 1. On the difference among /pi/, /ti/ and /ki/ 2011/03/18 IHARA Takehiro
  • 2. This slide is a brief translation of this Japanese blog article.
  • 3. The main problem is the difference among /p/, /t/ and /k/. In this field, many old papers says that the difference lays on time change of the frequency of the second formant. However, it is hard to find these time changes. I suspect this old hypothesis.
  • 4. The method for synthesis of these consonants was explored for exploring the difference. Then, the methods for synthesis of /pi/, /ti/ and /ki/ were revealed (peculiar to vowel /i/).
  • 5. The problem definition is that the goal is synthesis /pi/, /ti/ and /ki/ from vowel /i/ by signal processing. Since this problem argues only articulation position, substitution to /bi/, /di/ and /gi/ is regarded as success.
  • 6. This slide presents the synthesis methods and the opinions to the synthesized utterances by one sixties male and one sixties female. The reason of few testees is that I do not belong to any laboratory nor company. The testees listened only my voice (because they do not have vitality to listen many utterances), though I used the corpus " "Spoken Language" and the DSR Projects Speech Corpus (PASL-DSR)" [1] for this research. I thank the researchers at National Institute of Informatics Speech Resources Consortium.
  • 7. The spectrogram of /i/ is shown as Fig.1, which is processed. The sampling frequency was 16 kHz. The maximum value of frequency (vertical axis) is 8 kHz. The width of time (horizontal axis) is approximately 0.3 seconds. Red represents large power and blue represents small power.
  • 9. The method to synthesize /ki/ is to make narrow band colored noise at the second formant shown by the black arrow as Fig.2. This narrowness is important for this method, but the center frequency of the noise is not important.
  • 11. The two testees said that this synthesized /ki/ was more intelligible than /pi/ and /ti/ that are shown below.
  • 12. The method for synthesis of /pi/ has two or three steps. First, slide the part shown by the black rectangle toward lower frequency. Second, add noise at the part shown by the blue rectangle. Although many utterance /i/ changed to /pi/ by these two steps, /i/ of my voice did not. Third, replace the power at the part shown by the green rectangle to zero amplitude, which made my /i/ to /pi/.
  • 14. This process for /pi/ almost agrees a conventional hypothesis about time change of the center frequency of the second formant. While the male testee said that this /pi/ does not sound intelligible, the female testee said that it sounds intelligible so much.
  • 15. The method for synthesis of /ti/ is not clear, because resynthesized /i/s by several methods sounded like /ti/. A method is to make the amplitude at the part shown by the green rectangle to constant value and to make the phase at this part to random value.
  • 17. While the male testee said that this /ti/ sounds intelligible so much, the female testee said that it sounds like sometimes /ti/ and sometimes /pi/.
  • 18. Although I made /po/, /to/ and /ko/, the male testee said that they sound like /o/ without consonants. This article does not mention them. however these synthesized utterance are distributed with the other utterances and scripts.
  • 19. The Octave scripts and the synthesized utterances are distributed at SkyDrive. You may use them. I permit you to modify them and to redistribute them. I will be happy if you develop this research, write papers, and publish them with your name.
  • 20. Reference • [1] S. Itahashi, "Creating Speech Corpora for Speech Science and Technology," IEICE Trans. Vol.E74, No.7, pp.1906-1910, 1991.