how to make some consonants

•

1 like•250 views

Takehiro IHARA

make consonants artificially. make "pi" "ti" "ki" from "i".

Technology

On the difference
among /pi/, /ti/ and /ki/

2011/03/18
IHARA Takehiro

This slide is a brief translation of
this Japanese blog article.

The main problem is the difference
among /p/, /t/ and /k/. In this field, many old
papers says that the difference lays on time
change of the frequency of the second
formant. However, it is hard to find these
time changes. I suspect this old hypothesis.

The method for synthesis of these
consonants was explored for exploring the
difference. Then, the methods for synthesis
of /pi/, /ti/ and /ki/ were revealed (peculiar
to vowel /i/).

The problem definition is that the goal is
synthesis /pi/, /ti/ and /ki/ from vowel /i/ by
signal processing. Since this problem
argues only articulation position,
substitution to /bi/, /di/ and /gi/ is regarded
as success.

This slide presents the synthesis methods and the
opinions to the synthesized utterances by one sixties
male and one sixties female. The reason of few testees
is that I do not belong to any laboratory nor company.
The testees listened only my voice (because they do
not have vitality to listen many utterances), though I
used the corpus " "Spoken Language" and the DSR
Projects Speech Corpus (PASL-DSR)" [1] for this
research. I thank the researchers at National Institute
of Informatics Speech Resources Consortium.

The spectrogram of /i/ is shown as Fig.1,
which is processed. The sampling
frequency was 16 kHz. The maximum value
of frequency (vertical axis) is 8 kHz. The
width of time (horizontal axis) is
approximately 0.3 seconds. Red represents
large power and blue represents small
power.

The method to synthesize /ki/ is to make
narrow band colored noise at the second
formant shown by the black arrow as Fig.2.
This narrowness is important for this
method, but the center frequency of the
noise is not important.

The two testees said that this synthesized
/ki/ was more intelligible than /pi/ and /ti/
that are shown below.

The method for synthesis of /pi/ has two or
three steps. First, slide the part shown by
the black rectangle toward lower frequency.
Second, add noise at the part shown by the
blue rectangle. Although many utterance /i/
changed to /pi/ by these two steps, /i/ of my
voice did not. Third, replace the power at
the part shown by the green rectangle to
zero amplitude, which made my /i/ to /pi/.

This process for /pi/ almost agrees a
conventional hypothesis about time change
of the center frequency of the second
formant.
While the male testee said that this /pi/ does
not sound intelligible, the female testee said
that it sounds intelligible so much.

The method for synthesis of /ti/ is not clear,
because resynthesized /i/s by several
methods sounded like /ti/. A method is to
make the amplitude at the part shown by the
green rectangle to constant value and to
make the phase at this part to random value.

While the male testee said that this /ti/
sounds intelligible so much, the female
testee said that it sounds like sometimes /ti/
and sometimes /pi/.

Although I made /po/, /to/ and /ko/, the
male testee said that they sound like /o/
without consonants. This article does not
mention them. however these synthesized
utterance are distributed with the other
utterances and scripts.

The Octave scripts and the synthesized
utterances are distributed at SkyDrive. You
may use them. I permit you to modify them
and to redistribute them. I will be happy if
you develop this research, write papers, and
publish them with your name.

Reference
• [1] S. Itahashi, "Creating Speech Corpora
for Speech Science and Technology,"
IEICE Trans. Vol.E74, No.7, pp.1906-1910,
1991.

Similar to how to make some consonants

Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...kevig

Vowelsmanuelmedinavuad

Characterization of Arabic sibilant consonants IJECEIAES

LinguisticsSarah Abdussalam

Novel cochlear filter based cepstral coefficients for classification of unvoi...ijnlc

English Vowel Sounds - Phonetics and Phonologymanuelmedinavuad

Sonic localization-cues-for-classrooms-a-structural-model-proposalCemal Ardil

Analysis Synthesis ComparisonJim Webb

Writing the discussion chapter for quantitative research.pdfMartin McMorrow

DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS cscpconf

B110512inventionjournals

Phonetics and phonology Cintia Ferreyra

Why Siren sounds the way it dioes?Deepak Sharma

Real Time Drum Augmentation with Physical ModelingBen Eyes

PhoneticsUludag University

Confirmation TalkHilmi Hamzah

Phonology 1 11Mousa Mzuri

Phoneticslizdvan

Phonemesmanuelmedinavuad

Similar to how to make some consonants (20)

Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...

Vowels

Characterization of Arabic sibilant consonants

Linguistics

Novel cochlear filter based cepstral coefficients for classification of unvoi...

English Vowel Sounds - Phonetics and Phonology

Sonic localization-cues-for-classrooms-a-structural-model-proposal

Analysis Synthesis Comparison

Writing the discussion chapter for quantitative research.pdf

DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS

B110512

Phonetics and phonology

Why Siren sounds the way it dioes?

Real Time Drum Augmentation with Physical Modeling

Phonetics

Confirmation Talk

Phonology 1 11

Phonetics

Phonemes

Recently uploaded

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Understanding the FAA Part 107 License ..Christopher Logan Kennedy

Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10

Elevate Developer Efficiency & build GenAI Application with Amazon QBhuvaneswari Subramani

FWD Group - Insurer Innovation Award 2024The Digital Insurer

CNIC Information System with Pakdata Cf In Pakistandanishmna97

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Recently uploaded (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Understanding the FAA Part 107 License ..

Six Myths about Ontologies: The Basics of Formal Ontology

presentation ICT roal in 21st century education

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...

Elevate Developer Efficiency & build GenAI Application with Amazon Q

FWD Group - Insurer Innovation Award 2024

CNIC Information System with Pakdata Cf In Pakistan

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

How to Troubleshoot Apps for the Modern Connected Worker

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

how to make some consonants

1. On the difference among /pi/, /ti/ and /ki/ 2011/03/18 IHARA Takehiro

2. This slide is a brief translation of this Japanese blog article.

3. The main problem is the difference among /p/, /t/ and /k/. In this field, many old papers says that the difference lays on time change of the frequency of the second formant. However, it is hard to find these time changes. I suspect this old hypothesis.

4. The method for synthesis of these consonants was explored for exploring the difference. Then, the methods for synthesis of /pi/, /ti/ and /ki/ were revealed (peculiar to vowel /i/).

5. The problem definition is that the goal is synthesis /pi/, /ti/ and /ki/ from vowel /i/ by signal processing. Since this problem argues only articulation position, substitution to /bi/, /di/ and /gi/ is regarded as success.

6. This slide presents the synthesis methods and the opinions to the synthesized utterances by one sixties male and one sixties female. The reason of few testees is that I do not belong to any laboratory nor company. The testees listened only my voice (because they do not have vitality to listen many utterances), though I used the corpus " "Spoken Language" and the DSR Projects Speech Corpus (PASL-DSR)" [1] for this research. I thank the researchers at National Institute of Informatics Speech Resources Consortium.

7. The spectrogram of /i/ is shown as Fig.1, which is processed. The sampling frequency was 16 kHz. The maximum value of frequency (vertical axis) is 8 kHz. The width of time (horizontal axis) is approximately 0.3 seconds. Red represents large power and blue represents small power.

8. Figure 1. /i/.

9. The method to synthesize /ki/ is to make narrow band colored noise at the second formant shown by the black arrow as Fig.2. This narrowness is important for this method, but the center frequency of the noise is not important.

10. Figure 2. Synthesized /ki/.

11. The two testees said that this synthesized /ki/ was more intelligible than /pi/ and /ti/ that are shown below.

12. The method for synthesis of /pi/ has two or three steps. First, slide the part shown by the black rectangle toward lower frequency. Second, add noise at the part shown by the blue rectangle. Although many utterance /i/ changed to /pi/ by these two steps, /i/ of my voice did not. Third, replace the power at the part shown by the green rectangle to zero amplitude, which made my /i/ to /pi/.

13. Figure 3. Synthesized /pi/.

14. This process for /pi/ almost agrees a conventional hypothesis about time change of the center frequency of the second formant. While the male testee said that this /pi/ does not sound intelligible, the female testee said that it sounds intelligible so much.

15. The method for synthesis of /ti/ is not clear, because resynthesized /i/s by several methods sounded like /ti/. A method is to make the amplitude at the part shown by the green rectangle to constant value and to make the phase at this part to random value.

16. Figure 4. Synthesized /ti/.

17. While the male testee said that this /ti/ sounds intelligible so much, the female testee said that it sounds like sometimes /ti/ and sometimes /pi/.

18. Although I made /po/, /to/ and /ko/, the male testee said that they sound like /o/ without consonants. This article does not mention them. however these synthesized utterance are distributed with the other utterances and scripts.

19. The Octave scripts and the synthesized utterances are distributed at SkyDrive. You may use them. I permit you to modify them and to redistribute them. I will be happy if you develop this research, write papers, and publish them with your name.

20. Reference • [1] S. Itahashi, "Creating Speech Corpora for Speech Science and Technology," IEICE Trans. Vol.E74, No.7, pp.1906-1910, 1991.

how to make some consonants

Recommended

Recommended

More Related Content

Similar to how to make some consonants

Similar to how to make some consonants (20)

Recently uploaded

Recently uploaded (20)

how to make some consonants