SlideShare a Scribd company logo
1 of 3
Download to read offline
International Journal of Science and Research (IJSR) 
ISSN (Online): 2319-7064 
Impact Factor (2012): 3.358 
Imitation of the Voice – An Acoustic Study 
Babi Duli 
Ph.D Research Scholar, Dept. of Linguistics and Phonetics, School of Phonetics and Spoken English 
EFL University, Hyderabad – 500605 Email:- bobbyciefl@gmail.com Ph: +91 9494006604 
Abstract: The present study presented here is an imitation of a voice sample imitated the south Indian film star Mahesh Babu by a 
professional imitator/mimicry artist in a TV show. The imitator tried to imitate the dialogue of the hero from his block buster hit movie 
titled Dookudu ([du:kuu). The aim of the study is to investigate how closely the imitations matched in the selected acoustic parameters 
of the original voice. It was found that he was able to imitate dialogue very closely, but timing at the segmental level showed little change 
in the direction of the targets. Mean formant frequency did not match the target voice. It is too distant in the timing, mean of formant 
frequencies and in pitch 
Keywords: imitation, fundamental frequency, formant frequency, pitch, timing, forensic, phoneticians, culprits, acoustics, YouTube 
1. Introduction 
Imitation is an act of attempting others voice which is 
unique by birth. It can be done either with the knowledge of 
the actual person or in his/her absence. If it is done in his/her 
presence, chances are very limited to make the actual person 
as culprit or to indulge in any of the immoral activities, but 
in absence it will go to any extent( either entertain or not). If 
it entertains, the imitator gets the awards, if not; only the 
original speaker can get the repression. But according to law 
100 culprits can escape from the punishment but one good 
person should not be caught in injustice. Here the forensic 
experts trained a lot especially to trace out the culprits and 
save the innocent. Even phoneticians also trained in this 
connection to connect to the culprits’ voices from a 
collection of compendious voices to do the justice to the 
common compatriot. 
2. Earlier research review 
Only a handful of studies exist in this area, and to our 
knowledge only two that deal with acoustic parameters in 
some detail. Bessler [2] has studied a caricatured 
impersonation of de Gaulle. The most relevant finding with 
respect to the present study was the fact that the 
impersonator exaggerated both mean fundamental frequency 
level and range. Stressed syllable durations were also 
exaggerated. The impersonation was primarily meant for 
entertainment purposes and not accuracy of imitation. The 
generalisability of the results may therefore be questioned. 
In the other study [3], vowel formant frequencies and 
fundamental frequencies in imitations were compared with 
the corresponding values for the original voices. Although 
the imitators managed to change their formant and 
fundamental frequencies in the direction of the target values 
“they were not able to adapt these parameters to match or 
even be similar to those of the imitated persons.” [3, p. 
1842] 
3. Methodology 
The target voice sample is collected from the YouTube from 
a popular Telugu comedy show in E-TV titled Jabardasth. 
The source voice sample is collected from the super hit 
movie titled Dookudu. The video files are converted into 
.wav files by using ‘video pad’ and ‘wave pad’. Praat 
software is used for segmenting the voice samples and for 
analyzing. 
4. Acoustic Analysis 
The speech materials are digitized and used for fundamental 
frequency level and pitch level for the acoustic analyses. All 
the speech files are labelled at word level. The mean of the 
Fundamental frequency, formant frequencies and the mean 
of the Pitch are then computed. First three formant 
frequencies are calculated for all the words which are 
presented in the material. All tokens are numbered 
consecutively to ensure that comparisons between tokens in 
the original, and imitation of the voice were made between 
in identical contexts. 
5. Results 
5.1 Timing 
Imitator reached for maintain the closest timing at the initial 
word in the continuous speech but he failed after that. 
Initially the difference between imitator and Mahesh is a 
very close but after it is moved to the falling stage and 
acquired negative results. This is one of the key factors for 
identifying the culprit. It is proved that voice is unique in 
nature. The following table1 will give the clarity regarding 
the timing of the voice samples. 
Table1: Total timing of the utterance 
Token I – Total (in Ms*) M- Total 
Volume 3 Issue 10, October 2014 
www.ijsr.net 
Licensed Under Creative Commons Attribution CC BY 
(in MS*) 
Dif 
/baja:nike/ 595 567 28 
/mi:ni/ 319 431 -112 
/telijani/ 232 312 -80 
/blarana:di/ 686 804 -118 
*Millisecons I stands for imitator voice M stands for 
Mahesh voice Dif stands for difference of the voice 
A more detailed analysis of the timing showed on a line 
diagram. At the initial stage it is very close to the original 
voice but later the imitator’s voice reached to the falling 
stage and original voice reached to the raising stage. This is 
clearly seen in the following Figure 1 
Paper ID: OCT14474 1176
International Journal of Science and Research (IJSR) 
ISSN (Online): 2319-7064 
Impact Factor (2012): 3.358 
Figure 1: Timing of the utterance 
5.2 Formant frequency 
The second best way of examining voice samples is that get 
the mean of the formant frequency. In F1, F2, F3 only 
Mahesh has the dominating voice than the imitator. Imitator 
failed to reach the original voice samples. He attempted but 
failed due to less preparation. Even this is not possible to 
anybody to imitate the original voice samples alike because 
as it stated above voice is unique in nature. By examining 
the mean of the formant frequencies also forensic experts 
and phoneticians identify the culprits or voices. The 
following table2 explains about the mean of the formant 
frequency 
Table 2 : Mean of the formant frequencies 
Token M-F1 I-F1 Dif M-F2 I-F2 Dif M-F3 I-F3 Dif 
/baja:nike/ 718.39 564.51 153.88 1704.26 1603.07 101.19 2535.72 2449.05 86.67 
/mi:ni/ 389.56 376.47 13.09 2006.89 1788.94 217.95 2590.42 2357 233.42 
/telijani/ 562.91 471.42 91.49 1882.86 1697.43 185.43 2575.49 2362.59 212.9 
/blarana:di/ 672.87 582.28 90.59 1673.09 1614.60 58.49 2712.80 2549.11 163.69 
F stands for frequency 
For the table 2 the following bar chart (figure2) represents 
the results of the mean of the formant frequencies. 
Figure2: Mean of the formant frequency 
5.3. Pitch 
In any voice pitch plays a crucial role. The following table3 
is clearly showing that the imitator has the pitch in between 
148Hz to 160 Hz but Mahesh has 100Hz to 125Hz. It means 
where Mahesh pitch levels end, there the imitators pitch 
levels starts. Based on this assumption, Pitch plays a very 
crucial role in any individual’s voice. 
Table3: Pitch of the voice 
Token I-Pitch M-pitch Dif 
/baja:nike/ 156.07 121.78 34.29 
/mi:ni/ 158.71 123.25 35.46 
/telijani/ 152.78 109.38 43.4 
/blarana:di/ 148.67 100.13 48.54 
Imitator has the high pitch, but Mahesh has the low Pitch. Of 
course the pitch levels are merely close when it is to a naked 
ear, but they are too distant when it is acoustically analyzed. 
The following line diagram (Figure3) represents how the 
voices are distant. 
Figure3: Pitch of the voice 
6. Conclusion 
Based on the present study the author found that the voice is 
unique in nature. It is highly impossible to copy. Though it 
is very close to the original voice to a naked ear but when it 
is analyzed acoustically, then there are so many factors 
reveal that no two voices are alike. It is proved based on this 
study. It is too distant in the timing, mean of the formant 
frequencies, and the pitch. There are other voices and 
imitations also to work on like (the famous actors/artists in 
the Tollywood) NTR (Nandamuri Taraka Ramarao) ANR 
(Akkineni Nageswara Rao), Krishna, Chiranjeevi, etc., 
References 
[1] Anders Eriksson and Pär Wretling. How flexible is the 
human voice? – A case study of mimicry Umeå 
University, S-901 87 Umeå, Sweden. 
[2] Bessler, P. (1991) La caricature de de Gaulle par Tissot: 
Étude phonostylistique. In: Information/Communication, 
12, 19–32. Canadian Scholars’ Press. 
[3] Endres, W., W. Bambach & G. Flösser. (1971) Voice 
spectrograms as a function of age, voice disguise, and 
voice imitation. J. Acoust. Soc. Am., 49, 1842–1848. 
Volume 3 Issue 10, October 2014 
www.ijsr.net 
Paper ID: OCT14474 1177 
Licensed Under Creative Commons Attribution CC BY
International Journal of Science and Research (IJSR) 
ISSN (Online): 2319-7064 
Impact Factor (2012): 3.358 
[4] Philip Rose.( 2002) Forensic Speaker Identification: 
[5] Traunmüller, H. & A. Eriksson. (1995) The perceptual 
evaluation of F0 excursions in speech as evidenced in 
liveliness estimations. J. Acoust. Soc. Am., 97, 1905– 
1915. 
Author Profile 
Mr. Babi Duli, a Ph.D research scholar at the EFL 
University, from the Department of Linguistics and 
Phonetics, School of Phonetics and Spoken English, 
continuing his research in the area of Forensic 
Phonetics. He had a master degree from Adikavi 
Nannaya University, Rajahmundry. He was having 
experience in teaching Pronunciation. He worked for several 
prestigious institutions, and trained many students in the area of 
pronunciation. 
Volume 3 Issue 10, October 2014 
www.ijsr.net 
Taylor And Fransic Series. London 
Paper ID: OCT14474 1178 
Licensed Under Creative Commons Attribution CC BY

More Related Content

What's hot

Upgrading the Performance of Speech Emotion Recognition at the Segmental Level
Upgrading the Performance of Speech Emotion Recognition at the Segmental Level Upgrading the Performance of Speech Emotion Recognition at the Segmental Level
Upgrading the Performance of Speech Emotion Recognition at the Segmental Level IOSR Journals
 
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...acijjournal
 
High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...
High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...
High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...IJECEIAES
 
Forensic phonetics[1]
Forensic phonetics[1]Forensic phonetics[1]
Forensic phonetics[1]PAHELI SHARMA
 
Speaker identification based on temporal parameters
Speaker identification based on temporal parametersSpeaker identification based on temporal parameters
Speaker identification based on temporal parametersAlexandria University
 
pronunciation research article
pronunciation research articlepronunciation research article
pronunciation research articlecamilopico90
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresSivaranjan Goswami
 
10.1121@1.4904542
10.1121@1.490454210.1121@1.4904542
10.1121@1.4904542Edwin Silva
 
voice recognition
voice recognition voice recognition
voice recognition Hemant Jain
 
Development of text to speech system for yoruba language
Development of text to speech system for yoruba languageDevelopment of text to speech system for yoruba language
Development of text to speech system for yoruba languageAlexander Decker
 
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015Philip Desenne
 
Artificially enhancing better-ear glimpsing cues to improve understanding of ...
Artificially enhancing better-ear glimpsing cues to improve understanding of ...Artificially enhancing better-ear glimpsing cues to improve understanding of ...
Artificially enhancing better-ear glimpsing cues to improve understanding of ...HEARnet _
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...sipij
 

What's hot (16)

F010334548
F010334548F010334548
F010334548
 
Upgrading the Performance of Speech Emotion Recognition at the Segmental Level
Upgrading the Performance of Speech Emotion Recognition at the Segmental Level Upgrading the Performance of Speech Emotion Recognition at the Segmental Level
Upgrading the Performance of Speech Emotion Recognition at the Segmental Level
 
Voice
VoiceVoice
Voice
 
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
 
High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...
High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...
High Level Speaker Specific Features as an Efficiency Enhancing Parameters in...
 
Forensic phonetics[1]
Forensic phonetics[1]Forensic phonetics[1]
Forensic phonetics[1]
 
Speaker identification based on temporal parameters
Speaker identification based on temporal parametersSpeaker identification based on temporal parameters
Speaker identification based on temporal parameters
 
pronunciation research article
pronunciation research articlepronunciation research article
pronunciation research article
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
 
10.1121@1.4904542
10.1121@1.490454210.1121@1.4904542
10.1121@1.4904542
 
voice recognition
voice recognition voice recognition
voice recognition
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
 
Development of text to speech system for yoruba language
Development of text to speech system for yoruba languageDevelopment of text to speech system for yoruba language
Development of text to speech system for yoruba language
 
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015
 
Artificially enhancing better-ear glimpsing cues to improve understanding of ...
Artificially enhancing better-ear glimpsing cues to improve understanding of ...Artificially enhancing better-ear glimpsing cues to improve understanding of ...
Artificially enhancing better-ear glimpsing cues to improve understanding of ...
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
 

Viewers also liked

Viewers also liked (19)

M dgx mde0mdm=
M dgx mde0mdm=M dgx mde0mdm=
M dgx mde0mdm=
 
Hospitality Business Vs Environmental Sustainability: A Study of Soarak Hotel...
Hospitality Business Vs Environmental Sustainability: A Study of Soarak Hotel...Hospitality Business Vs Environmental Sustainability: A Study of Soarak Hotel...
Hospitality Business Vs Environmental Sustainability: A Study of Soarak Hotel...
 
Tourism a Viable Path for Wealth Creation in Nigeria: An Analysis of Awka Met...
Tourism a Viable Path for Wealth Creation in Nigeria: An Analysis of Awka Met...Tourism a Viable Path for Wealth Creation in Nigeria: An Analysis of Awka Met...
Tourism a Viable Path for Wealth Creation in Nigeria: An Analysis of Awka Met...
 
Evaluation of Efficacy of Lemon Juice Extract (Citrius Lemoni Risso) on Wound...
Evaluation of Efficacy of Lemon Juice Extract (Citrius Lemoni Risso) on Wound...Evaluation of Efficacy of Lemon Juice Extract (Citrius Lemoni Risso) on Wound...
Evaluation of Efficacy of Lemon Juice Extract (Citrius Lemoni Risso) on Wound...
 
International Journal of Science and Research (IJSR)
International Journal of Science and Research (IJSR)International Journal of Science and Research (IJSR)
International Journal of Science and Research (IJSR)
 
Identifying the Preferences and Brand Choices of Female Customers for Cars: A...
Identifying the Preferences and Brand Choices of Female Customers for Cars: A...Identifying the Preferences and Brand Choices of Female Customers for Cars: A...
Identifying the Preferences and Brand Choices of Female Customers for Cars: A...
 
T0 numtq0nq==
T0 numtq0nq==T0 numtq0nq==
T0 numtq0nq==
 
T0 numtq0o dc=
T0 numtq0o dc=T0 numtq0o dc=
T0 numtq0o dc=
 
T0 numtq0o dk=
T0 numtq0o dk=T0 numtq0o dk=
T0 numtq0o dk=
 
T0 numtq0nju=
T0 numtq0nju=T0 numtq0nju=
T0 numtq0nju=
 
Md iw mtqxmdm4
Md iw mtqxmdm4Md iw mtqxmdm4
Md iw mtqxmdm4
 
T0 numtq0n tg=
T0 numtq0n tg=T0 numtq0n tg=
T0 numtq0n tg=
 
T0 numtq0nzu=
T0 numtq0nzu=T0 numtq0nzu=
T0 numtq0nzu=
 
Air Quality Monitoring Using Model: A Review
Air Quality Monitoring Using Model: A ReviewAir Quality Monitoring Using Model: A Review
Air Quality Monitoring Using Model: A Review
 
T0 numtq0nji=
T0 numtq0nji=T0 numtq0nji=
T0 numtq0nji=
 
T0 numtq0njm=
T0 numtq0njm=T0 numtq0njm=
T0 numtq0njm=
 
Effects of Anthropogenic Activities on Distribution and Abundance of the Epip...
Effects of Anthropogenic Activities on Distribution and Abundance of the Epip...Effects of Anthropogenic Activities on Distribution and Abundance of the Epip...
Effects of Anthropogenic Activities on Distribution and Abundance of the Epip...
 
Phenotypic Diversity for Qualitative Characters of Barley (Hordeum Vulgare (L...
Phenotypic Diversity for Qualitative Characters of Barley (Hordeum Vulgare (L...Phenotypic Diversity for Qualitative Characters of Barley (Hordeum Vulgare (L...
Phenotypic Diversity for Qualitative Characters of Barley (Hordeum Vulgare (L...
 
A Study on Stress Management among Employees at Sakthi Finance Limited, Coimb...
A Study on Stress Management among Employees at Sakthi Finance Limited, Coimb...A Study on Stress Management among Employees at Sakthi Finance Limited, Coimb...
A Study on Stress Management among Employees at Sakthi Finance Limited, Coimb...
 

Similar to T0 numtq0nzq=

International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
STUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAIN
STUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAINSTUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAIN
STUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAINcscpconf
 
Study of acoustic properties of nasal and nonnasal vowels in temporal domain
Study of acoustic properties of nasal and nonnasal vowels in temporal domainStudy of acoustic properties of nasal and nonnasal vowels in temporal domain
Study of acoustic properties of nasal and nonnasal vowels in temporal domaincsandit
 
S A M P L E P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docx
S A M P L E  P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docxS A M P L E  P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docx
S A M P L E P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docxagnesdcarey33086
 
Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...ijnlc
 
An Introduction To Speech Sciences (Acoustic Analysis Of Speech)
An Introduction To Speech Sciences (Acoustic Analysis Of Speech)An Introduction To Speech Sciences (Acoustic Analysis Of Speech)
An Introduction To Speech Sciences (Acoustic Analysis Of Speech)Jeff Nelson
 
Coad_M_GoForth2Column
Coad_M_GoForth2ColumnCoad_M_GoForth2Column
Coad_M_GoForth2ColumnMargaret Coad
 
Sonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposalSonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposalCemal Ardil
 
Effects Of Extended Bandwidth
Effects Of Extended BandwidthEffects Of Extended Bandwidth
Effects Of Extended BandwidthGeoffrey Cooling
 
Central Auditory Function: Willeford Test Battery
Central Auditory Function: Willeford Test BatteryCentral Auditory Function: Willeford Test Battery
Central Auditory Function: Willeford Test Batteryalexandracostlow
 
Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...eSAT Journals
 
Acoustic phonetics
Acoustic phoneticsAcoustic phonetics
Acoustic phoneticsVivaAs
 
Effects of age and working memory in understanding speech in conversational n...
Effects of age and working memory in understanding speech in conversational n...Effects of age and working memory in understanding speech in conversational n...
Effects of age and working memory in understanding speech in conversational n...HEARnet _
 
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...kevig
 

Similar to T0 numtq0nzq= (20)

ESSAY2
ESSAY2ESSAY2
ESSAY2
 
Stuttering its natureandmanagement
Stuttering its natureandmanagementStuttering its natureandmanagement
Stuttering its natureandmanagement
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
STUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAIN
STUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAINSTUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAIN
STUDY OF ACOUSTIC PROPERTIES OF NASAL AND NONNASAL VOWELS IN TEMPORAL DOMAIN
 
Study of acoustic properties of nasal and nonnasal vowels in temporal domain
Study of acoustic properties of nasal and nonnasal vowels in temporal domainStudy of acoustic properties of nasal and nonnasal vowels in temporal domain
Study of acoustic properties of nasal and nonnasal vowels in temporal domain
 
S A M P L E P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docx
S A M P L E  P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docxS A M P L E  P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docx
S A M P L E P A P E R S54INHIBITORY INFLUENCES ON ASYCHRO.docx
 
Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...Novel cochlear filter based cepstral coefficients for classification of unvoi...
Novel cochlear filter based cepstral coefficients for classification of unvoi...
 
An Introduction To Speech Sciences (Acoustic Analysis Of Speech)
An Introduction To Speech Sciences (Acoustic Analysis Of Speech)An Introduction To Speech Sciences (Acoustic Analysis Of Speech)
An Introduction To Speech Sciences (Acoustic Analysis Of Speech)
 
Coad_M_GoForth2Column
Coad_M_GoForth2ColumnCoad_M_GoForth2Column
Coad_M_GoForth2Column
 
Sonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposalSonic localization-cues-for-classrooms-a-structural-model-proposal
Sonic localization-cues-for-classrooms-a-structural-model-proposal
 
Effects Of Extended Bandwidth
Effects Of Extended BandwidthEffects Of Extended Bandwidth
Effects Of Extended Bandwidth
 
B110512
B110512B110512
B110512
 
Central Auditory Function: Willeford Test Battery
Central Auditory Function: Willeford Test BatteryCentral Auditory Function: Willeford Test Battery
Central Auditory Function: Willeford Test Battery
 
Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...
 
Acoustic phonetics
Acoustic phoneticsAcoustic phonetics
Acoustic phonetics
 
Xu
XuXu
Xu
 
Ijeer journal
Ijeer journalIjeer journal
Ijeer journal
 
Effects of age and working memory in understanding speech in conversational n...
Effects of age and working memory in understanding speech in conversational n...Effects of age and working memory in understanding speech in conversational n...
Effects of age and working memory in understanding speech in conversational n...
 
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
Investigation of the Effect of Obstacle Placed Near the Human Glottis on the ...
 

More from International Journal of Science and Research (IJSR)

More from International Journal of Science and Research (IJSR) (20)

Innovations in the Diagnosis and Treatment of Chronic Heart Failure
Innovations in the Diagnosis and Treatment of Chronic Heart FailureInnovations in the Diagnosis and Treatment of Chronic Heart Failure
Innovations in the Diagnosis and Treatment of Chronic Heart Failure
 
Design and implementation of carrier based sinusoidal pwm (bipolar) inverter
Design and implementation of carrier based sinusoidal pwm (bipolar) inverterDesign and implementation of carrier based sinusoidal pwm (bipolar) inverter
Design and implementation of carrier based sinusoidal pwm (bipolar) inverter
 
Polarization effect of antireflection coating for soi material system
Polarization effect of antireflection coating for soi material systemPolarization effect of antireflection coating for soi material system
Polarization effect of antireflection coating for soi material system
 
Image resolution enhancement via multi surface fitting
Image resolution enhancement via multi surface fittingImage resolution enhancement via multi surface fitting
Image resolution enhancement via multi surface fitting
 
Ad hoc networks technical issues on radio links security & qo s
Ad hoc networks technical issues on radio links security & qo sAd hoc networks technical issues on radio links security & qo s
Ad hoc networks technical issues on radio links security & qo s
 
Microstructure analysis of the carbon nano tubes aluminum composite with diff...
Microstructure analysis of the carbon nano tubes aluminum composite with diff...Microstructure analysis of the carbon nano tubes aluminum composite with diff...
Microstructure analysis of the carbon nano tubes aluminum composite with diff...
 
Improving the life of lm13 using stainless spray ii coating for engine applic...
Improving the life of lm13 using stainless spray ii coating for engine applic...Improving the life of lm13 using stainless spray ii coating for engine applic...
Improving the life of lm13 using stainless spray ii coating for engine applic...
 
An overview on development of aluminium metal matrix composites with hybrid r...
An overview on development of aluminium metal matrix composites with hybrid r...An overview on development of aluminium metal matrix composites with hybrid r...
An overview on development of aluminium metal matrix composites with hybrid r...
 
Pesticide mineralization in water using silver nanoparticles incorporated on ...
Pesticide mineralization in water using silver nanoparticles incorporated on ...Pesticide mineralization in water using silver nanoparticles incorporated on ...
Pesticide mineralization in water using silver nanoparticles incorporated on ...
 
Comparative study on computers operated by eyes and brain
Comparative study on computers operated by eyes and brainComparative study on computers operated by eyes and brain
Comparative study on computers operated by eyes and brain
 
T s eliot and the concept of literary tradition and the importance of allusions
T s eliot and the concept of literary tradition and the importance of allusionsT s eliot and the concept of literary tradition and the importance of allusions
T s eliot and the concept of literary tradition and the importance of allusions
 
Effect of select yogasanas and pranayama practices on selected physiological ...
Effect of select yogasanas and pranayama practices on selected physiological ...Effect of select yogasanas and pranayama practices on selected physiological ...
Effect of select yogasanas and pranayama practices on selected physiological ...
 
Grid computing for load balancing strategies
Grid computing for load balancing strategiesGrid computing for load balancing strategies
Grid computing for load balancing strategies
 
A new algorithm to improve the sharing of bandwidth
A new algorithm to improve the sharing of bandwidthA new algorithm to improve the sharing of bandwidth
A new algorithm to improve the sharing of bandwidth
 
Main physical causes of climate change and global warming a general overview
Main physical causes of climate change and global warming   a general overviewMain physical causes of climate change and global warming   a general overview
Main physical causes of climate change and global warming a general overview
 
Performance assessment of control loops
Performance assessment of control loopsPerformance assessment of control loops
Performance assessment of control loops
 
Capital market in bangladesh an overview
Capital market in bangladesh an overviewCapital market in bangladesh an overview
Capital market in bangladesh an overview
 
Faster and resourceful multi core web crawling
Faster and resourceful multi core web crawlingFaster and resourceful multi core web crawling
Faster and resourceful multi core web crawling
 
Extended fuzzy c means clustering algorithm in segmentation of noisy images
Extended fuzzy c means clustering algorithm in segmentation of noisy imagesExtended fuzzy c means clustering algorithm in segmentation of noisy images
Extended fuzzy c means clustering algorithm in segmentation of noisy images
 
Parallel generators of pseudo random numbers with control of calculation errors
Parallel generators of pseudo random numbers with control of calculation errorsParallel generators of pseudo random numbers with control of calculation errors
Parallel generators of pseudo random numbers with control of calculation errors
 

T0 numtq0nzq=

  • 1. International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Impact Factor (2012): 3.358 Imitation of the Voice – An Acoustic Study Babi Duli Ph.D Research Scholar, Dept. of Linguistics and Phonetics, School of Phonetics and Spoken English EFL University, Hyderabad – 500605 Email:- bobbyciefl@gmail.com Ph: +91 9494006604 Abstract: The present study presented here is an imitation of a voice sample imitated the south Indian film star Mahesh Babu by a professional imitator/mimicry artist in a TV show. The imitator tried to imitate the dialogue of the hero from his block buster hit movie titled Dookudu ([du:kuu). The aim of the study is to investigate how closely the imitations matched in the selected acoustic parameters of the original voice. It was found that he was able to imitate dialogue very closely, but timing at the segmental level showed little change in the direction of the targets. Mean formant frequency did not match the target voice. It is too distant in the timing, mean of formant frequencies and in pitch Keywords: imitation, fundamental frequency, formant frequency, pitch, timing, forensic, phoneticians, culprits, acoustics, YouTube 1. Introduction Imitation is an act of attempting others voice which is unique by birth. It can be done either with the knowledge of the actual person or in his/her absence. If it is done in his/her presence, chances are very limited to make the actual person as culprit or to indulge in any of the immoral activities, but in absence it will go to any extent( either entertain or not). If it entertains, the imitator gets the awards, if not; only the original speaker can get the repression. But according to law 100 culprits can escape from the punishment but one good person should not be caught in injustice. Here the forensic experts trained a lot especially to trace out the culprits and save the innocent. Even phoneticians also trained in this connection to connect to the culprits’ voices from a collection of compendious voices to do the justice to the common compatriot. 2. Earlier research review Only a handful of studies exist in this area, and to our knowledge only two that deal with acoustic parameters in some detail. Bessler [2] has studied a caricatured impersonation of de Gaulle. The most relevant finding with respect to the present study was the fact that the impersonator exaggerated both mean fundamental frequency level and range. Stressed syllable durations were also exaggerated. The impersonation was primarily meant for entertainment purposes and not accuracy of imitation. The generalisability of the results may therefore be questioned. In the other study [3], vowel formant frequencies and fundamental frequencies in imitations were compared with the corresponding values for the original voices. Although the imitators managed to change their formant and fundamental frequencies in the direction of the target values “they were not able to adapt these parameters to match or even be similar to those of the imitated persons.” [3, p. 1842] 3. Methodology The target voice sample is collected from the YouTube from a popular Telugu comedy show in E-TV titled Jabardasth. The source voice sample is collected from the super hit movie titled Dookudu. The video files are converted into .wav files by using ‘video pad’ and ‘wave pad’. Praat software is used for segmenting the voice samples and for analyzing. 4. Acoustic Analysis The speech materials are digitized and used for fundamental frequency level and pitch level for the acoustic analyses. All the speech files are labelled at word level. The mean of the Fundamental frequency, formant frequencies and the mean of the Pitch are then computed. First three formant frequencies are calculated for all the words which are presented in the material. All tokens are numbered consecutively to ensure that comparisons between tokens in the original, and imitation of the voice were made between in identical contexts. 5. Results 5.1 Timing Imitator reached for maintain the closest timing at the initial word in the continuous speech but he failed after that. Initially the difference between imitator and Mahesh is a very close but after it is moved to the falling stage and acquired negative results. This is one of the key factors for identifying the culprit. It is proved that voice is unique in nature. The following table1 will give the clarity regarding the timing of the voice samples. Table1: Total timing of the utterance Token I – Total (in Ms*) M- Total Volume 3 Issue 10, October 2014 www.ijsr.net Licensed Under Creative Commons Attribution CC BY (in MS*) Dif /baja:nike/ 595 567 28 /mi:ni/ 319 431 -112 /telijani/ 232 312 -80 /blarana:di/ 686 804 -118 *Millisecons I stands for imitator voice M stands for Mahesh voice Dif stands for difference of the voice A more detailed analysis of the timing showed on a line diagram. At the initial stage it is very close to the original voice but later the imitator’s voice reached to the falling stage and original voice reached to the raising stage. This is clearly seen in the following Figure 1 Paper ID: OCT14474 1176
  • 2. International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Impact Factor (2012): 3.358 Figure 1: Timing of the utterance 5.2 Formant frequency The second best way of examining voice samples is that get the mean of the formant frequency. In F1, F2, F3 only Mahesh has the dominating voice than the imitator. Imitator failed to reach the original voice samples. He attempted but failed due to less preparation. Even this is not possible to anybody to imitate the original voice samples alike because as it stated above voice is unique in nature. By examining the mean of the formant frequencies also forensic experts and phoneticians identify the culprits or voices. The following table2 explains about the mean of the formant frequency Table 2 : Mean of the formant frequencies Token M-F1 I-F1 Dif M-F2 I-F2 Dif M-F3 I-F3 Dif /baja:nike/ 718.39 564.51 153.88 1704.26 1603.07 101.19 2535.72 2449.05 86.67 /mi:ni/ 389.56 376.47 13.09 2006.89 1788.94 217.95 2590.42 2357 233.42 /telijani/ 562.91 471.42 91.49 1882.86 1697.43 185.43 2575.49 2362.59 212.9 /blarana:di/ 672.87 582.28 90.59 1673.09 1614.60 58.49 2712.80 2549.11 163.69 F stands for frequency For the table 2 the following bar chart (figure2) represents the results of the mean of the formant frequencies. Figure2: Mean of the formant frequency 5.3. Pitch In any voice pitch plays a crucial role. The following table3 is clearly showing that the imitator has the pitch in between 148Hz to 160 Hz but Mahesh has 100Hz to 125Hz. It means where Mahesh pitch levels end, there the imitators pitch levels starts. Based on this assumption, Pitch plays a very crucial role in any individual’s voice. Table3: Pitch of the voice Token I-Pitch M-pitch Dif /baja:nike/ 156.07 121.78 34.29 /mi:ni/ 158.71 123.25 35.46 /telijani/ 152.78 109.38 43.4 /blarana:di/ 148.67 100.13 48.54 Imitator has the high pitch, but Mahesh has the low Pitch. Of course the pitch levels are merely close when it is to a naked ear, but they are too distant when it is acoustically analyzed. The following line diagram (Figure3) represents how the voices are distant. Figure3: Pitch of the voice 6. Conclusion Based on the present study the author found that the voice is unique in nature. It is highly impossible to copy. Though it is very close to the original voice to a naked ear but when it is analyzed acoustically, then there are so many factors reveal that no two voices are alike. It is proved based on this study. It is too distant in the timing, mean of the formant frequencies, and the pitch. There are other voices and imitations also to work on like (the famous actors/artists in the Tollywood) NTR (Nandamuri Taraka Ramarao) ANR (Akkineni Nageswara Rao), Krishna, Chiranjeevi, etc., References [1] Anders Eriksson and Pär Wretling. How flexible is the human voice? – A case study of mimicry Umeå University, S-901 87 Umeå, Sweden. [2] Bessler, P. (1991) La caricature de de Gaulle par Tissot: Étude phonostylistique. In: Information/Communication, 12, 19–32. Canadian Scholars’ Press. [3] Endres, W., W. Bambach & G. Flösser. (1971) Voice spectrograms as a function of age, voice disguise, and voice imitation. J. Acoust. Soc. Am., 49, 1842–1848. Volume 3 Issue 10, October 2014 www.ijsr.net Paper ID: OCT14474 1177 Licensed Under Creative Commons Attribution CC BY
  • 3. International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Impact Factor (2012): 3.358 [4] Philip Rose.( 2002) Forensic Speaker Identification: [5] Traunmüller, H. & A. Eriksson. (1995) The perceptual evaluation of F0 excursions in speech as evidenced in liveliness estimations. J. Acoust. Soc. Am., 97, 1905– 1915. Author Profile Mr. Babi Duli, a Ph.D research scholar at the EFL University, from the Department of Linguistics and Phonetics, School of Phonetics and Spoken English, continuing his research in the area of Forensic Phonetics. He had a master degree from Adikavi Nannaya University, Rajahmundry. He was having experience in teaching Pronunciation. He worked for several prestigious institutions, and trained many students in the area of pronunciation. Volume 3 Issue 10, October 2014 www.ijsr.net Taylor And Fransic Series. London Paper ID: OCT14474 1178 Licensed Under Creative Commons Attribution CC BY