SlideShare a Scribd company logo
1 of 15
2022 International Conference for
Advancement in Technology
(ICONAT 2022)
An Acoustic and Statistic Study of Emotions Expressed
in Marathi Speech
Paper ID:433
By
TRUPTI K. HARHARE,
MILIND SHAH
University of Mumbai
Outline
1. Introduction
2. Literature Survey
3. Methodology
4. Implementation
5. Results and Discussion
6. Conclusion
7. Future work
References
1. Introduction
Problem Definition
The COVID-19 pandemic has drastically altered people's lifestyles in
many parts of the world. The lockdowns and social distancing norms
eventually increased human-machine interaction applications.
Improvements are being made in speech recognition, speaker
recognition, and various human-system interaction technologies.
Recognizing emotion, on the alternative hand, is still under research
for building a prosody model.
Marathi language is poorly studied as far as emotions is concerned.
Need to analyze and compare the acoustic correlations of the prosody
features for various emotions in Marathi language.
In the Marathi language, this paper seeks to acoustically and
statistically evaluate acting speech for anger, happiness, fear, and
neutral emotions.
1. Introduction
Objectives
To develop information processing tools and techniques to facilitate
human-machine interaction without language barrier (Technology
Development for Indian Languages (TDIL) Programme initiated by
the Ministry of Electronics & Information Technology, Govt. of
India.)
To create awareness and positive attitude towards Marathi language
through its visibility in the public domain.
Creating and accessing Marathi language knowledge resources; and
integrating them to develop innovative user products and services.
Analyse and compare the acoustic and statistic study of emotions
expressed in Marathi speech.
2. Literature Survey
Speech : Segmental Information: Vowels, consonants[1-3].
Suprasegmental Information: Intonation, Rhythm, tone, stress.
Suprasegmental elements called as PROSODY features.
Prosody is characterized by emotions and speaking style.
Basic emotions like happy, sad, angry, fear and neutral readout style
emotions were considered [1-5].
The analysis of acoustic parameters like duration, Fundamental
Frequency/pitch, and intensity for the corresponding emotions were
carried out [6-11].
Statistical analysis [12-15] was performed after acoustic analysis to
corroborate the results of the acoustic analysis and then to pick the best
prosodic features to develop a prosody model.
3.Methodology
•Collection of neutral meaning sentences from story books, novels etc.
•Recordings of the utterances in four emotion styles such as angry,
happy, fear and neutral.
•Perceptual verification
•Annotation of the recorded utterances in PRAAT
•Calculation of twelve prosodic features based on pitch, intensity,
duration, and formants for all the emotional utterances.
•Acoustic analysis and statistical analysis of the annotated data based
on the emotions.
4.Implementation
•Objective: To determine the important prosodic features for emotion
classification for Marathi speech using acoustic and statistical hypothesis test.
•Implementation: The acoustic analysis carried out on twelve prosodic features.
•One way ANOVA is a hypothesis test, of the twelve prosodic features calculated
independently to determine whether the mean values of these prosodic features
differ significantly across the four emotions.
•The ANOVA analysis establishes a difference between two or more group
means, but it does not specify which groups are significantly different [20-23].
•The Tukey test which compares the means of the features pairwise to discover
whether a significant difference exists between each pair.
•Advantage: The emotion categorization capacity of the twelve prosodic features
is determined using the findings of ANOVA and the Tukey technique.
5. Results and Discussion
•Pitch Related Features
•A. Acoustic Analysis:
B. ANOVA analysis:
Maximum pitch Minimum pitch Median pitch
F(3, 403)=8.402, p <0.001 F(3,403)= 21.54, p <0.001 F(3,403)= 65.07, p <0.001
The results of ANOVA analysis showed that all these pitch features are significant and can be used for
emotion classification in Marathi.
5. Results and Discussion
•Intensity Related Features
A. Acoustic Analysis:
A. ANOVA analysis:
Maximum Intensity Minimum Intensity Mean Intensity
F(3,403)= 87.34, p <0.001 F(3,403)= 6.325, p <0.001 F(3,403)= 98.01, p <0.001
The results of ANOVA analysis showed that all these intensity features are significant and can
be used for emotion classification in Marathi.
5. Results and Discussion
•Duration Related Features
Acoustic Analysis:
•
B. ANOVA analysis:
Number of syllables per second Number of voice breaks sentence duration
F(3,403)=31.8, p<0.001 F(3,403)= 19.14, p<0.001 F(3,403)= 24.35, p<0.001
The results of ANOVA analysis showed that all these duration features are
significant and can be used for emotion classification in Marathi.
5. Results and Discussion
•Formant Related Features
A. Acoustic Analysis:
B. ANOVA analysis:
F1 F2 F3
F(3,403)=15.84, p <0.001 F(3,403)= 1.941 F(3,403)= 14.51, p <0.001
•The results of ANOVA analysis showed that F1 and F3 features are significant and can be used for
emotion classification in Marathi but as the formant F2 is non-significant and will not be much useful
feature for classifying the emotions.
Continued
Pitch Related Features
Features F value calculated
with one-way ANOVA
Analysis
Tukey Test for multiple comparisons of means (Confidence level: 95%)
(Significant codes: p<0.0001: ‘***’, p<0.001: ‘**’,
p <0.01: ‘*’, p<0.05: ‘.’, p< 0.1 ‘’)
Significant Non-significant
Maximum Pitch 8.402 Fear-Anger, p<0.0001
Happiness-Fear, p<0.0001
Neutral-Happiness, p<0.05
Neutral-Fear, p<0.05
Happiness-Anger
Neutral-Anger
Minimum Pitch 21.54 Neutral-Anger, p<0.0001
Neutral-Happiness, p<0.0001
Neutral-Fear, p<0.0001
Fear-Anger
Happiness-Anger
Happiness-Fear
Median Pitch 65.07 Fear-Anger, p<0.0001
Neutral-Anger, p<0.0001
Happiness-Fear, p<0.0001
Neutral-Happiness, p<0.0001
Happiness-Anger
Neutral-Fear
Continued
Formants Related Features
Features F value calculated
with one-way ANOVA
Analysis
Tukey Test for multiple comparisons of means
(Confidence level: 95%)
(Significant codes: p<0.0001: ‘***’, p<0.001: ‘**’, p <0.01:
‘*’, p<0.05: ‘.’, p< 0.1 ‘’)
Significant Non-significant
F1 15.84 Fear-Anger, p<0.0001
Happiness-Anger, p<0.0001
Neutral-Anger, p<0.0001
Happiness-Fear, p<0.05
Neutral-Fear
Neutral-Happiness
F2 1.941 Fear-Anger
Happiness-Anger
Neutral-Anger
Happiness-Fear
Neutral-Fear
Neutral-Happiness
F3 14.59 Fear-Anger,p<0.0001
Happiness-Fear, p<0.0001
Neutral-Fear, p <0.01
Neutral-Happiness, p<0.05
Happiness-Anger
Neutral-Anger
6. Conclusion
This study presented the analysis of the Marathi emotional speech
database from an acoustic and statistical perspective.
A statistical study employing 1-way ANOVA indicated significant
changes in various prosodic features according to emotions.
ANOVA analysis showed that all the pitch attributes, duration
attributes, intensity-related attributes are significant for emotion
classification in Marathi. The formant F2 is a non-significant and not
useful feature for classifying the emotions in the Marathi language.
The Tuckey test revealed that even though the prosodic features are
significant in ANOVA analysis, all of the features are not distinct, but
there can be pairwise similarity in feature behavior for emotions.
This study demonstrated the usefulness of statistical tests to assess
the Marathi emotional speech database.
References
[1] T. Wani, T. Gunwan, S. Qadri, M. Kartiwi, “A Comprehensive Review of Speech Emotion Recognition Systems”, IEEEAccess, vol. 9, pp.
47795–47814, April 2021.
[2] P. Rao, N. Sanghvi, H. Mixdorff, K. Sabu, "Acoustic correlates of focus in Marathi: Production and perception", Journal of Phonetics, vol. 65,
pp. 110, 2017.
[3] J. Yadav and K. S. Rao. ”Generation of emotional speech by prosody imposition on sentence, word and syllable level fragments of neutral
speech”, in Proc. International Conference on Cognitive Computing and Information Processing (CCIP), March 3-4, pp. 1-5, 2015.
[4] M. C. Madhavi, S. Sharma and H. A. Patil, "Development of language resources for speech application in Gujarati and Marathi," International
Conference on Asian Language Processing (IALP), Kuching, pp. 115-118, 2014.
[5] X. Yang and Y. Yang, “Prosodic Realization of Rhetorical Structure in Chinese Discourse”, IEEE Transactions on Audio, Speech, and
Language Processing, vol. 20, no.4, May 2012.
[6] A. Agrawal, A. Dev, “Emotion recognition and conversion based on segmentation of speech in Hindi language”, IEEE International
Conference on Computing for Sustainable Global Development, New Delhi, India, 2015.
[7] N. Apandi, N. Jamil, “An analysis of Malay language emotional speech corpus for emotion recognition system”, Industrial Electronics and
Applications Conference(IEACon), IEEE, Kota Kinabalu, Malaysia, 2016.
[8] E. Väyrynen, “Emotion recognition from speech using prosodic features”, Academic dissertation, University of Oulu, Finland, 2014.
[9] T. Wang, Y. Lee, Q. Ma, “Within and Across-Language Comparison of Vocal Emotions in Mandarin and English”, Appl. Sci. vol.8, pp.2629;
December 2018.
[10] J. Tao, Y. Kang and A. Li. “Prosody conversion from neutral speech to emotional speech”, IEEE Trans. On Audio, Speech, and Language
Processing, 14(4), pp. 1145-1154, 2006.
[11] M. Begum, N. Raja, Ainon, R. Zainuddin, Z. M. Don, G. Knowles, “Prosody Generation by Integrating Rule and Template based Approaches
for Emotional Malay Speech Synthesis”, In Proc. TENCON, Hyderabad, India, pp. 1-6, Nov. 2008.
[12] Agnes Jacob, P.Mythili, “Upgrading the Performance of Speech Emotion Recognition at the Segmental Level”, IOSR Journal of Computer
Engineering (IOSR-JCE) Volume 15, Issue 3, pp. 48-52, 2013.
[13] T. Iliou,C.Anagnostopoulos, “Classification on Speech Emotion Recognition - A Comparative Study”, International Journal on
Advances in Life Sciences, vol 2 no 1 & 2, 2010.
[14] S. Ali, M. Andleeb, D. Rehman, “A Study of the Effect of Emotions and Software on Prosodic Features on Spoken Utterances in Urdu
Language”,
I.J. Image, Graphics and Signal Processing, vol. 4, pp.46-53,2016.
[15] M. Yusnita A, Paulraj M. P., S. Yaacobb , N. Fadzilah, Shahriman A. B.,“Acoustic Analysis of Formants across Genders and Ethnical Accents
in Malaysian English using ANOVA”, International Conference On Design and Manufacturing, vol.64, pp. 385–394, 2013.

More Related Content

Similar to ICONAT__PRESENTATION_TEMPLETE.pptx

Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...journalBEEI
 
Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...IJECEIAES
 
Signal Processing Tool for Emotion Recognition
Signal Processing Tool for Emotion RecognitionSignal Processing Tool for Emotion Recognition
Signal Processing Tool for Emotion Recognitionidescitation
 
Speech Feature Extraction and Data Visualisation
Speech Feature Extraction and Data VisualisationSpeech Feature Extraction and Data Visualisation
Speech Feature Extraction and Data VisualisationITIIIndustries
 
A hybrid strategy for emotion classification
A hybrid strategy for emotion classificationA hybrid strategy for emotion classification
A hybrid strategy for emotion classificationnooriasukmaningtyas
 
Interpreters and Emotional Intelligence How do we use it and why does it matter?
Interpreters and Emotional Intelligence How do we use it and why does it matter?Interpreters and Emotional Intelligence How do we use it and why does it matter?
Interpreters and Emotional Intelligence How do we use it and why does it matter?Diana Singureanu
 
76201926
7620192676201926
76201926IJRAT
 
Polarity detection of movie reviews in
Polarity detection of movie reviews inPolarity detection of movie reviews in
Polarity detection of movie reviews inijcsa
 
Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”
Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”
Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”IRJET Journal
 
The Language of TV Commercials’ Slogans: A Semantic Analysis
The Language of TV Commercials’ Slogans: A Semantic AnalysisThe Language of TV Commercials’ Slogans: A Semantic Analysis
The Language of TV Commercials’ Slogans: A Semantic AnalysisBahram Kazemian
 
IRJET - Survey on Different Approaches of Depression Analysis
IRJET - Survey on Different Approaches of Depression AnalysisIRJET - Survey on Different Approaches of Depression Analysis
IRJET - Survey on Different Approaches of Depression AnalysisIRJET Journal
 
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...IJECEIAES
 
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...
A  SURVEY OF  S ENTIMENT CLASSIFICATION  TECHNIQUES USED FOR  I NDIAN REGIONA...A  SURVEY OF  S ENTIMENT CLASSIFICATION  TECHNIQUES USED FOR  I NDIAN REGIONA...
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...ijcsa
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...sipij
 

Similar to ICONAT__PRESENTATION_TEMPLETE.pptx (20)

Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...
Evaluation of Support Vector Machine and Decision Tree for Emotion Recognitio...
 
H010215561
H010215561H010215561
H010215561
 
50120130406003
5012013040600350120130406003
50120130406003
 
50120130406003
5012013040600350120130406003
50120130406003
 
Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...
 
Signal Processing Tool for Emotion Recognition
Signal Processing Tool for Emotion RecognitionSignal Processing Tool for Emotion Recognition
Signal Processing Tool for Emotion Recognition
 
Speech Feature Extraction and Data Visualisation
Speech Feature Extraction and Data VisualisationSpeech Feature Extraction and Data Visualisation
Speech Feature Extraction and Data Visualisation
 
A hybrid strategy for emotion classification
A hybrid strategy for emotion classificationA hybrid strategy for emotion classification
A hybrid strategy for emotion classification
 
Interpreters and Emotional Intelligence How do we use it and why does it matter?
Interpreters and Emotional Intelligence How do we use it and why does it matter?Interpreters and Emotional Intelligence How do we use it and why does it matter?
Interpreters and Emotional Intelligence How do we use it and why does it matter?
 
76201926
7620192676201926
76201926
 
N01741100102
N01741100102N01741100102
N01741100102
 
rs_day_10012015
rs_day_10012015rs_day_10012015
rs_day_10012015
 
Polarity detection of movie reviews in
Polarity detection of movie reviews inPolarity detection of movie reviews in
Polarity detection of movie reviews in
 
Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”
Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”
Literature Review On: ”Speech Emotion Recognition Using Deep Neural Network”
 
The Language of TV Commercials’ Slogans: A Semantic Analysis
The Language of TV Commercials’ Slogans: A Semantic AnalysisThe Language of TV Commercials’ Slogans: A Semantic Analysis
The Language of TV Commercials’ Slogans: A Semantic Analysis
 
IRJET - Survey on Different Approaches of Depression Analysis
IRJET - Survey on Different Approaches of Depression AnalysisIRJET - Survey on Different Approaches of Depression Analysis
IRJET - Survey on Different Approaches of Depression Analysis
 
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...
 
F334047
F334047F334047
F334047
 
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...
A  SURVEY OF  S ENTIMENT CLASSIFICATION  TECHNIQUES USED FOR  I NDIAN REGIONA...A  SURVEY OF  S ENTIMENT CLASSIFICATION  TECHNIQUES USED FOR  I NDIAN REGIONA...
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
 

Recently uploaded

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Christo Ananth
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLManishPatel169454
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 

Recently uploaded (20)

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 

ICONAT__PRESENTATION_TEMPLETE.pptx

  • 1. 2022 International Conference for Advancement in Technology (ICONAT 2022) An Acoustic and Statistic Study of Emotions Expressed in Marathi Speech Paper ID:433 By TRUPTI K. HARHARE, MILIND SHAH University of Mumbai
  • 2. Outline 1. Introduction 2. Literature Survey 3. Methodology 4. Implementation 5. Results and Discussion 6. Conclusion 7. Future work References
  • 3. 1. Introduction Problem Definition The COVID-19 pandemic has drastically altered people's lifestyles in many parts of the world. The lockdowns and social distancing norms eventually increased human-machine interaction applications. Improvements are being made in speech recognition, speaker recognition, and various human-system interaction technologies. Recognizing emotion, on the alternative hand, is still under research for building a prosody model. Marathi language is poorly studied as far as emotions is concerned. Need to analyze and compare the acoustic correlations of the prosody features for various emotions in Marathi language. In the Marathi language, this paper seeks to acoustically and statistically evaluate acting speech for anger, happiness, fear, and neutral emotions.
  • 4. 1. Introduction Objectives To develop information processing tools and techniques to facilitate human-machine interaction without language barrier (Technology Development for Indian Languages (TDIL) Programme initiated by the Ministry of Electronics & Information Technology, Govt. of India.) To create awareness and positive attitude towards Marathi language through its visibility in the public domain. Creating and accessing Marathi language knowledge resources; and integrating them to develop innovative user products and services. Analyse and compare the acoustic and statistic study of emotions expressed in Marathi speech.
  • 5. 2. Literature Survey Speech : Segmental Information: Vowels, consonants[1-3]. Suprasegmental Information: Intonation, Rhythm, tone, stress. Suprasegmental elements called as PROSODY features. Prosody is characterized by emotions and speaking style. Basic emotions like happy, sad, angry, fear and neutral readout style emotions were considered [1-5]. The analysis of acoustic parameters like duration, Fundamental Frequency/pitch, and intensity for the corresponding emotions were carried out [6-11]. Statistical analysis [12-15] was performed after acoustic analysis to corroborate the results of the acoustic analysis and then to pick the best prosodic features to develop a prosody model.
  • 6. 3.Methodology •Collection of neutral meaning sentences from story books, novels etc. •Recordings of the utterances in four emotion styles such as angry, happy, fear and neutral. •Perceptual verification •Annotation of the recorded utterances in PRAAT •Calculation of twelve prosodic features based on pitch, intensity, duration, and formants for all the emotional utterances. •Acoustic analysis and statistical analysis of the annotated data based on the emotions.
  • 7. 4.Implementation •Objective: To determine the important prosodic features for emotion classification for Marathi speech using acoustic and statistical hypothesis test. •Implementation: The acoustic analysis carried out on twelve prosodic features. •One way ANOVA is a hypothesis test, of the twelve prosodic features calculated independently to determine whether the mean values of these prosodic features differ significantly across the four emotions. •The ANOVA analysis establishes a difference between two or more group means, but it does not specify which groups are significantly different [20-23]. •The Tukey test which compares the means of the features pairwise to discover whether a significant difference exists between each pair. •Advantage: The emotion categorization capacity of the twelve prosodic features is determined using the findings of ANOVA and the Tukey technique.
  • 8. 5. Results and Discussion •Pitch Related Features •A. Acoustic Analysis: B. ANOVA analysis: Maximum pitch Minimum pitch Median pitch F(3, 403)=8.402, p <0.001 F(3,403)= 21.54, p <0.001 F(3,403)= 65.07, p <0.001 The results of ANOVA analysis showed that all these pitch features are significant and can be used for emotion classification in Marathi.
  • 9. 5. Results and Discussion •Intensity Related Features A. Acoustic Analysis: A. ANOVA analysis: Maximum Intensity Minimum Intensity Mean Intensity F(3,403)= 87.34, p <0.001 F(3,403)= 6.325, p <0.001 F(3,403)= 98.01, p <0.001 The results of ANOVA analysis showed that all these intensity features are significant and can be used for emotion classification in Marathi.
  • 10. 5. Results and Discussion •Duration Related Features Acoustic Analysis: • B. ANOVA analysis: Number of syllables per second Number of voice breaks sentence duration F(3,403)=31.8, p<0.001 F(3,403)= 19.14, p<0.001 F(3,403)= 24.35, p<0.001 The results of ANOVA analysis showed that all these duration features are significant and can be used for emotion classification in Marathi.
  • 11. 5. Results and Discussion •Formant Related Features A. Acoustic Analysis: B. ANOVA analysis: F1 F2 F3 F(3,403)=15.84, p <0.001 F(3,403)= 1.941 F(3,403)= 14.51, p <0.001 •The results of ANOVA analysis showed that F1 and F3 features are significant and can be used for emotion classification in Marathi but as the formant F2 is non-significant and will not be much useful feature for classifying the emotions.
  • 12. Continued Pitch Related Features Features F value calculated with one-way ANOVA Analysis Tukey Test for multiple comparisons of means (Confidence level: 95%) (Significant codes: p<0.0001: ‘***’, p<0.001: ‘**’, p <0.01: ‘*’, p<0.05: ‘.’, p< 0.1 ‘’) Significant Non-significant Maximum Pitch 8.402 Fear-Anger, p<0.0001 Happiness-Fear, p<0.0001 Neutral-Happiness, p<0.05 Neutral-Fear, p<0.05 Happiness-Anger Neutral-Anger Minimum Pitch 21.54 Neutral-Anger, p<0.0001 Neutral-Happiness, p<0.0001 Neutral-Fear, p<0.0001 Fear-Anger Happiness-Anger Happiness-Fear Median Pitch 65.07 Fear-Anger, p<0.0001 Neutral-Anger, p<0.0001 Happiness-Fear, p<0.0001 Neutral-Happiness, p<0.0001 Happiness-Anger Neutral-Fear
  • 13. Continued Formants Related Features Features F value calculated with one-way ANOVA Analysis Tukey Test for multiple comparisons of means (Confidence level: 95%) (Significant codes: p<0.0001: ‘***’, p<0.001: ‘**’, p <0.01: ‘*’, p<0.05: ‘.’, p< 0.1 ‘’) Significant Non-significant F1 15.84 Fear-Anger, p<0.0001 Happiness-Anger, p<0.0001 Neutral-Anger, p<0.0001 Happiness-Fear, p<0.05 Neutral-Fear Neutral-Happiness F2 1.941 Fear-Anger Happiness-Anger Neutral-Anger Happiness-Fear Neutral-Fear Neutral-Happiness F3 14.59 Fear-Anger,p<0.0001 Happiness-Fear, p<0.0001 Neutral-Fear, p <0.01 Neutral-Happiness, p<0.05 Happiness-Anger Neutral-Anger
  • 14. 6. Conclusion This study presented the analysis of the Marathi emotional speech database from an acoustic and statistical perspective. A statistical study employing 1-way ANOVA indicated significant changes in various prosodic features according to emotions. ANOVA analysis showed that all the pitch attributes, duration attributes, intensity-related attributes are significant for emotion classification in Marathi. The formant F2 is a non-significant and not useful feature for classifying the emotions in the Marathi language. The Tuckey test revealed that even though the prosodic features are significant in ANOVA analysis, all of the features are not distinct, but there can be pairwise similarity in feature behavior for emotions. This study demonstrated the usefulness of statistical tests to assess the Marathi emotional speech database.
  • 15. References [1] T. Wani, T. Gunwan, S. Qadri, M. Kartiwi, “A Comprehensive Review of Speech Emotion Recognition Systems”, IEEEAccess, vol. 9, pp. 47795–47814, April 2021. [2] P. Rao, N. Sanghvi, H. Mixdorff, K. Sabu, "Acoustic correlates of focus in Marathi: Production and perception", Journal of Phonetics, vol. 65, pp. 110, 2017. [3] J. Yadav and K. S. Rao. ”Generation of emotional speech by prosody imposition on sentence, word and syllable level fragments of neutral speech”, in Proc. International Conference on Cognitive Computing and Information Processing (CCIP), March 3-4, pp. 1-5, 2015. [4] M. C. Madhavi, S. Sharma and H. A. Patil, "Development of language resources for speech application in Gujarati and Marathi," International Conference on Asian Language Processing (IALP), Kuching, pp. 115-118, 2014. [5] X. Yang and Y. Yang, “Prosodic Realization of Rhetorical Structure in Chinese Discourse”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no.4, May 2012. [6] A. Agrawal, A. Dev, “Emotion recognition and conversion based on segmentation of speech in Hindi language”, IEEE International Conference on Computing for Sustainable Global Development, New Delhi, India, 2015. [7] N. Apandi, N. Jamil, “An analysis of Malay language emotional speech corpus for emotion recognition system”, Industrial Electronics and Applications Conference(IEACon), IEEE, Kota Kinabalu, Malaysia, 2016. [8] E. Väyrynen, “Emotion recognition from speech using prosodic features”, Academic dissertation, University of Oulu, Finland, 2014. [9] T. Wang, Y. Lee, Q. Ma, “Within and Across-Language Comparison of Vocal Emotions in Mandarin and English”, Appl. Sci. vol.8, pp.2629; December 2018. [10] J. Tao, Y. Kang and A. Li. “Prosody conversion from neutral speech to emotional speech”, IEEE Trans. On Audio, Speech, and Language Processing, 14(4), pp. 1145-1154, 2006. [11] M. Begum, N. Raja, Ainon, R. Zainuddin, Z. M. Don, G. Knowles, “Prosody Generation by Integrating Rule and Template based Approaches for Emotional Malay Speech Synthesis”, In Proc. TENCON, Hyderabad, India, pp. 1-6, Nov. 2008. [12] Agnes Jacob, P.Mythili, “Upgrading the Performance of Speech Emotion Recognition at the Segmental Level”, IOSR Journal of Computer Engineering (IOSR-JCE) Volume 15, Issue 3, pp. 48-52, 2013. [13] T. Iliou,C.Anagnostopoulos, “Classification on Speech Emotion Recognition - A Comparative Study”, International Journal on Advances in Life Sciences, vol 2 no 1 & 2, 2010. [14] S. Ali, M. Andleeb, D. Rehman, “A Study of the Effect of Emotions and Software on Prosodic Features on Spoken Utterances in Urdu Language”, I.J. Image, Graphics and Signal Processing, vol. 4, pp.46-53,2016. [15] M. Yusnita A, Paulraj M. P., S. Yaacobb , N. Fadzilah, Shahriman A. B.,“Acoustic Analysis of Formants across Genders and Ethnical Accents in Malaysian English using ANOVA”, International Conference On Design and Manufacturing, vol.64, pp. 385–394, 2013.