SlideShare a Scribd company logo
A TOOL TO CONVERT TEXT TO
SPEECH WITH EMOTIONS
EmoSpeak
Submitted to: Ms. Shikha Jain
Submitted by:
Akriti Saini (10503902)
Stuti Shukla (10503870)
What is NLP?
 Natural language processing (NLP) is a field
of computer science, artificial intelligence,
and linguistics concerned with the interactions
between computers and human (natural) languages. As
such, NLP is related to the area of human–computer
interaction. Many challenges in NLP involve natural
language understanding, that is, enabling computers to
derive meaning from human or natural language input,
and others involve natural language generation.
 The area of NLP we are concerned with is:
Text-to-Speech with emotions.
What is Text-to-Speech?
 A text-to-speech (TTS) system converts normal
language text into speech.
 The quality of a speech synthesizer is judged by its
similarity to the human voice and by its ability to
be understood clearly. An intelligible text-to-
speech program allows people with visual
impairments or reading disabilities to listen to
written works on a home computer.
Our Tool: EmoSpeak
 EmoSpeak converts text to speech in such a way
that it takes into account all the emotions of the
text and incorporates all the extracted emotions
into speech.
 The tool first identifies the various emotions in the
raw text and then modifies certain characteristics
of the voice in order to modulate it, and then
expresses the various emotions.
 The tool is composed of two parts: a front-end and
a back-end. The front-end is responsible for text
normalization, pre-processing, or tokenization.
The back-end—often referred to as
the synthesizer—then converts the symbolic
linguistic representation into sound.
Voice Modulation
 One of the goals of text-to-speech(TTS) systems is to
produce natural-sounding synthesized speech.
Towards this end various natural language
processing (NLP) tasks are performed to model the
prosodic aspects of the TTS.
 One of the fundamental NLP task being used is the
part-of-speech (POS) tagging of the words in the
text.
 The voice modulation aspect of the project. i.e.
changing certain characteristics of the voice based on
a particular emotion has various characteristics of
the voice that could be changed such as f0
frequency, f0 contour, f0 range, jitter, nasal
duration etc.
 These characteristics are changed according to the
emotion, which is set by the user.
Implementation
 For implementation purpose, the first task is to take a
pdf file as an input and convert it to the corresponding
text file.
 The text is then tokenized and decision regarding the
class (emotional or neutral) to which it belongs is taken.
Upon deciding that the text belongs to the emotional
class, it is then required to identify the emotional
subcategory to which the text belongs- suppose ‘happy’
 The above classification can be done by using WordNet
and WordNet-Affect. Now depending on emotions, the
voice can be accordingly modulated by varying the
intensity, time of pause between the words, pitch of the
voice.
Diagram for Emotion Extraction from the
Text
Integrated Literature Survey
 By exploring various research papers we infer that
there are various approaches available which can be
followed to implement our application. Our first task
should be to decide upon whether the text falls in
emotional or non-emotional (neutral) class.
 The important thing that we came to know was that,
using WordNet and WordNet affect was the best way
in order to identify the emotions in a particular text,
because it had the maximum precision among all the
other procedures, like LSA
 From the literature survey we also conclude that
there are various text-to-speech engines available
and our foremost task would be to choose an
appropriate engine according to the requirements.
We came across the researches in which emotional
text-to-speech engine has been implemented for the
Italian and Arabic languages.
Application and Significance of
the project
 It can be used to inculcate the habits of reading books in
the children, as from human psychology it can be
inferred that the particular task when done or performed
beyond a certain limit, develops a liking for that
particular task. So by listening to various type of books
children will develop a habit of reading books.
 It can also be used to supplement children’s reading
classes. A child learns easily especially when things are
pointed to him. They can listen to a voice reading the
contents of the book as they follow with their eyes. It can
be used as a tutor replacing the need of teacher to guide
children
 By implementing expressive child-directed
storytelling in a text-to-speech application, it can be
useful in therapeutic education of children with
communication disorders. This can be done by
helping them to learn how to express their feeling
and try to communicate.
 It can help visually impaired or the people with
certain reading disabilities to get the feel of reading a
book.
Major presentation on EmoSpeak

More Related Content

What's hot

Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
Jose Zagal
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
Kuppusamy P
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technology
Kalluri Madhuri
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
KarenVacca
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
Seminar Links
 
Big data
Big dataBig data
Big data
Ishucs
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
Bilgin Aksoy
 
F0363942
F0363942F0363942
F0363942
iosrjournals
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
Varun Jain
 
Comprehension and Language (handout version)
Comprehension and Language (handout version)Comprehension and Language (handout version)
Comprehension and Language (handout version)
Al Alva
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
Iqbal
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
Jimit Rupani
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
International Islamic University
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Basha Chand
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
SrijanKumar18
 
Nlp
NlpNlp

What's hot (20)

Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Ho3114511454
Ho3114511454Ho3114511454
Ho3114511454
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technology
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Big data
Big dataBig data
Big data
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
ESSAY2
ESSAY2ESSAY2
ESSAY2
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 
F0363942
F0363942F0363942
F0363942
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
 
Comprehension and Language (handout version)
Comprehension and Language (handout version)Comprehension and Language (handout version)
Comprehension and Language (handout version)
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Nlp
NlpNlp
Nlp
 
C5 giruba beulah
C5 giruba beulahC5 giruba beulah
C5 giruba beulah
 

Viewers also liked

Eduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and music
Eduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and musicEduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and music
Eduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and musicswissnex San Francisco
 
seminar emosparkk
seminar emosparkk seminar emosparkk
seminar emosparkk
Akshay Vş
 
Emospark
EmosparkEmospark
Emospark
sangu21
 
emospark-ppt
 emospark-ppt emospark-ppt
emospark-ppt
Chaitanya Ram
 
Emotional Tts
Emotional TtsEmotional Tts
Emotional Tts
felix burkhardt
 
EmoSpark
EmoSparkEmoSpark
EmoSpark
Anand
 
Advantages and Disadvantages of Technology
Advantages and Disadvantages of TechnologyAdvantages and Disadvantages of Technology
Advantages and Disadvantages of Technology
09356506910
 
Advantages and disadvantages of technology
Advantages and disadvantages of technologyAdvantages and disadvantages of technology
Advantages and disadvantages of technology
Huseyin87
 
Advantages and Disadvantages of Technology
Advantages and Disadvantages of TechnologyAdvantages and Disadvantages of Technology
Advantages and Disadvantages of Technology
Pave Maris Cortez
 

Viewers also liked (10)

Eduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and music
Eduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and musicEduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and music
Eduardo Coutinho - Psychoacoustic cues to emotion in speech prosody and music
 
PPt file
PPt filePPt file
PPt file
 
seminar emosparkk
seminar emosparkk seminar emosparkk
seminar emosparkk
 
Emospark
EmosparkEmospark
Emospark
 
emospark-ppt
 emospark-ppt emospark-ppt
emospark-ppt
 
Emotional Tts
Emotional TtsEmotional Tts
Emotional Tts
 
EmoSpark
EmoSparkEmoSpark
EmoSpark
 
Advantages and Disadvantages of Technology
Advantages and Disadvantages of TechnologyAdvantages and Disadvantages of Technology
Advantages and Disadvantages of Technology
 
Advantages and disadvantages of technology
Advantages and disadvantages of technologyAdvantages and disadvantages of technology
Advantages and disadvantages of technology
 
Advantages and Disadvantages of Technology
Advantages and Disadvantages of TechnologyAdvantages and Disadvantages of Technology
Advantages and Disadvantages of Technology
 

Similar to Major presentation on EmoSpeak

Natural language processing in artificial intelligence
Natural language processing in artificial intelligenceNatural language processing in artificial intelligence
Natural language processing in artificial intelligence
Abdul Rafay
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Robert Antony
 
Untitled presentation.pdf
Untitled presentation.pdfUntitled presentation.pdf
Untitled presentation.pdf
Upinder Kaur
 
Natural language understanding of chatbots
Natural language understanding of chatbotsNatural language understanding of chatbots
Natural language understanding of chatbots
abn17p
 
Natural language understandihggjsjng. pptx
Natural language understandihggjsjng. pptxNatural language understandihggjsjng. pptx
Natural language understandihggjsjng. pptx
MAKSHAY6
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overview
Benjaminlapid1
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-Speech
Ngwe Tun
 
Article Summaries
Article SummariesArticle Summaries
Article SummariesORhonda
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challenges
antonellarose
 
AI - natural language processing
AI - natural language processingAI - natural language processing
AI - natural language processing
Learnbay Datascience
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Saurav Aryal
 
Syracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docxSyracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docx
deanmtaylor1545
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Bhavya Chawla
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...
Daniel Adenew
 
Role of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesRole of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languages
Dr. Amit Kumar Jha
 
5. phases of nlp
5. phases of nlp5. phases of nlp
5. phases of nlp
monircse2
 
The human mind at work
The human mind at workThe human mind at work
The human mind at work
Faith Clavaton
 
Linguistic Fundamentals in Translation and Translation Studies
Linguistic Fundamentals in Translation and Translation StudiesLinguistic Fundamentals in Translation and Translation Studies
Linguistic Fundamentals in Translation and Translation Studies
Sugey7
 
AI_08_NLP.pptx
AI_08_NLP.pptxAI_08_NLP.pptx
AI_08_NLP.pptx
Yousef Aburawi
 

Similar to Major presentation on EmoSpeak (20)

Natural language processing in artificial intelligence
Natural language processing in artificial intelligenceNatural language processing in artificial intelligence
Natural language processing in artificial intelligence
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Untitled presentation.pdf
Untitled presentation.pdfUntitled presentation.pdf
Untitled presentation.pdf
 
Natural language understanding of chatbots
Natural language understanding of chatbotsNatural language understanding of chatbots
Natural language understanding of chatbots
 
Natural language understandihggjsjng. pptx
Natural language understandihggjsjng. pptxNatural language understandihggjsjng. pptx
Natural language understandihggjsjng. pptx
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overview
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-Speech
 
Article Summaries
Article SummariesArticle Summaries
Article Summaries
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challenges
 
AI - natural language processing
AI - natural language processingAI - natural language processing
AI - natural language processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Syracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docxSyracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docx
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...
 
Role of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesRole of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languages
 
NLP
NLPNLP
NLP
 
5. phases of nlp
5. phases of nlp5. phases of nlp
5. phases of nlp
 
The human mind at work
The human mind at workThe human mind at work
The human mind at work
 
Linguistic Fundamentals in Translation and Translation Studies
Linguistic Fundamentals in Translation and Translation StudiesLinguistic Fundamentals in Translation and Translation Studies
Linguistic Fundamentals in Translation and Translation Studies
 
AI_08_NLP.pptx
AI_08_NLP.pptxAI_08_NLP.pptx
AI_08_NLP.pptx
 

Recently uploaded

DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
gestioneergodomus
 
Technical Drawings introduction to drawing of prisms
Technical Drawings introduction to drawing of prismsTechnical Drawings introduction to drawing of prisms
Technical Drawings introduction to drawing of prisms
heavyhaig
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
ihlasbinance2003
 
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
dxobcob
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
Kamal Acharya
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
zwunae
 
bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
Divyam548318
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
Ethernet Routing and switching chapter 1.ppt
Ethernet Routing and switching chapter 1.pptEthernet Routing and switching chapter 1.ppt
Ethernet Routing and switching chapter 1.ppt
azkamurat
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
camseq
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
MIGUELANGEL966976
 
Series of visio cisco devices Cisco_Icons.ppt
Series of visio cisco devices Cisco_Icons.pptSeries of visio cisco devices Cisco_Icons.ppt
Series of visio cisco devices Cisco_Icons.ppt
PauloRodrigues104553
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
NidhalKahouli2
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
awadeshbabu
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
obonagu
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
SUTEJAS
 
Low power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniquesLow power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniques
nooriasukmaningtyas
 

Recently uploaded (20)

DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
 
Technical Drawings introduction to drawing of prisms
Technical Drawings introduction to drawing of prismsTechnical Drawings introduction to drawing of prisms
Technical Drawings introduction to drawing of prisms
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
 
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
 
bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Ethernet Routing and switching chapter 1.ppt
Ethernet Routing and switching chapter 1.pptEthernet Routing and switching chapter 1.ppt
Ethernet Routing and switching chapter 1.ppt
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
 
Series of visio cisco devices Cisco_Icons.ppt
Series of visio cisco devices Cisco_Icons.pptSeries of visio cisco devices Cisco_Icons.ppt
Series of visio cisco devices Cisco_Icons.ppt
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
Low power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniquesLow power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniques
 

Major presentation on EmoSpeak

  • 1. A TOOL TO CONVERT TEXT TO SPEECH WITH EMOTIONS EmoSpeak Submitted to: Ms. Shikha Jain Submitted by: Akriti Saini (10503902) Stuti Shukla (10503870)
  • 2. What is NLP?  Natural language processing (NLP) is a field of computer science, artificial intelligence, and linguistics concerned with the interactions between computers and human (natural) languages. As such, NLP is related to the area of human–computer interaction. Many challenges in NLP involve natural language understanding, that is, enabling computers to derive meaning from human or natural language input, and others involve natural language generation.  The area of NLP we are concerned with is: Text-to-Speech with emotions.
  • 3. What is Text-to-Speech?  A text-to-speech (TTS) system converts normal language text into speech.  The quality of a speech synthesizer is judged by its similarity to the human voice and by its ability to be understood clearly. An intelligible text-to- speech program allows people with visual impairments or reading disabilities to listen to written works on a home computer.
  • 4. Our Tool: EmoSpeak  EmoSpeak converts text to speech in such a way that it takes into account all the emotions of the text and incorporates all the extracted emotions into speech.  The tool first identifies the various emotions in the raw text and then modifies certain characteristics of the voice in order to modulate it, and then expresses the various emotions.
  • 5.  The tool is composed of two parts: a front-end and a back-end. The front-end is responsible for text normalization, pre-processing, or tokenization. The back-end—often referred to as the synthesizer—then converts the symbolic linguistic representation into sound.
  • 6. Voice Modulation  One of the goals of text-to-speech(TTS) systems is to produce natural-sounding synthesized speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS.  One of the fundamental NLP task being used is the part-of-speech (POS) tagging of the words in the text.
  • 7.  The voice modulation aspect of the project. i.e. changing certain characteristics of the voice based on a particular emotion has various characteristics of the voice that could be changed such as f0 frequency, f0 contour, f0 range, jitter, nasal duration etc.  These characteristics are changed according to the emotion, which is set by the user.
  • 8. Implementation  For implementation purpose, the first task is to take a pdf file as an input and convert it to the corresponding text file.  The text is then tokenized and decision regarding the class (emotional or neutral) to which it belongs is taken. Upon deciding that the text belongs to the emotional class, it is then required to identify the emotional subcategory to which the text belongs- suppose ‘happy’  The above classification can be done by using WordNet and WordNet-Affect. Now depending on emotions, the voice can be accordingly modulated by varying the intensity, time of pause between the words, pitch of the voice.
  • 9. Diagram for Emotion Extraction from the Text
  • 10. Integrated Literature Survey  By exploring various research papers we infer that there are various approaches available which can be followed to implement our application. Our first task should be to decide upon whether the text falls in emotional or non-emotional (neutral) class.  The important thing that we came to know was that, using WordNet and WordNet affect was the best way in order to identify the emotions in a particular text, because it had the maximum precision among all the other procedures, like LSA
  • 11.  From the literature survey we also conclude that there are various text-to-speech engines available and our foremost task would be to choose an appropriate engine according to the requirements. We came across the researches in which emotional text-to-speech engine has been implemented for the Italian and Arabic languages.
  • 12. Application and Significance of the project  It can be used to inculcate the habits of reading books in the children, as from human psychology it can be inferred that the particular task when done or performed beyond a certain limit, develops a liking for that particular task. So by listening to various type of books children will develop a habit of reading books.  It can also be used to supplement children’s reading classes. A child learns easily especially when things are pointed to him. They can listen to a voice reading the contents of the book as they follow with their eyes. It can be used as a tutor replacing the need of teacher to guide children
  • 13.  By implementing expressive child-directed storytelling in a text-to-speech application, it can be useful in therapeutic education of children with communication disorders. This can be done by helping them to learn how to express their feeling and try to communicate.  It can help visually impaired or the people with certain reading disabilities to get the feel of reading a book.