SlideShare a Scribd company logo
Text
to
Speech
synthesizer.
Mini Project Report
(Speech and Audio Processing ECT 359-1)
Contents :
 Introduction
 Objective
 Theoretical background
 Flowchart
 Code and Execution
 Result with Discussion
 Applications
 Advantages
 Limitations and Future scope
 References
2
Introduction :
 The text-to-speech (TTS) synthesis procedure
consists of two main phases.
 The first is text analysis, where the input text is
transcribed into a phonetic or some other linguistic
representation.
 And the second one is the generation of speech
waveforms, where the output is produced from this
phonetic and prosodic information.
3
Introduction :
 These two phases are usually called high and low-level
synthesis . A simplified version of this procedure is
presented in figure below.
4
5
Objectives :
 Text to speech synthesizer will be of great help to
people with visual impairment .
 Text to speech synthesizer will help a machine to
communicate with users.
Theoretical Background :
6
 Speech Synthesis is the artificial production of human
speech.
 A synthesizer can incorporate a model of the vocal tract and
other human voice characteristics to create a completely
"synthetic" voice output.
 A computer system used for this purpose is called a speech
computer or speech synthesizer.
 A text-to-speech (TTS) system converts normal language text
into speech; other systems render symbolic linguistic
representations like phonetic transcriptions into speech.
TTS overview :
7
The procedure consist of two main phases:-
 Text Analysis
 Speech waveforms
 TEXT ANALYSIS : The input text is transcribed into a phonetic or some other
linguistic representation
 SPEECH WAVEFORMS : The acoustic output is produced from the phonetic
and prosodic information
Front End and Back End in TTS
8
 A text-to-speech system (or "engine") is composed of two
parts: a front-end and a back-end.
 The front-end converts raw text containing symbols like
numbers and abbreviations into the equivalent of written
out words (tokenization), then assigns phonetic
transcriptions to each word, and divides and marks the
text into prosodic units, like phrases, clauses, and
sentences (grapheme-phoneme conversion).
 The back-end often referred to as the synthesizer— then
converts the symbolic linguistic representation into sound.
9
FrFront End and Back End in TTS :
TTS Technology :
10
11
Speech Synthesizer used :
Concatenative synthesis is based on the concatenation (or
stringing together) of segments of recorded speech. Generally,
concatenative synthesis produces the most natural-sounding
synthesized speech.
 Concatenate segments of pre-recorded natural human
speech.
 Requires database of previously recorded human speech
covering all the possible segments to be synthesized .
 Segment might be phoneme, syllable, word, phrase, or any
combination .
Detailed Architecture of TTS systems :
12
.NET Framework :
 .NET is a framework developed by Microsoft.
 It is a new programming methodology.
 .NET is platform independent/cross platform ‘
 .NET is language insensitive.
 It includes a large class library known as Framework
Class Library (FCL).
13
Continued …….
14
 Microsoft also produces an IDE largely for .NET software
called Visual Studio.
 It provides language interoperability (each language can
use code written in other language ) across several
programming languages.
.NET Architecture :
15
.NET Execution :
16
Flowchart :
17
18
Code
19
20
Execution
21
Result :
22
 In this way , our aim to convert text which we passed as argument in
function is converted into artificial human voice (speech) .
 With the help of this TTS synthesizer , a blind guy can even read a book
or novel which is not available in braille language .
 This TTS synthesizer can be used in medical store for proper
pronunciation of medicines on cover or boxes.
 It is mostly used in voice stick device and voice assistant like Siri, google
assistant , Cortana and Alexa etc.
Applications :
 Talking Calculator
 Computer generated instructions
 Aids for the blind
 Telephone inquiry services
 Teaching machices
 Usage in education and daily life .
23
Advantages :
 Able to read large paragraphs .
 It offers a range of different accents and voices .
 Provide significant help for people with eyes disabilities.
 More accuracy in medical systems.
 It can be adapted easily to say whatever users want them to say.
 It provides talking machines for vocally impaired or deaf people
and better aids for speech therapy.
24
Limitations :
 No explicit emotions
 Homographs (Pronunciation)
 Prosody
 Language specific problems
 Special characters and symbols
25
Future Scope :
 It can also work in different languages like Marathi ,
Hindi , Kannada , etc.
 Accuracy will become better and can able to
pronounce symbols and special characters.
 Increasing variety of voices .
26
References :
 www.google.com
 www.youtube.com
 www.shareslide.net
 www.mathworks.com
 www.microsoft.com
28
Thank
you !!!!!!!!!!!!
Team Presentation
29

More Related Content

What's hot

NLP
NLPNLP
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
theyaseen51
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translation
Marcis Pinnis
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
ijma
 
Intro to NLP. Lecture 2
Intro to NLP.  Lecture 2Intro to NLP.  Lecture 2
Intro to NLP. Lecture 2
Ekaterina Chernyak
 
NLP new words
NLP new wordsNLP new words
NLP new words
guest9fc47a
 
12EEE032- text 2 voice
12EEE032-  text 2 voice12EEE032-  text 2 voice
12EEE032- text 2 voice
Nsaroj kumar
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
Rajat Jain
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
Hardik Kanjariya
 
MuLiST
MuLiSTMuLiST
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
paperpublications3
 
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
ravi sharma
 
A Marathi Hidden-Markov Model Based Speech Synthesis System
A Marathi Hidden-Markov Model Based Speech Synthesis SystemA Marathi Hidden-Markov Model Based Speech Synthesis System
A Marathi Hidden-Markov Model Based Speech Synthesis System
iosrjce
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translation
guest873a50
 
Sphinx4
Sphinx4Sphinx4
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
Rushdi Shams
 
A tutorial on Machine Translation
A tutorial on Machine TranslationA tutorial on Machine Translation
A tutorial on Machine Translation
Jaganadh Gopinadhan
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Basha Chand
 
Ijetcas14 458
Ijetcas14 458Ijetcas14 458
Ijetcas14 458
Iasir Journals
 

What's hot (19)

NLP
NLPNLP
NLP
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translation
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
 
Intro to NLP. Lecture 2
Intro to NLP.  Lecture 2Intro to NLP.  Lecture 2
Intro to NLP. Lecture 2
 
NLP new words
NLP new wordsNLP new words
NLP new words
 
12EEE032- text 2 voice
12EEE032-  text 2 voice12EEE032-  text 2 voice
12EEE032- text 2 voice
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
MuLiST
MuLiSTMuLiST
MuLiST
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
 
A Marathi Hidden-Markov Model Based Speech Synthesis System
A Marathi Hidden-Markov Model Based Speech Synthesis SystemA Marathi Hidden-Markov Model Based Speech Synthesis System
A Marathi Hidden-Markov Model Based Speech Synthesis System
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translation
 
Sphinx4
Sphinx4Sphinx4
Sphinx4
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
 
A tutorial on Machine Translation
A tutorial on Machine TranslationA tutorial on Machine Translation
A tutorial on Machine Translation
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Ijetcas14 458
Ijetcas14 458Ijetcas14 458
Ijetcas14 458
 

Similar to SAP (SPEECH AND AUDIO PROCESSING)

Speech Synthesis.pptx
Speech Synthesis.pptxSpeech Synthesis.pptx
Speech Synthesis.pptx
Subramanian Mani
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
tefflontrolegdy
 
A Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech SynthesisA Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech Synthesis
Cynthia King
 
SMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk SystemSMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk System
CSCJournals
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-Speech
Ngwe Tun
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis System
IJERA Editor
 
551 466-472
551 466-472551 466-472
551 466-472
idescitation
 
H010625862
H010625862H010625862
H010625862
IOSR Journals
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
iosrjce
 
F017163443
F017163443F017163443
F017163443
IOSR Journals
 
ACHIEVING SECURITY VIA SPEECH RECOGNITION
ACHIEVING SECURITY VIA SPEECH RECOGNITIONACHIEVING SECURITY VIA SPEECH RECOGNITION
ACHIEVING SECURITY VIA SPEECH RECOGNITION
ijistjournal
 
Voice based web browser
Voice based web browserVoice based web browser
Voice based web browser
SowndaryaP
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET Journal
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech Synthesis
CSCJournals
 
Ey4301913917
Ey4301913917Ey4301913917
Ey4301913917
IJERA Editor
 
G1803013542
G1803013542G1803013542
G1803013542
IOSR Journals
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
Thejus Joby
 
Direct Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete UnitsDirect Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete Units
IJCI JOURNAL
 
Assign
AssignAssign
Deciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsDeciphering voice of customer through speech analytics
Deciphering voice of customer through speech analytics
R Systems International
 

Similar to SAP (SPEECH AND AUDIO PROCESSING) (20)

Speech Synthesis.pptx
Speech Synthesis.pptxSpeech Synthesis.pptx
Speech Synthesis.pptx
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
 
A Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech SynthesisA Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech Synthesis
 
SMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk SystemSMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk System
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-Speech
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis System
 
551 466-472
551 466-472551 466-472
551 466-472
 
H010625862
H010625862H010625862
H010625862
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
 
F017163443
F017163443F017163443
F017163443
 
ACHIEVING SECURITY VIA SPEECH RECOGNITION
ACHIEVING SECURITY VIA SPEECH RECOGNITIONACHIEVING SECURITY VIA SPEECH RECOGNITION
ACHIEVING SECURITY VIA SPEECH RECOGNITION
 
Voice based web browser
Voice based web browserVoice based web browser
Voice based web browser
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech Synthesis
 
Ey4301913917
Ey4301913917Ey4301913917
Ey4301913917
 
G1803013542
G1803013542G1803013542
G1803013542
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
Direct Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete UnitsDirect Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete Units
 
Assign
AssignAssign
Assign
 
Deciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsDeciphering voice of customer through speech analytics
Deciphering voice of customer through speech analytics
 

Recently uploaded

Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
sachin chaurasia
 
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
University of Maribor
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
CSM Cloud Service Management Presentarion
CSM Cloud Service Management PresentarionCSM Cloud Service Management Presentarion
CSM Cloud Service Management Presentarion
rpskprasana
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
171ticu
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
JamalHussainArman
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
co23btech11018
 
New techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdfNew techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdf
wisnuprabawa3
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
NazakatAliKhoso2
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
jpsjournal1
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
MIGUELANGEL966976
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
nooriasukmaningtyas
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
IJECEIAES
 

Recently uploaded (20)

Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
 
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
CSM Cloud Service Management Presentarion
CSM Cloud Service Management PresentarionCSM Cloud Service Management Presentarion
CSM Cloud Service Management Presentarion
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
 
New techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdfNew techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdf
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
 

SAP (SPEECH AND AUDIO PROCESSING)

  • 2. Contents :  Introduction  Objective  Theoretical background  Flowchart  Code and Execution  Result with Discussion  Applications  Advantages  Limitations and Future scope  References 2
  • 3. Introduction :  The text-to-speech (TTS) synthesis procedure consists of two main phases.  The first is text analysis, where the input text is transcribed into a phonetic or some other linguistic representation.  And the second one is the generation of speech waveforms, where the output is produced from this phonetic and prosodic information. 3
  • 4. Introduction :  These two phases are usually called high and low-level synthesis . A simplified version of this procedure is presented in figure below. 4
  • 5. 5 Objectives :  Text to speech synthesizer will be of great help to people with visual impairment .  Text to speech synthesizer will help a machine to communicate with users.
  • 6. Theoretical Background : 6  Speech Synthesis is the artificial production of human speech.  A synthesizer can incorporate a model of the vocal tract and other human voice characteristics to create a completely "synthetic" voice output.  A computer system used for this purpose is called a speech computer or speech synthesizer.  A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech.
  • 7. TTS overview : 7 The procedure consist of two main phases:-  Text Analysis  Speech waveforms  TEXT ANALYSIS : The input text is transcribed into a phonetic or some other linguistic representation  SPEECH WAVEFORMS : The acoustic output is produced from the phonetic and prosodic information
  • 8. Front End and Back End in TTS 8  A text-to-speech system (or "engine") is composed of two parts: a front-end and a back-end.  The front-end converts raw text containing symbols like numbers and abbreviations into the equivalent of written out words (tokenization), then assigns phonetic transcriptions to each word, and divides and marks the text into prosodic units, like phrases, clauses, and sentences (grapheme-phoneme conversion).  The back-end often referred to as the synthesizer— then converts the symbolic linguistic representation into sound.
  • 9. 9 FrFront End and Back End in TTS :
  • 11. 11 Speech Synthesizer used : Concatenative synthesis is based on the concatenation (or stringing together) of segments of recorded speech. Generally, concatenative synthesis produces the most natural-sounding synthesized speech.  Concatenate segments of pre-recorded natural human speech.  Requires database of previously recorded human speech covering all the possible segments to be synthesized .  Segment might be phoneme, syllable, word, phrase, or any combination .
  • 12. Detailed Architecture of TTS systems : 12
  • 13. .NET Framework :  .NET is a framework developed by Microsoft.  It is a new programming methodology.  .NET is platform independent/cross platform ‘  .NET is language insensitive.  It includes a large class library known as Framework Class Library (FCL). 13
  • 14. Continued ……. 14  Microsoft also produces an IDE largely for .NET software called Visual Studio.  It provides language interoperability (each language can use code written in other language ) across several programming languages.
  • 19. 19
  • 21. 21
  • 22. Result : 22  In this way , our aim to convert text which we passed as argument in function is converted into artificial human voice (speech) .  With the help of this TTS synthesizer , a blind guy can even read a book or novel which is not available in braille language .  This TTS synthesizer can be used in medical store for proper pronunciation of medicines on cover or boxes.  It is mostly used in voice stick device and voice assistant like Siri, google assistant , Cortana and Alexa etc.
  • 23. Applications :  Talking Calculator  Computer generated instructions  Aids for the blind  Telephone inquiry services  Teaching machices  Usage in education and daily life . 23
  • 24. Advantages :  Able to read large paragraphs .  It offers a range of different accents and voices .  Provide significant help for people with eyes disabilities.  More accuracy in medical systems.  It can be adapted easily to say whatever users want them to say.  It provides talking machines for vocally impaired or deaf people and better aids for speech therapy. 24
  • 25. Limitations :  No explicit emotions  Homographs (Pronunciation)  Prosody  Language specific problems  Special characters and symbols 25
  • 26. Future Scope :  It can also work in different languages like Marathi , Hindi , Kannada , etc.  Accuracy will become better and can able to pronounce symbols and special characters.  Increasing variety of voices . 26
  • 27. References :  www.google.com  www.youtube.com  www.shareslide.net  www.mathworks.com  www.microsoft.com