SlideShare a Scribd company logo
1 of 21
CITY ENGINEERING COLLEGE
DEPARTMENT:- ELECTRONICS AND COMMUNICATION
INTERNSHIP TITLE
DATA SCIENCE
PROJECT TITLE
TEXT TO SPEECH CONVERSION USING PYTHON LANGUAGE
BY,
VISHWAS D V (1CE19EC010)
OUTLINE
 COMPANY PROFILE
 ABSTRACT
 INTRODUCTION
 COMPONENTS USED
 BLOCK DIAGRAM
 FLOW CHART
 PROJECT
 CONCLUSION
 PROS & CONS
 REFERENCE
COMPANY PROFILE
 iPEC Solutions is a service-oriented organization in Information
Technology (IT) domain, founded with the aim to create significant
space into IT Training & Consulting, Managed IT Services & Tech
Transformation.
 Our primary objective is to provide Artificial Intelligence (AI) enabled
solutions and services to organizations and individuals.
 The training division of iPEC Solutions is located in south Bangalore.
This division has successfully trained 1000+ students last year in
emerging technologies such as Data Science, Business Analytics,
Machine Learning, Conversational Ai, Python Programming, and
Artificial Intelligence etc.
 The trainers are highly qualified professionals with over ten years of
experience in IT sector.
 iPEC Solutions started with an aim to bridge the gap between
demand and supply for technically skilled workforce, by
providing new age courses related to Industry 4.0 technology.
 These programmes help in the development of new talents, and
inventive thinking for both existing and future jobs.
 Our name "iPEC" is an inspiring synergy of four words:
Innovative, Professional, Engineering, Consultant.
 By uniting these words, we emphasize in order to provide a
tailored solution to our customers, we apply innovation and
scientific principles.
ABSTRACT
 Speech is one of the oldest and most natural means of
information exchange between human.
 Over the years, Attempts have been made to develop vocally
interactive computers to realise voice/speech synthesis.
Obviously such an interface would yield great benefits.
 Text-To-Speech Synthesis is a Technology that provides a
means of converting written text from a descriptive form to a
spoken language that is easily understandable by the end user
(Basically in English Language).
 It runs on JAVA platform, and the methodology used was
Object Oriented Analysis and Development Methodology.
INTRODUCTION
 Voice/speech synthesis is a field of computer science that deals
with designing computer systems that synthesize written text.
 It is a technology that allows a computer to convert a written text
into speech via a microphone or telephone. As an emerging
technology, not all developers are familiar with speech technology.
 While the basic functions of both speech synthesis and speech
recognition takes only minutes to understand, there are subtle and
powerful capabilities provided by computerized speech that
developers will want to understand and utilize.
 The basic idea of text-to-speech (TTS) technology is to convert
written input to spoken output by generating synthetic speech.
 Text to Speech Synthesis Defined:
SOFTWARE USED
 PRAAT: Praat is a free computer software package for speech
analysis in phonetics. It was designed, and continues to be
developed, by Paul Boersma and David Weenink of the University of
Amsterdam. It can run on a wide range of operating systems,
including various versions of Unix, Linux, Mac and Microsoft
Windows (2000, XP, Vista, 7, 8, 10).
 Praat offers the possibility to record mono and stereo sounds and to
edit and analyse sounds regarding intensity, pitch height, duration or
formants.
 Furthermore, so called TextGrid Objects can be created
In order to transcribe and annotate sound objects.
AUDACITY:
 Audacity is a free, easy-to-use, multi-track audio editor and recorder for
Windows, macOS, GNU/Linux and other operating systems. The interface
is translated into many languages.
 You can use audacity to:
 Record live audio.
 Record computer playback on any Windows Vista or later machine.
 Convert tapes and records into digital recordings or CDs.
 Edit WAV, AIFF, FLAC, MP2, MP3, Ogg Vorbis sound files.
 AC3, M4A/M4R (AAC), WMA, Opus and other formats supported using
optional libraries.
 Numerous effects including change the speed, pitch or tempo of a
recording.
WAVESURFER:
 WaveSurfer is an audio editor widely used for studies of acoustic
phonetics.
 It is a simple but fairly powerful program for interactive display of sound
pressure waveforms, spectral sections, spectrograms, pitch tracks and
transcriptions.
 It can read and write a number of transcription file formats used in
industrial speech research including TIMIT.
 WaveSurfer is free software, distributed under a permissive free
software licence.
BLOCK DIAGRAM
FLOW CHART
PROJECT:
 A speech synthesis system is by definition a system, which produces synthetic
speech. It is implicitly clear, that this involves some sort of input. What is not
clear is the type of this input.
 If the input is plain text, which does not contain additional phonetic and/or
phonological information the system may be called a text-to-speech (TTS)
system.
 Continuous speech is a set of complicated audio signals which makes producing
them artificially difficult.
 Speech signals are usually considered as voiced or unvoiced, but in some
cases they are something between these two. Voiced sounds consist of
fundamental frequency (F0) and its harmonic components produced by vocal
cords (vocal folds).
 The airflow is forced through a vocal tract constriction which can occur in several
places between glottis and mouth.
 The application of synthetic speech is expanding fast whilst the quality
of TTS systems is also increasing steadily.
 Speech synthesis systems are also becoming more affordable for
common customers, making these systems suitable for everyday use.
 For example, better availability of TTS systems may increase
employability for people with communication difficulties.
 Applications for the Blind.
 . Applications for the Deafened and Vocally Handicapped
 Applications for Telecommunications and Multimedia.
Representation and Analysis of Speech Signals
 Continuous speech is a set of complicated audio signals which makes producing
them artificially difficult.
 Speech signals are usually considered as voiced or unvoiced, but in some cases
they are something between these two.
 Voiced sounds consist of fundamental frequency (F0) and its harmonic
components produced by vocal cords (vocal folds).
 The airflow is forced through a vocal tract constriction which can occur in several
places between glottis and mouth.
 Some sounds are produced with complete stoppage of airflow followed by a
sudden release, producing an impulsive turbulent excitation often followed by a
more protracted turbulent excitation (Allen et al., 1987).
Applications of Speech Synthesis
 The application of synthetic speech is expanding fast whilst the quality
of TTS systems is also increasing steadily.
 Speech synthesis systems are also becoming more affordable for
common customers, making these systems suitable for everyday use.
 For example, better availability of TTS systems may increase
employability for people with communication difficulties.
 Other Applications and Future Directions (e.g. Human-Machine
Interaction)
 Applications for Telecommunications and Multimedia.
Speech Synthesis Module
 The TTS system converts an arbitrary ASCII text to speech.
 The first step involves extracting the phonetic components of the message,
and we obtain a string of symbols representing sound-units (phonemes or
allophones), boundaries between words, phrases and sentences along with a
set of prosody markers (indicating the speed, the intonation etc.).
 The second step consists of finding the match between the sequence of
symbols and appropriate items stored in the phonetic inventory and binding
them together to form the acoustic signal for the voice output device.
Choice of Speech Engine and Programming Language
 The speech engine used in this new system was the FreeTTS speech
engine.
 FreeTTS was used because it is programmed using JAVA (the backbone
programming language of this designed TTS system).
 It also supports SAPI (Speech Application Programming Interface) which
is in synchronism with the JSAPI (Java Speech Application Programming
Interface). JSAPI was also the standardized interface used in the new
system.
 FreeTTS includes an engine for the vocal synthesis that supports a certain
number of voices (male and female) at different frequencies.
 It is recommended to use JSAPI to interface with FreeTTS because JSAPI
interface provides the best methods of controlling and using FreeTTS.
CONCLUSION:
 Synthesizing text is a high technology advancement and artificial formation of
speech given a text to be spoken.
 With Text-to-Speech synthesis, we can actually mediate and fill in the lacuna
provided by not fully exploiting the capabilities of some handicapped
individuals.
 It's never been so easy to use a text-to-speech program, as just one click and
your computer will speak any text aloud in a clear, natural sounding voice.
 This training can come in handy with proper tutor on how to handle JSML
language and how to use it to annotate text for the proper output and
emphasis.
REFERENCES:
 Bailey MS, Hemmeter J. Characteristics of noninstitutionalized DI and SSI program
participants, 2010 update. 2014. [July 10, 2015].
 Bainbridge K. Speech problems among US children (age 3-17 years). Washington, DC:
2015. [May 18]. (Workshop presentation to the Committee on the Evaluation of the
Supplemental Security Income (SSI) Disability Program for Children with Speech
Disorders and Language Disorders).
 Campbell TF, Dollaghan CA, Rockette HE, Paradise JL, Feldman HM, Shriberg LD,
Sabo DL, Kurs-Lasky M. Risk factors for speech delay of unknown origin in 3-year-old
children. Child Development. 2003;74(2):346–357.
 Clegg J, Hollis C, Mawhood L, Rutter M. Developmental language disorders—A follow-
up in later adult life. Cognitive, language and psychosocial outcomes. Journal of Child
Psychology and Psychiatry and Allied Disciplines. 2005;46(2):128–149.
 Eadie P, Morgan A, Okoumunne OC, Eecen KT, Wake M, Reilly S. Speech sound
disorder at 4 years: Prevalence, comorbidities, and predictors in a community cohort of
children. Developmental Medicine & Child Neurology. 2015;57(6):578–584.
THANK YOU

More Related Content

What's hot

Voice morphing-101113123852-phpapp01 (1)
Voice morphing-101113123852-phpapp01 (1)Voice morphing-101113123852-phpapp01 (1)
Voice morphing-101113123852-phpapp01 (1)Ravi Teja
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologyAamir-sheriff
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceIlhaan Marwat
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentationsamyakbhuta
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice RecognitionAmrita More
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionRHIMRJ Journal
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technologyKishan Vemula
 
Presentation on audio editing softwares by akanksha
Presentation on audio editing softwares by akankshaPresentation on audio editing softwares by akanksha
Presentation on audio editing softwares by akankshaakanksha singh
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxNsaroj kumar
 
Text to speech converter in C#.NET
Text to speech converter in C#.NETText to speech converter in C#.NET
Text to speech converter in C#.NETMandeep Cheema
 
digital scent technology
digital scent technologydigital scent technology
digital scent technologySRUTHY K S
 
Voice Browser
Voice BrowserVoice Browser
Voice BrowserSaila Sri
 
Artificial intelligence for speech recognition
Artificial intelligence for speech recognitionArtificial intelligence for speech recognition
Artificial intelligence for speech recognitionsowmith chatlapally
 

What's hot (20)

Voice morphing-101113123852-phpapp01 (1)
Voice morphing-101113123852-phpapp01 (1)Voice morphing-101113123852-phpapp01 (1)
Voice morphing-101113123852-phpapp01 (1)
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentation
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech Recognition
 
Voicemorphing
VoicemorphingVoicemorphing
Voicemorphing
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
 
Presentation on audio editing softwares by akanksha
Presentation on audio editing softwares by akankshaPresentation on audio editing softwares by akanksha
Presentation on audio editing softwares by akanksha
 
VOICE BROWSER
VOICE BROWSERVOICE BROWSER
VOICE BROWSER
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptx
 
Text to speech converter in C#.NET
Text to speech converter in C#.NETText to speech converter in C#.NET
Text to speech converter in C#.NET
 
digital scent technology
digital scent technologydigital scent technology
digital scent technology
 
Voice Browser
Voice BrowserVoice Browser
Voice Browser
 
Artificial intelligence for speech recognition
Artificial intelligence for speech recognitionArtificial intelligence for speech recognition
Artificial intelligence for speech recognition
 
TACTILE INTERNET
TACTILE INTERNETTACTILE INTERNET
TACTILE INTERNET
 
Audio and Video Streaming
Audio and Video StreamingAudio and Video Streaming
Audio and Video Streaming
 
Video Streaming
Video StreamingVideo Streaming
Video Streaming
 

Similar to visH (fin).pptx

Unlocking the Power of AI Text-to-Speech
Unlocking the Power of AI Text-to-SpeechUnlocking the Power of AI Text-to-Speech
Unlocking the Power of AI Text-to-SpeechNola58
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech RecognitionThejus Joby
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingiosrjce
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing SystemIRJET Journal
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET Journal
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingIOSR Journals
 
SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)dineshkatta4
 
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...IJERA Editor
 
A Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech SynthesisA Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech SynthesisCynthia King
 
Leveraging machine learning in text to-speech tools and applications.
Leveraging machine learning in text to-speech tools and applications.Leveraging machine learning in text to-speech tools and applications.
Leveraging machine learning in text to-speech tools and applications.Countants
 
VOICE BASED E-MAIL
VOICE BASED E-MAILVOICE BASED E-MAIL
VOICE BASED E-MAILStudentRocks
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSubmissionResearchpa
 
Automatic subtitle generation
Automatic subtitle generationAutomatic subtitle generation
Automatic subtitle generationtanyasaxena1611
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemIJERA Editor
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 

Similar to visH (fin).pptx (20)

Unlocking the Power of AI Text-to-Speech
Unlocking the Power of AI Text-to-SpeechUnlocking the Power of AI Text-to-Speech
Unlocking the Power of AI Text-to-Speech
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
H010625862
H010625862H010625862
H010625862
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And Recording
 
SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)
 
D1803041822
D1803041822D1803041822
D1803041822
 
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
 
A Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech SynthesisA Short Introduction To Text-To-Speech Synthesis
A Short Introduction To Text-To-Speech Synthesis
 
An Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile EnvironmentAn Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile Environment
 
voice browser
voice browservoice browser
voice browser
 
Leveraging machine learning in text to-speech tools and applications.
Leveraging machine learning in text to-speech tools and applications.Leveraging machine learning in text to-speech tools and applications.
Leveraging machine learning in text to-speech tools and applications.
 
VOICE BASED E-MAIL
VOICE BASED E-MAILVOICE BASED E-MAIL
VOICE BASED E-MAIL
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speech
 
Automatic subtitle generation
Automatic subtitle generationAutomatic subtitle generation
Automatic subtitle generation
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis System
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 

Recently uploaded

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
DATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage exampleDATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage examplePragyanshuParadkar1
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 

Recently uploaded (20)

Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
DATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage exampleDATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage example
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 

visH (fin).pptx

  • 1. CITY ENGINEERING COLLEGE DEPARTMENT:- ELECTRONICS AND COMMUNICATION INTERNSHIP TITLE DATA SCIENCE PROJECT TITLE TEXT TO SPEECH CONVERSION USING PYTHON LANGUAGE BY, VISHWAS D V (1CE19EC010)
  • 2. OUTLINE  COMPANY PROFILE  ABSTRACT  INTRODUCTION  COMPONENTS USED  BLOCK DIAGRAM  FLOW CHART  PROJECT  CONCLUSION  PROS & CONS  REFERENCE
  • 3. COMPANY PROFILE  iPEC Solutions is a service-oriented organization in Information Technology (IT) domain, founded with the aim to create significant space into IT Training & Consulting, Managed IT Services & Tech Transformation.  Our primary objective is to provide Artificial Intelligence (AI) enabled solutions and services to organizations and individuals.  The training division of iPEC Solutions is located in south Bangalore. This division has successfully trained 1000+ students last year in emerging technologies such as Data Science, Business Analytics, Machine Learning, Conversational Ai, Python Programming, and Artificial Intelligence etc.  The trainers are highly qualified professionals with over ten years of experience in IT sector.
  • 4.  iPEC Solutions started with an aim to bridge the gap between demand and supply for technically skilled workforce, by providing new age courses related to Industry 4.0 technology.  These programmes help in the development of new talents, and inventive thinking for both existing and future jobs.  Our name "iPEC" is an inspiring synergy of four words: Innovative, Professional, Engineering, Consultant.  By uniting these words, we emphasize in order to provide a tailored solution to our customers, we apply innovation and scientific principles.
  • 5. ABSTRACT  Speech is one of the oldest and most natural means of information exchange between human.  Over the years, Attempts have been made to develop vocally interactive computers to realise voice/speech synthesis. Obviously such an interface would yield great benefits.  Text-To-Speech Synthesis is a Technology that provides a means of converting written text from a descriptive form to a spoken language that is easily understandable by the end user (Basically in English Language).  It runs on JAVA platform, and the methodology used was Object Oriented Analysis and Development Methodology.
  • 6. INTRODUCTION  Voice/speech synthesis is a field of computer science that deals with designing computer systems that synthesize written text.  It is a technology that allows a computer to convert a written text into speech via a microphone or telephone. As an emerging technology, not all developers are familiar with speech technology.  While the basic functions of both speech synthesis and speech recognition takes only minutes to understand, there are subtle and powerful capabilities provided by computerized speech that developers will want to understand and utilize.  The basic idea of text-to-speech (TTS) technology is to convert written input to spoken output by generating synthetic speech.
  • 7.  Text to Speech Synthesis Defined:
  • 8. SOFTWARE USED  PRAAT: Praat is a free computer software package for speech analysis in phonetics. It was designed, and continues to be developed, by Paul Boersma and David Weenink of the University of Amsterdam. It can run on a wide range of operating systems, including various versions of Unix, Linux, Mac and Microsoft Windows (2000, XP, Vista, 7, 8, 10).  Praat offers the possibility to record mono and stereo sounds and to edit and analyse sounds regarding intensity, pitch height, duration or formants.  Furthermore, so called TextGrid Objects can be created In order to transcribe and annotate sound objects.
  • 9. AUDACITY:  Audacity is a free, easy-to-use, multi-track audio editor and recorder for Windows, macOS, GNU/Linux and other operating systems. The interface is translated into many languages.  You can use audacity to:  Record live audio.  Record computer playback on any Windows Vista or later machine.  Convert tapes and records into digital recordings or CDs.  Edit WAV, AIFF, FLAC, MP2, MP3, Ogg Vorbis sound files.  AC3, M4A/M4R (AAC), WMA, Opus and other formats supported using optional libraries.  Numerous effects including change the speed, pitch or tempo of a recording.
  • 10. WAVESURFER:  WaveSurfer is an audio editor widely used for studies of acoustic phonetics.  It is a simple but fairly powerful program for interactive display of sound pressure waveforms, spectral sections, spectrograms, pitch tracks and transcriptions.  It can read and write a number of transcription file formats used in industrial speech research including TIMIT.  WaveSurfer is free software, distributed under a permissive free software licence.
  • 13. PROJECT:  A speech synthesis system is by definition a system, which produces synthetic speech. It is implicitly clear, that this involves some sort of input. What is not clear is the type of this input.  If the input is plain text, which does not contain additional phonetic and/or phonological information the system may be called a text-to-speech (TTS) system.  Continuous speech is a set of complicated audio signals which makes producing them artificially difficult.  Speech signals are usually considered as voiced or unvoiced, but in some cases they are something between these two. Voiced sounds consist of fundamental frequency (F0) and its harmonic components produced by vocal cords (vocal folds).  The airflow is forced through a vocal tract constriction which can occur in several places between glottis and mouth.
  • 14.  The application of synthetic speech is expanding fast whilst the quality of TTS systems is also increasing steadily.  Speech synthesis systems are also becoming more affordable for common customers, making these systems suitable for everyday use.  For example, better availability of TTS systems may increase employability for people with communication difficulties.  Applications for the Blind.  . Applications for the Deafened and Vocally Handicapped  Applications for Telecommunications and Multimedia.
  • 15. Representation and Analysis of Speech Signals  Continuous speech is a set of complicated audio signals which makes producing them artificially difficult.  Speech signals are usually considered as voiced or unvoiced, but in some cases they are something between these two.  Voiced sounds consist of fundamental frequency (F0) and its harmonic components produced by vocal cords (vocal folds).  The airflow is forced through a vocal tract constriction which can occur in several places between glottis and mouth.  Some sounds are produced with complete stoppage of airflow followed by a sudden release, producing an impulsive turbulent excitation often followed by a more protracted turbulent excitation (Allen et al., 1987).
  • 16. Applications of Speech Synthesis  The application of synthetic speech is expanding fast whilst the quality of TTS systems is also increasing steadily.  Speech synthesis systems are also becoming more affordable for common customers, making these systems suitable for everyday use.  For example, better availability of TTS systems may increase employability for people with communication difficulties.  Other Applications and Future Directions (e.g. Human-Machine Interaction)  Applications for Telecommunications and Multimedia.
  • 17. Speech Synthesis Module  The TTS system converts an arbitrary ASCII text to speech.  The first step involves extracting the phonetic components of the message, and we obtain a string of symbols representing sound-units (phonemes or allophones), boundaries between words, phrases and sentences along with a set of prosody markers (indicating the speed, the intonation etc.).  The second step consists of finding the match between the sequence of symbols and appropriate items stored in the phonetic inventory and binding them together to form the acoustic signal for the voice output device.
  • 18. Choice of Speech Engine and Programming Language  The speech engine used in this new system was the FreeTTS speech engine.  FreeTTS was used because it is programmed using JAVA (the backbone programming language of this designed TTS system).  It also supports SAPI (Speech Application Programming Interface) which is in synchronism with the JSAPI (Java Speech Application Programming Interface). JSAPI was also the standardized interface used in the new system.  FreeTTS includes an engine for the vocal synthesis that supports a certain number of voices (male and female) at different frequencies.  It is recommended to use JSAPI to interface with FreeTTS because JSAPI interface provides the best methods of controlling and using FreeTTS.
  • 19. CONCLUSION:  Synthesizing text is a high technology advancement and artificial formation of speech given a text to be spoken.  With Text-to-Speech synthesis, we can actually mediate and fill in the lacuna provided by not fully exploiting the capabilities of some handicapped individuals.  It's never been so easy to use a text-to-speech program, as just one click and your computer will speak any text aloud in a clear, natural sounding voice.  This training can come in handy with proper tutor on how to handle JSML language and how to use it to annotate text for the proper output and emphasis.
  • 20. REFERENCES:  Bailey MS, Hemmeter J. Characteristics of noninstitutionalized DI and SSI program participants, 2010 update. 2014. [July 10, 2015].  Bainbridge K. Speech problems among US children (age 3-17 years). Washington, DC: 2015. [May 18]. (Workshop presentation to the Committee on the Evaluation of the Supplemental Security Income (SSI) Disability Program for Children with Speech Disorders and Language Disorders).  Campbell TF, Dollaghan CA, Rockette HE, Paradise JL, Feldman HM, Shriberg LD, Sabo DL, Kurs-Lasky M. Risk factors for speech delay of unknown origin in 3-year-old children. Child Development. 2003;74(2):346–357.  Clegg J, Hollis C, Mawhood L, Rutter M. Developmental language disorders—A follow- up in later adult life. Cognitive, language and psychosocial outcomes. Journal of Child Psychology and Psychiatry and Allied Disciplines. 2005;46(2):128–149.  Eadie P, Morgan A, Okoumunne OC, Eecen KT, Wake M, Reilly S. Speech sound disorder at 4 years: Prevalence, comorbidities, and predictors in a community cohort of children. Developmental Medicine & Child Neurology. 2015;57(6):578–584.