SlideShare a Scribd company logo
1 of 5
Download to read offline
M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917
www.ijera.com 913 | P a g e
Text To Speech System for Telugu Language
M. Siva Kumar1
, E. Prakash Babu2
, Dr. M. V. Subba Reddy3
, M. S. Praveen
Kumar4
1
(Student, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of Engineering
and Technology, Puttur, Andhra Pradesh, India)
2
(Assoc. Prof, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of
Engineering and Technology, Puttur, Andhra Pradesh, India)
3
(Professor, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of Engineering
and Technology, Puttur, Andhra Pradesh, India)
4
(Student, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of Engineering
and Technology, Puttur, Andhra Pradesh, India)
Abstract
Telugu is one of the oldest languages in India. This paper describes the development of Telugu Text-to-Speech
System (TTS).In Telugu TTS the input is Telugu text in Unicode. The voices are sampled from real recorded
speech. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken
waveform. Speech synthesis is a process of building machinery that can generate human-like speech from any
text input to imitate human speakers. Text processing and speech generation are two main components of a text
to speech system. To build a natural sounding speech synthesis system, it is essential that text processing
component produce an appropriate sequence of phonemic units. Generation of sequence of phonetic units for a
given standard word is referred to as letter to phoneme rule or text to phoneme rule. The complexity of
these rules and their derivation depends upon the nature of the language. The quality of a speech synthesizer is
judged by its closeness to the natural human voice and understandability. In this paper we described an approach
to build a Telugu TTS system using concatenative synthesis method with syllable as a basic unit of
concatenation.
Keywords - Text processing, speech generation, phoneme, grapheme, Speech synthesis.
I. INTRODUCTION
A. Speech Synthesis
The conversion of words in written form
into speech is non-trivial. Even if we can store a huge
dictionary for most common words; the TTS system
still needs to deal with millions of names and
acronyms. Moreover, in order to sound natural, the
intonation of the sentences must be appropriately
generated. Synthesis of speech cannot be
accomplished by cutting and pasting smaller units
together. Attention has to be paid to smoothing out
the discontinuities in such a process so that the
resulting signal approximates natural speech.
According to the speech generation model used,
speech synthesis can be classified into three
categories as Articulatory synthesis, Formant
synthesis and Concatenative synthesis. Based on the
degree of manual intervention in the design, speech
synthesis can be classified into two categories
Synthesis by rule and Data-driven synthesis.
Prosody and intonation are quite important
for natural sounding speech.. There are in existence
speech synthesis systems which replicate the
prosodic features of human speech. This involves
fairly complex parsing of the input sentences and
using rather complex rules to determine the
intonation patterns.
Generation of sequence of phonetic units for
a given standard word is referred to as letter to
phoneme rule or text to phoneme rule. Voiced sounds
were simulated with a computer model of the vocal
fold composed of a single mass vibrating both
parallel and perpendicular to the air flow.
The work is divided into 3 main modules.
1. Converting Telugu script to Unicode
2. Differentiating Grapheme
i. Combination of Consonant-vowel
ii. Combination of Consonant-consonant
3. Generating voice
i. Identify the Grapheme recognizer
ii. Identify the Telugu Audio source
1. Converting Telugu Script to Unicode
In English ASCII characters are used
where as In Telugu Unicode characters are used.
ASCII takes 8-bits for each character. Unicode takes
16-bits for each character. Unicode provides a unique
RESEARCH ARTICLE OPEN ACCESS
M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917
www.ijera.com 914 | P a g e
number for every character, no matter what the
platform, no matter what the program, no matter what
the language.
2. Differentiating Grapheme
In natural speech, durations of phonetic
segments are strongly dependent on contextual
factors. For synthetic speech to sound natural, the
module for computing segmental duration must
mimic these contextual effects as closely as possible.
Grapheme:
Graphemes are “functional spelling units”
encompassing one or more letters of the text input,
a grapheme in the text input corresponds to a
single phoneme.
Phoneme:
Phones characterize any sound that can be
produced by a human vocal tract, if a phone is part of
a specific language; it becomes a phoneme of the
language. Phonemes are the elementary sounds of a
language.
In this paper we are going to
differentiate A character in Indian language scripts is
close to syllable and can be typically of the
following form:
Different Combinations:
V-Vowel
C- Consonant
C+V-Consonant+Vowel
C+C-Consonant+Consonant
C+C+V-Consonant+Consonant+Vowel
C+C+C--Consonant+Consonant+Consonant
Example:
Fig.2. Differentiating grapheme
3.Generating Voice
The function of Text-To-Speech (TTS)
system is to convert the given text to a spoken
waveform. This conversion involves text processing
and speech generation processes. These processes
have connections to linguistic theory, models of
speech production, and acoustic-phonetic
characterization of language. Text processing
including end-of-sentence detection, text
normalization. Word pronunciation, including the
pronunciation of names and the disambiguation of
homographs.
In this approach, the pre-recorded speech
segments which are to be used in the synthesizer
are stored exactly as how it is recorded.
Additional information of the speech waveform is
attached to the sound to provide proper annotation of
the speech waveform.
To synthesize a particular language, required
units (diphones) from the database which doesn't
contain any language specific information and
these selected units were then typically altered
by signal processing functions to meet the
language specific target specification generated by
different modules in the synthesizer. To build a
voice/speech for a language text, the steps involved
are as follows
B. Existing system
Many of the improvements in speech
synthesis over the past years have come from creative
use of the technologies developed for speech
recognition. We can build systems that interact
through speech, which the system can listen to what
was said, compute or do something, and then speak
Fig.1. Converting Telugu script to Unicode
Fig.1. Converting Telugu script to Unicode
Telugu Text Unicode
Fig.3. Generating Voice
Telugu Language Text as Input
Text Processing
Grapheme - Phoneme
conversion
Prosodic Modeling
Acoustic Synthesis
Wave Data
Generation
M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917
www.ijera.com 915 | P a g e
back, using spoken language generation. Festivox is
speech synthesis system developed by speech group
at CMU; it provides a general framework for building
speech synthesis systems. It offers full text to speech
through a number API’s from shell level, though a
scheme command interpreter, as a C++ library, from
Java, and an Emacs interface. The Festvox project:
automating the processes involved in building
synthetic voices for new languages. Creating festival
TTS for other languages, especially Indian languages
is very similar and will mostly require language
specific changes to be made to the code in these
modules. Festival uses diphone as basic unit.
Speech synthesis is the artificial production
of human speech. A computer system used for this
purpose is called a speech synthesizer, and can be
implemented in software or hardware products. A
text-to-speech (TTS) system converts normal
language text into speech; other systems render
symbolic linguistic representations like phonetic
transcriptions into speech. The overview of the
problems that occur during text-to-speech (TTS)
conversion and describe the particular solutions to
these problems taken within the AT&T Bell
Laboratories TTS system.
II. TELUGU LANGUAGE
The scripts of Indian languages have
originated from the ancient Brahmi script. The basic
units of writing system are characters which are
orthographic representation of speech sounds. A
character in Indian language scripts is close to
syllable and can be typically of the following form:
C, V, CV, CCV and CVC, where C is a consonant and
V is a vowel. There are about 35 consonants and
about 18 vowels in Indian languages. An important
feature of Indian language scripts is their phonetic
nature. There is more or less one to one
correspondence between what is written and what is
spoken. The rules required to map the letters to
sounds of Indian languages are almost straight
forward. All Indian language scripts have common
phonetic base.
Syllabification Rules
There is almost one to one correspondence
between what is written and what is spoken in Indian
Languages. Each character in Indian language script
has a correspondence to a sound of that language. In
Indian languages, a consonant character is inherently
bound with the vowel sound /a/, and is almost always
pronounced with this vowel. This occurs at both word
final and word middle positions. A few heuristic rules
to detect IVS of a consonant character are noted
below. While letter to phone rules are almost
straightforward in Indian languages, the
syllabification rules are not trivial. There is need to
come up with some rules to break the word into
syllables. We have derived certain simplistic rules for
syllabification i.e. rules for grouping clusters of
C*VC* based on heuristic analysis of several words
in Telugu language.
III. SYSTEM ARCHITECTURE
The main purpose of the system is to
convert an arbitrary text into its corresponding
spoken waveform. Text processing and speech
generation are two main components of a text to
speech system. To build a natural sounding speech
synthesis system, it is essential that text processing
component produce an appropriate sequence of
phonemic units. Generation of sequence of phonetic
units for a given standard word is referred to as letter
to phoneme rule or text to phoneme rule. The
complexity of these rules and their derivation
depends upon the nature of the language. In Telugu
TTS the input is Telugu text in Unicode.
Text syllabification will segment the
normalized text to syllable unit according to Telugu
language rules. Phone set Definition module defines
the complete set of phones used in Telugu speech. It
also includes feature definitions of these phones.
Lexical Analysis module is used to arrive at the
phones that make up the pronunciation of a particular
word. Since Telugu is phonetic in nature, we do not
require a dictionary for lexical analysis. Instead, this
module defines letter-to-sound rules (LTS) which are
used to arrive at the speech phones based on the
spelling of the word
The input text is converted into readable text
and the syllabification module will generate the
sequence of syllables that should be extracted from
the speech corpus to be concatenated and play the
sound files.
M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917
www.ijera.com 916 | P a g e
IV. RESULTS
A Telugu TTS system using syllables as
basic unit of concatenation is presented. The quality
of the synthesized speech is reasonably natural. The
proposed approach minimizes the co articulation
effects and prosodic mismatch between two
adjacent units. Selection of a unit is based on the
presiding and succeeding context of the syllables
and the position of the syllable. The system
implements a matching function which assigns
weight to the syllable based on its nature and the
syllable with maximum weight is selected as output
speech units. We have observed the efficiency of this
approach for Telugu language and found that the
performance of this approach is better.
Fig.5. Playing the generated voice
The results showed that there is a strong
correlation between the values of the source
parameter in the vowel midpoint and the vowel
duration. The same parameters tend to decrease on
vowel onsets and to increase on vowels offsets. This
seems to indicate a prosodic nature of these
parameters requiring special treatment in
concatenative-based TTS systems that use source
modification techniques, such as pitch
synchronous overlap add and multipulse.
V. CONCLUSION
Speech synthesis techniques , it is much
easier to build a voice in a language with fewer
sentences and a smaller Speech]. The proposed
approach minimizes the co articulation effects and
prosodic mismatch between two adjacent units.
Selection of a unit is based on the presiding and
succeeding context of the syllables and the position
of the syllable. The system implements a matching
function which assigns weight to the syllable based
on its nature and the syllable with maximum
weight is selected as output speech units. We have
observed the efficiency of this approach for Telugu
language and found that the performance of this
approach is better.
Telugu TTS system using syllables as basic
unit of concatenation is presented. The quality of the
synthesized speech is reasonably natural. The
proposed approach minimizes the co articulation
effects and prosodic mismatch between two adjacent
units. Selection of a unit is based on the presiding and
succeeding context of the syllables and the position
of the syllable. The system implements a matching
function which assigns weight to the syllable based
on its nature and the syllable with maximum weight
is selected as output speech units. We have observed
the efficiency of this approach for Telugu language
and found that the performance of this approach is
better.
VI. FUTURE ENHANCEMENTS
The generated speech shows distortion at the
concatenation point of two syllables. If this distortion
is significant then it would loose the naturalness. This
distortion can be minimized by adding smoothing
module which would modify the speech parameters
like pitch formants and intensity at the concatenation
point. More attention needs to be paid to the
frequency and amplitude range as well as the
duration of each pre-recorded segments to produce
high quality speech.
A number of other ongoing projects are
aimed at developing a POS tag set, POStagger
and a tagged corpus for Sinhala. Further work will
focus on expanding the pronunciation lexicon. At
present, the G2P rules are incapable of providing
accurate pronunciation for most compound words.
Thus, we are planning to construct a lexicon
consisting of compound words along with common
high frequency words found in our Sinhala text
corpus, which are currently incorrectly phonetized.
REFERENCES
[1] Lakshmi A, Hema A Murthy. A Syllable
Based Continuous Speech Recognizer for
Tamil. In Proc. of the 2nd Int.
Workshop on East-Asian Language
Resources and Evaluation,2009.
[2] Sreekanth Majji, Ramakrisshnan A.G
“Festival Based maiden TTS system for
Tamil Language”, 3rd Language &
Technology Conference: Human Language
Technologies as a Challenge for Computer
Science and Linguistics October 5-7, 2007,
PoznaĔ, Poland.
[3] ZenH.,NoseT.,YamagishiJ.,SakoS.,Masuko
T.,Black A.W., andTokudaK.,“The hmm-
based speech synthesis system
version2.0,” in Proc.ofISCASSW6, Bonn,
Germany,2007.
M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917
www.ijera.com 917 | P a g e
[4] BlackA.W. ZenH. and TokudaK. “Statistical
parametric speech synthesis,” in proceeding
sofIEEEInt.Conf. Acoust. Speech, and
Signal Processing, Honolulu, USA, 2007.
[5] Rohit Kumar, S. P. Kishore, “Automatic
Pruning of Unit Selection Speech Databases
for Synthesis without loss of Naturalness”,
submitted to ICSLP, Jeju Island, Korea,2004
[6] S P Kishore, Rohit Kumar and Rajeev
Sangal, “A Data Driven Synthesis Approach
For Indian Languages using Syllables as
BasicUnit”, in Proceedings of Intl. Conf. on
NLP (ICON) 2002, pp. 311-316, Mumbai,
India, 200
[7] A.M. Zeki and N. Azizah, "A Speech
Synthesizer for Malay Language”, National
Conference on Research and Development
in Computer Science, Selangor, Malaysia,
October 2001
[8] X. Huang, A.Acero and H.-W. Hon,
“Spoken Language Processing A Guide to
Theory, Algorithm and System
Development”, New Jersey: Prentice Hall,
2001
[9] Min Chu, Hu Peng, Hong-yun Yang, Eric
Chang, “Selecting Non-Uniform Units from
a very large corpus for Concatenative
Speech Synthesizer”, in Proceedings of
ICASSP, Salt Lake City, 2001
[10] Sproat R. (1995): finite-state architecture
for tokenization and grapheme-to-phoneme
conversion for multilingual text analysis”. In
From text to tags: Issues in multilingual
language analysis. Proc. ACL SIGDAT
Workshop (Dublin, Ireland), 65-72
[11] Sproat R., Olive J. (1995): “Text to speech
syn-thesis”. AT&T T echnical Journal 74(2),
35-44
[12] Sproat R., Olive J. (1996): “A modular
architec-ture for multi-lingual text-to-
speech”. In J. van Santen, R. Sproat, J. Olive
and J. Hirschberg (eds.), Progress in speech
synthesis (Springer, New York).
[13] T alkin D., Rowley J. (1990): “Pitch-syn-
chronous analysis and synthesis for TTS
systems”.Proc. ESCA Workshop on Speech
Synthesis (Autrans, France), 55-58.
[14] A.M. Zeki and N. Azizah, "A Speech
Synthesizer for Malay Language”, National
Conference on Research and Development
in Computer Science, Selangor, Malaysia,
October 2001.

More Related Content

What's hot

Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...ravi sharma
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...kevig
 
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATIONA ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATIONkevig
 
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...kevig
 
Development of text to speech system for yoruba language
Development of text to speech system for yoruba languageDevelopment of text to speech system for yoruba language
Development of text to speech system for yoruba languageAlexander Decker
 
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...kevig
 
A Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to PunjabiA Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to PunjabiIJERA Editor
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionaryiosrjce
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silencepaperpublications3
 
Phonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech SystemsPhonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech Systemspaperpublications3
 
Implementation of Text To Speech for Marathi Language Using Transcriptions Co...
Implementation of Text To Speech for Marathi Language Using Transcriptions Co...Implementation of Text To Speech for Marathi Language Using Transcriptions Co...
Implementation of Text To Speech for Marathi Language Using Transcriptions Co...IJERA Editor
 
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEijnlc
 
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...kevig
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerijnlc
 
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEkevig
 

What's hot (17)

Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
 
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATIONA ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION
 
Ijetcas14 458
Ijetcas14 458Ijetcas14 458
Ijetcas14 458
 
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC...
 
Development of text to speech system for yoruba language
Development of text to speech system for yoruba languageDevelopment of text to speech system for yoruba language
Development of text to speech system for yoruba language
 
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
 
A Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to PunjabiA Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
A Tool to Search and Convert Reduplicate Words from Hindi to Punjabi
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionary
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
Phonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech SystemsPhonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech Systems
 
Implementation of Text To Speech for Marathi Language Using Transcriptions Co...
Implementation of Text To Speech for Marathi Language Using Transcriptions Co...Implementation of Text To Speech for Marathi Language Using Transcriptions Co...
Implementation of Text To Speech for Marathi Language Using Transcriptions Co...
 
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
 
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzer
 
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
 

Viewers also liked (20)

D43042025
D43042025D43042025
D43042025
 
F43053237
F43053237F43053237
F43053237
 
Dg4301625635
Dg4301625635Dg4301625635
Dg4301625635
 
Cc4301455457
Cc4301455457Cc4301455457
Cc4301455457
 
F43022431
F43022431F43022431
F43022431
 
Cf4301466469
Cf4301466469Cf4301466469
Cf4301466469
 
Cm4301507513
Cm4301507513Cm4301507513
Cm4301507513
 
D43021519
D43021519D43021519
D43021519
 
Eo4301852855
Eo4301852855Eo4301852855
Eo4301852855
 
By4301435440
By4301435440By4301435440
By4301435440
 
D43062127
D43062127D43062127
D43062127
 
I43055257
I43055257I43055257
I43055257
 
Ew4301904907
Ew4301904907Ew4301904907
Ew4301904907
 
Fa4301925930
Fa4301925930Fa4301925930
Fa4301925930
 
El4301832837
El4301832837El4301832837
El4301832837
 
Cj4301492497
Cj4301492497Cj4301492497
Cj4301492497
 
J43025258
J43025258J43025258
J43025258
 
C43021014
C43021014C43021014
C43021014
 
Lk3619561964
Lk3619561964Lk3619561964
Lk3619561964
 
On Performance
On PerformanceOn Performance
On Performance
 

Similar to Ey4301913917

An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicijnlc
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET Journal
 
HINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVM
HINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVMHINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVM
HINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVMijnlc
 
A Context-based Numeral Reading Technique for Text to Speech Systems
A Context-based Numeral Reading Technique for Text to Speech Systems A Context-based Numeral Reading Technique for Text to Speech Systems
A Context-based Numeral Reading Technique for Text to Speech Systems IJECEIAES
 
Implementation Of Syntax Parser For English Language Using Grammar Rules
Implementation Of Syntax Parser For English Language Using Grammar RulesImplementation Of Syntax Parser For English Language Using Grammar Rules
Implementation Of Syntax Parser For English Language Using Grammar RulesIJERA Editor
 
On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...rahulmonikasharma
 
A Corpus-Based Concatenative Speech Synthesis System for Marathi
A Corpus-Based Concatenative Speech Synthesis System for MarathiA Corpus-Based Concatenative Speech Synthesis System for Marathi
A Corpus-Based Concatenative Speech Synthesis System for Marathiiosrjce
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech SynthesisCSCJournals
 
A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...
A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...
A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...IJCI JOURNAL
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemIJERA Editor
 
Phonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: KannadaPhonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: KannadaIJERA Editor
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silencepaperpublications3
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShashank Shisodia
 

Similar to Ey4301913917 (20)

An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabic
 
G1803013542
G1803013542G1803013542
G1803013542
 
F017163443
F017163443F017163443
F017163443
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
 
HINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVM
HINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVMHINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVM
HINDI AND MARATHI TO ENGLISH MACHINE TRANSLITERATION USING SVM
 
I1 geetha3 revathi
I1 geetha3 revathiI1 geetha3 revathi
I1 geetha3 revathi
 
A Context-based Numeral Reading Technique for Text to Speech Systems
A Context-based Numeral Reading Technique for Text to Speech Systems A Context-based Numeral Reading Technique for Text to Speech Systems
A Context-based Numeral Reading Technique for Text to Speech Systems
 
Implementation Of Syntax Parser For English Language Using Grammar Rules
Implementation Of Syntax Parser For English Language Using Grammar RulesImplementation Of Syntax Parser For English Language Using Grammar Rules
Implementation Of Syntax Parser For English Language Using Grammar Rules
 
On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...
 
A Corpus-Based Concatenative Speech Synthesis System for Marathi
A Corpus-Based Concatenative Speech Synthesis System for MarathiA Corpus-Based Concatenative Speech Synthesis System for Marathi
A Corpus-Based Concatenative Speech Synthesis System for Marathi
 
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech Synthesis
 
A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...
A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...
A SURVEY OF LANGUAGE-DETECTION, FONTDETECTION AND FONT-CONVERSION SYSTEMS FOR...
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis System
 
Ijetcas14 444
Ijetcas14 444Ijetcas14 444
Ijetcas14 444
 
Pxc3898474
Pxc3898474Pxc3898474
Pxc3898474
 
Phonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: KannadaPhonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: Kannada
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 
Ijetcas14 575
Ijetcas14 575Ijetcas14 575
Ijetcas14 575
 

Recently uploaded

Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Recently uploaded (20)

Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

Ey4301913917

  • 1. M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917 www.ijera.com 913 | P a g e Text To Speech System for Telugu Language M. Siva Kumar1 , E. Prakash Babu2 , Dr. M. V. Subba Reddy3 , M. S. Praveen Kumar4 1 (Student, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of Engineering and Technology, Puttur, Andhra Pradesh, India) 2 (Assoc. Prof, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of Engineering and Technology, Puttur, Andhra Pradesh, India) 3 (Professor, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of Engineering and Technology, Puttur, Andhra Pradesh, India) 4 (Student, Department of Computer Science and Engineering, Sri Venkatesa Perumal College of Engineering and Technology, Puttur, Andhra Pradesh, India) Abstract Telugu is one of the oldest languages in India. This paper describes the development of Telugu Text-to-Speech System (TTS).In Telugu TTS the input is Telugu text in Unicode. The voices are sampled from real recorded speech. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken waveform. Speech synthesis is a process of building machinery that can generate human-like speech from any text input to imitate human speakers. Text processing and speech generation are two main components of a text to speech system. To build a natural sounding speech synthesis system, it is essential that text processing component produce an appropriate sequence of phonemic units. Generation of sequence of phonetic units for a given standard word is referred to as letter to phoneme rule or text to phoneme rule. The complexity of these rules and their derivation depends upon the nature of the language. The quality of a speech synthesizer is judged by its closeness to the natural human voice and understandability. In this paper we described an approach to build a Telugu TTS system using concatenative synthesis method with syllable as a basic unit of concatenation. Keywords - Text processing, speech generation, phoneme, grapheme, Speech synthesis. I. INTRODUCTION A. Speech Synthesis The conversion of words in written form into speech is non-trivial. Even if we can store a huge dictionary for most common words; the TTS system still needs to deal with millions of names and acronyms. Moreover, in order to sound natural, the intonation of the sentences must be appropriately generated. Synthesis of speech cannot be accomplished by cutting and pasting smaller units together. Attention has to be paid to smoothing out the discontinuities in such a process so that the resulting signal approximates natural speech. According to the speech generation model used, speech synthesis can be classified into three categories as Articulatory synthesis, Formant synthesis and Concatenative synthesis. Based on the degree of manual intervention in the design, speech synthesis can be classified into two categories Synthesis by rule and Data-driven synthesis. Prosody and intonation are quite important for natural sounding speech.. There are in existence speech synthesis systems which replicate the prosodic features of human speech. This involves fairly complex parsing of the input sentences and using rather complex rules to determine the intonation patterns. Generation of sequence of phonetic units for a given standard word is referred to as letter to phoneme rule or text to phoneme rule. Voiced sounds were simulated with a computer model of the vocal fold composed of a single mass vibrating both parallel and perpendicular to the air flow. The work is divided into 3 main modules. 1. Converting Telugu script to Unicode 2. Differentiating Grapheme i. Combination of Consonant-vowel ii. Combination of Consonant-consonant 3. Generating voice i. Identify the Grapheme recognizer ii. Identify the Telugu Audio source 1. Converting Telugu Script to Unicode In English ASCII characters are used where as In Telugu Unicode characters are used. ASCII takes 8-bits for each character. Unicode takes 16-bits for each character. Unicode provides a unique RESEARCH ARTICLE OPEN ACCESS
  • 2. M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917 www.ijera.com 914 | P a g e number for every character, no matter what the platform, no matter what the program, no matter what the language. 2. Differentiating Grapheme In natural speech, durations of phonetic segments are strongly dependent on contextual factors. For synthetic speech to sound natural, the module for computing segmental duration must mimic these contextual effects as closely as possible. Grapheme: Graphemes are “functional spelling units” encompassing one or more letters of the text input, a grapheme in the text input corresponds to a single phoneme. Phoneme: Phones characterize any sound that can be produced by a human vocal tract, if a phone is part of a specific language; it becomes a phoneme of the language. Phonemes are the elementary sounds of a language. In this paper we are going to differentiate A character in Indian language scripts is close to syllable and can be typically of the following form: Different Combinations: V-Vowel C- Consonant C+V-Consonant+Vowel C+C-Consonant+Consonant C+C+V-Consonant+Consonant+Vowel C+C+C--Consonant+Consonant+Consonant Example: Fig.2. Differentiating grapheme 3.Generating Voice The function of Text-To-Speech (TTS) system is to convert the given text to a spoken waveform. This conversion involves text processing and speech generation processes. These processes have connections to linguistic theory, models of speech production, and acoustic-phonetic characterization of language. Text processing including end-of-sentence detection, text normalization. Word pronunciation, including the pronunciation of names and the disambiguation of homographs. In this approach, the pre-recorded speech segments which are to be used in the synthesizer are stored exactly as how it is recorded. Additional information of the speech waveform is attached to the sound to provide proper annotation of the speech waveform. To synthesize a particular language, required units (diphones) from the database which doesn't contain any language specific information and these selected units were then typically altered by signal processing functions to meet the language specific target specification generated by different modules in the synthesizer. To build a voice/speech for a language text, the steps involved are as follows B. Existing system Many of the improvements in speech synthesis over the past years have come from creative use of the technologies developed for speech recognition. We can build systems that interact through speech, which the system can listen to what was said, compute or do something, and then speak Fig.1. Converting Telugu script to Unicode Fig.1. Converting Telugu script to Unicode Telugu Text Unicode Fig.3. Generating Voice Telugu Language Text as Input Text Processing Grapheme - Phoneme conversion Prosodic Modeling Acoustic Synthesis Wave Data Generation
  • 3. M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917 www.ijera.com 915 | P a g e back, using spoken language generation. Festivox is speech synthesis system developed by speech group at CMU; it provides a general framework for building speech synthesis systems. It offers full text to speech through a number API’s from shell level, though a scheme command interpreter, as a C++ library, from Java, and an Emacs interface. The Festvox project: automating the processes involved in building synthetic voices for new languages. Creating festival TTS for other languages, especially Indian languages is very similar and will mostly require language specific changes to be made to the code in these modules. Festival uses diphone as basic unit. Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech. The overview of the problems that occur during text-to-speech (TTS) conversion and describe the particular solutions to these problems taken within the AT&T Bell Laboratories TTS system. II. TELUGU LANGUAGE The scripts of Indian languages have originated from the ancient Brahmi script. The basic units of writing system are characters which are orthographic representation of speech sounds. A character in Indian language scripts is close to syllable and can be typically of the following form: C, V, CV, CCV and CVC, where C is a consonant and V is a vowel. There are about 35 consonants and about 18 vowels in Indian languages. An important feature of Indian language scripts is their phonetic nature. There is more or less one to one correspondence between what is written and what is spoken. The rules required to map the letters to sounds of Indian languages are almost straight forward. All Indian language scripts have common phonetic base. Syllabification Rules There is almost one to one correspondence between what is written and what is spoken in Indian Languages. Each character in Indian language script has a correspondence to a sound of that language. In Indian languages, a consonant character is inherently bound with the vowel sound /a/, and is almost always pronounced with this vowel. This occurs at both word final and word middle positions. A few heuristic rules to detect IVS of a consonant character are noted below. While letter to phone rules are almost straightforward in Indian languages, the syllabification rules are not trivial. There is need to come up with some rules to break the word into syllables. We have derived certain simplistic rules for syllabification i.e. rules for grouping clusters of C*VC* based on heuristic analysis of several words in Telugu language. III. SYSTEM ARCHITECTURE The main purpose of the system is to convert an arbitrary text into its corresponding spoken waveform. Text processing and speech generation are two main components of a text to speech system. To build a natural sounding speech synthesis system, it is essential that text processing component produce an appropriate sequence of phonemic units. Generation of sequence of phonetic units for a given standard word is referred to as letter to phoneme rule or text to phoneme rule. The complexity of these rules and their derivation depends upon the nature of the language. In Telugu TTS the input is Telugu text in Unicode. Text syllabification will segment the normalized text to syllable unit according to Telugu language rules. Phone set Definition module defines the complete set of phones used in Telugu speech. It also includes feature definitions of these phones. Lexical Analysis module is used to arrive at the phones that make up the pronunciation of a particular word. Since Telugu is phonetic in nature, we do not require a dictionary for lexical analysis. Instead, this module defines letter-to-sound rules (LTS) which are used to arrive at the speech phones based on the spelling of the word The input text is converted into readable text and the syllabification module will generate the sequence of syllables that should be extracted from the speech corpus to be concatenated and play the sound files.
  • 4. M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917 www.ijera.com 916 | P a g e IV. RESULTS A Telugu TTS system using syllables as basic unit of concatenation is presented. The quality of the synthesized speech is reasonably natural. The proposed approach minimizes the co articulation effects and prosodic mismatch between two adjacent units. Selection of a unit is based on the presiding and succeeding context of the syllables and the position of the syllable. The system implements a matching function which assigns weight to the syllable based on its nature and the syllable with maximum weight is selected as output speech units. We have observed the efficiency of this approach for Telugu language and found that the performance of this approach is better. Fig.5. Playing the generated voice The results showed that there is a strong correlation between the values of the source parameter in the vowel midpoint and the vowel duration. The same parameters tend to decrease on vowel onsets and to increase on vowels offsets. This seems to indicate a prosodic nature of these parameters requiring special treatment in concatenative-based TTS systems that use source modification techniques, such as pitch synchronous overlap add and multipulse. V. CONCLUSION Speech synthesis techniques , it is much easier to build a voice in a language with fewer sentences and a smaller Speech]. The proposed approach minimizes the co articulation effects and prosodic mismatch between two adjacent units. Selection of a unit is based on the presiding and succeeding context of the syllables and the position of the syllable. The system implements a matching function which assigns weight to the syllable based on its nature and the syllable with maximum weight is selected as output speech units. We have observed the efficiency of this approach for Telugu language and found that the performance of this approach is better. Telugu TTS system using syllables as basic unit of concatenation is presented. The quality of the synthesized speech is reasonably natural. The proposed approach minimizes the co articulation effects and prosodic mismatch between two adjacent units. Selection of a unit is based on the presiding and succeeding context of the syllables and the position of the syllable. The system implements a matching function which assigns weight to the syllable based on its nature and the syllable with maximum weight is selected as output speech units. We have observed the efficiency of this approach for Telugu language and found that the performance of this approach is better. VI. FUTURE ENHANCEMENTS The generated speech shows distortion at the concatenation point of two syllables. If this distortion is significant then it would loose the naturalness. This distortion can be minimized by adding smoothing module which would modify the speech parameters like pitch formants and intensity at the concatenation point. More attention needs to be paid to the frequency and amplitude range as well as the duration of each pre-recorded segments to produce high quality speech. A number of other ongoing projects are aimed at developing a POS tag set, POStagger and a tagged corpus for Sinhala. Further work will focus on expanding the pronunciation lexicon. At present, the G2P rules are incapable of providing accurate pronunciation for most compound words. Thus, we are planning to construct a lexicon consisting of compound words along with common high frequency words found in our Sinhala text corpus, which are currently incorrectly phonetized. REFERENCES [1] Lakshmi A, Hema A Murthy. A Syllable Based Continuous Speech Recognizer for Tamil. In Proc. of the 2nd Int. Workshop on East-Asian Language Resources and Evaluation,2009. [2] Sreekanth Majji, Ramakrisshnan A.G “Festival Based maiden TTS system for Tamil Language”, 3rd Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics October 5-7, 2007, PoznaĔ, Poland. [3] ZenH.,NoseT.,YamagishiJ.,SakoS.,Masuko T.,Black A.W., andTokudaK.,“The hmm- based speech synthesis system version2.0,” in Proc.ofISCASSW6, Bonn, Germany,2007.
  • 5. M. Siva Kumar et al Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 4, Issue 3(Version 1), March 2014, pp.913-917 www.ijera.com 917 | P a g e [4] BlackA.W. ZenH. and TokudaK. “Statistical parametric speech synthesis,” in proceeding sofIEEEInt.Conf. Acoust. Speech, and Signal Processing, Honolulu, USA, 2007. [5] Rohit Kumar, S. P. Kishore, “Automatic Pruning of Unit Selection Speech Databases for Synthesis without loss of Naturalness”, submitted to ICSLP, Jeju Island, Korea,2004 [6] S P Kishore, Rohit Kumar and Rajeev Sangal, “A Data Driven Synthesis Approach For Indian Languages using Syllables as BasicUnit”, in Proceedings of Intl. Conf. on NLP (ICON) 2002, pp. 311-316, Mumbai, India, 200 [7] A.M. Zeki and N. Azizah, "A Speech Synthesizer for Malay Language”, National Conference on Research and Development in Computer Science, Selangor, Malaysia, October 2001 [8] X. Huang, A.Acero and H.-W. Hon, “Spoken Language Processing A Guide to Theory, Algorithm and System Development”, New Jersey: Prentice Hall, 2001 [9] Min Chu, Hu Peng, Hong-yun Yang, Eric Chang, “Selecting Non-Uniform Units from a very large corpus for Concatenative Speech Synthesizer”, in Proceedings of ICASSP, Salt Lake City, 2001 [10] Sproat R. (1995): finite-state architecture for tokenization and grapheme-to-phoneme conversion for multilingual text analysis”. In From text to tags: Issues in multilingual language analysis. Proc. ACL SIGDAT Workshop (Dublin, Ireland), 65-72 [11] Sproat R., Olive J. (1995): “Text to speech syn-thesis”. AT&T T echnical Journal 74(2), 35-44 [12] Sproat R., Olive J. (1996): “A modular architec-ture for multi-lingual text-to- speech”. In J. van Santen, R. Sproat, J. Olive and J. Hirschberg (eds.), Progress in speech synthesis (Springer, New York). [13] T alkin D., Rowley J. (1990): “Pitch-syn- chronous analysis and synthesis for TTS systems”.Proc. ESCA Workshop on Speech Synthesis (Autrans, France), 55-58. [14] A.M. Zeki and N. Azizah, "A Speech Synthesizer for Malay Language”, National Conference on Research and Development in Computer Science, Selangor, Malaysia, October 2001.