Speech is an important mode of communication and an active research topic, with most
work concentrating on speech analysis and synthesis. Speech synthesis is the artificial
production of human speech, and a text-to-speech (TTS) system converts arbitrary text into
speech. Many languages are spoken in India, each the mother tongue of tens of millions
of people. In this paper, a text-to-speech system is developed primarily for Telugu, a
Dravidian language predominantly spoken in the Indian state of Andhra Pradesh. The
important qualities expected from this system are naturalness and intelligibility. A Telugu
TTS can be developed using synthesis methods such as articulatory synthesis, formant
synthesis and concatenative synthesis. This paper describes the development of a Telugu
text-to-speech system using the concatenative synthesis method on a mobile platform, the
OMAP 3530 (ARM Cortex-A8 core), running Linux.
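The idea behind concatenative synthesis can be illustrated with a minimal sketch. It is not the paper's implementation: the unit inventory, the romanized syllable keys, the sine tones standing in for recorded speech, and the linear crossfade at unit boundaries are all assumptions made for illustration.

```python
import math

def make_unit(freq_hz, n_samples, rate=16000):
    """Stand-in for a recorded speech unit: a sine tone as a list of float samples."""
    return [math.sin(2 * math.pi * freq_hz * i / rate) for i in range(n_samples)]

def concatenate(units, overlap=160):
    """Join units in order, linearly crossfading `overlap` samples at each boundary."""
    out = list(units[0])
    for unit in units[1:]:
        k = min(overlap, len(out), len(unit))
        base = len(out) - k
        for i in range(k):
            w = (i + 1) / (k + 1)  # weight of the incoming unit rises across the overlap
            out[base + i] = (1 - w) * out[base + i] + w * unit[i]
        out.extend(unit[k:])
    return out

# Hypothetical inventory keyed by romanized syllable (illustrative only).
inventory = {
    "te": make_unit(220, 1600),
    "lu": make_unit(260, 1600),
    "gu": make_unit(300, 1600),
}
speech = concatenate([inventory[s] for s in ["te", "lu", "gu"]])
```

A real system would load recorded units from a speech database and would typically also adjust pitch and duration at the joins; the crossfade here only smooths the amplitude discontinuity.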
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer | IOSR Journals
This document summarizes an implementation of an English-text to Marathi-speech synthesizer. The synthesizer uses a unit selection approach based on concatenative synthesis to produce natural sounding Marathi speech from English text input. Over 28,000 Marathi syllables, words and sentences were recorded from a female speaker and used to create the speech corpus. Formant frequencies (F1, F2, F3) were analyzed from the synthesized speech using MATLAB and PRAAT tools to evaluate the quality and naturalness of the output.
Welcome to International Journal of Engineering Research and Development (IJERD) | IJERD Editor
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE | ijnlc
Manipuri is both a minority and morphologically rich language with genetic features similar to Tibeto Burman languages. It has Subject-Object-Verb (SOV) order, agglutinative verb morphology and is monosyllabic. Morphology and syntax are not clearly distinguished in this language. Natural Language Processing (NLP) is a useful research field of computer science that deals with processing large natural language corpora. NLP applications include E-Dictionary, Morphological Analyzer, Reduplicated Multi-Word Expression (RMWE), Named Entity Recognition (NER), Part of Speech (POS) Tagging, Machine Translation (MT), Word Net, Word Sense Disambiguation (WSD) etc. In this paper, we present a study of the advancements in NLP applications for the Manipuri language, together with a comparison table of the approaches and techniques adopted and the results obtained for each application, followed by a detailed discussion of each work.
Emotional Telugu speech signals classification based on k-NN classifier | eSAT Publishing House
IJRET: International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academicians, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Artificially Generated of Concatenative Syllable based Text to Speech Synthesi... | iosrjce
This document describes a Marathi text-to-speech (TTS) synthesis system based on a concatenative approach using syllables as the basic speech units. The system analyzes input text, performs syllabification based on linguistic rules, retrieves corresponding speech files from a corpus, concatenates the files while minimizing discontinuities at boundaries, and outputs synthesized speech. A Marathi speech corpus was created containing over 1000 sentences from various domains. Subjective quality tests found the synthesized speech to have naturalness and intelligibility comparable to natural speech. The system demonstrates an effective approach for Marathi TTS using a syllable-based concatenative method.
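The syllabification step in such a pipeline can be sketched as follows. This is a toy rule, not the rule set used in the document: it assumes romanized input, treats only `a e i o u` as vowels, and lets each syllable be a run of consonants followed by a run of vowels, with any trailing consonants attached to the last syllable. The function name `syllabify` is hypothetical.

```python
import re

VOWELS = "aeiou"  # assumption: romanized text with five plain vowels

def syllabify(word):
    """Split a romanized word into consonant-cluster + vowel-run syllables."""
    pattern = re.compile(rf"[^{VOWELS}]*[{VOWELS}]+")
    syllables = pattern.findall(word)
    if not syllables:
        return [word]  # no vowel found: keep the word as a single unit
    consumed = sum(len(s) for s in syllables)
    syllables[-1] += word[consumed:]  # attach trailing consonants to the last syllable
    return syllables

print(syllabify("namaskar"))  # ['na', 'ma', 'skar']
```

Each resulting syllable would then serve as the lookup key for a recorded unit in the speech corpus. A rule set for native script would instead operate on the script's consonant, vowel-sign and conjunct characters.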
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE | kevig
Manipuri is both a minority and morphologically rich language with genetic features similar to Tibeto Burman languages. It has Subject-Object-Verb (SOV) order, agglutinative verb morphology and is monosyllabic. Morphology and syntax are not clearly distinguished in this language. Natural Language Processing (NLP) is a useful research field of computer science that deals with processing large natural language corpora. NLP applications include E-Dictionary, Morphological Analyzer, Reduplicated Multi-Word Expression (RMWE), Named Entity Recognition (NER), Part of Speech (POS) Tagging, Machine Translation (MT), Word Net, Word Sense Disambiguation (WSD) etc. In this paper, we present a study of the advancements in NLP applications for the Manipuri language, together with a comparison table of the approaches and techniques adopted and the results obtained for each application, followed by a detailed discussion of each work.
PART OF SPEECH TAGGING OF MARATHI TEXT USING TRIGRAM METHOD | ijait
This document describes the development of a part-of-speech tagger for Marathi text using a trigram statistical approach. The trigram method assigns POS tags to words based on the probabilities of tag transitions given the previous two tags. The tagger was evaluated on a test corpus of 2000 sentences and achieved an accuracy of 91.63%. Future work will aim to improve accuracy by expanding the training corpus with more tagged sentences. The document also provides background on previous work developing POS taggers for other Indian languages and challenges in tagging morphologically rich languages like Marathi.
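The tag-transition probability at the heart of a trigram tagger, P(t_i | t_{i-2}, t_{i-1}), can be estimated from counts in a tagged corpus. The sketch below is a minimal illustration, not the document's tagger: the three-sentence corpus, the placeholder words, the tag names and the `<s>` padding symbol are all assumptions, and no smoothing is applied.

```python
from collections import Counter

def train_trigram(tagged_sentences):
    """Count tag trigrams and their two-tag contexts, padding each sentence start."""
    tri, ctx = Counter(), Counter()
    for sent in tagged_sentences:
        tags = ["<s>", "<s>"] + [tag for _, tag in sent]
        for a, b, c in zip(tags, tags[1:], tags[2:]):
            tri[(a, b, c)] += 1
            ctx[(a, b)] += 1
    return tri, ctx

def transition_prob(tri, ctx, a, b, c):
    """Maximum-likelihood estimate of P(c | a, b); 0.0 for an unseen context."""
    return tri[(a, b, c)] / ctx[(a, b)] if ctx[(a, b)] else 0.0

# Hypothetical toy corpus of (word, tag) pairs; the words are placeholders.
corpus = [
    [("w1", "NOUN"), ("w2", "VERB")],
    [("w3", "NOUN"), ("w4", "NOUN"), ("w5", "VERB")],
    [("w6", "ADJ"), ("w7", "NOUN"), ("w8", "VERB")],
]
tri, ctx = train_trigram(corpus)
print(transition_prob(tri, ctx, "<s>", "NOUN", "VERB"))  # 0.5
```

A full tagger would combine these transition probabilities with word-given-tag probabilities and search for the best tag sequence, for example with the Viterbi algorithm, and would smooth the estimates to handle unseen trigrams.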
This document discusses the basic tasks involved in natural language processing (NLP). It describes the different phases of NLP including phonetics, lexical analysis, syntactic analysis, semantic analysis, discourse analysis, and pragmatic analysis. It then explains some basic NLP activities like tokenization, sentence splitting, and part-of-speech tagging. The goal of NLP is to enable computers to understand and process human languages through computational modeling.
Marathi Text-To-Speech Synthesis using Natural Language Processing | iosrjce
IOSR journal of VLSI and Signal Processing (IOSRJVSP) is a double blind peer reviewed International Journal that publishes articles which contribute new results in all areas of VLSI Design & Signal Processing. The goal of this journal is to bring together researchers and practitioners from academia and industry to focus on advanced VLSI Design & Signal Processing concepts and establishing new collaborations in these areas.
Design and realization of microelectronic systems using VLSI/ULSI technologies require close collaboration among scientists and engineers in the fields of systems architecture, logic and circuit design, chips and wafer fabrication, packaging, testing and systems applications. Generation of specifications, design and verification must be performed at all abstraction levels, including the system, register-transfer, logic, circuit, transistor and process levels.
Development of text to speech system for Yoruba language | Alexander Decker
This document describes the development of a text-to-speech (TTS) system for the Yoruba language. It begins with background on TTS systems and an overview of previous work developing TTS for other languages but not extensively for Yoruba. The authors then describe the architecture and design of the Yoruba TTS system they developed using a concatenative synthesis method. This includes analyzing the phonology and syllable structure of Yoruba, and developing components for syllable identification, prosody assignment, and speech signal processing. An evaluation of the system found 70% of respondents found it usable.
This document discusses a project to directly translate Hindi text to Tamil text without an intermediate language like English. It describes using techniques like part-of-speech tagging, statistical machine translation, word sense disambiguation using the Lesk algorithm, and morphological analysis. The goal is to build an architecture that can take Hindi input, perform the necessary NLP techniques, and output the translation in Tamil. References are provided for related work.
The document discusses natural language processing (NLP) for Tamil to Hindi conversion. It introduces the Universal Networking Language (UNL) as an intermediate representation to express information across languages. UNL allows text to be converted to different languages like converting a webpage to various natural languages. The document then discusses the advantages of developing machine translation between Tamil and other languages, particularly English and Hindi. It outlines the components needed for a Tamil-Hindi machine translation system, including morphological analyzers for Tamil and Hindi, a word mapping unit, and generators.
Segmentation Words for Speech Synthesis in Persian Language Based On Silence | paperpublications3
Abstract: In text-to-speech synthesis, words are usually broken into parts, and the recorded sound of each part is used to play the word. This paper uses the silences in a word's pronunciation to improve speech quality. Most algorithms divide words into syllables, and some divide them into phonemes; this paper instead exploits silence in intonation, dividing words at silent regions and then assigning the equivalent sound to each part, so that joining the parts is reliable and the resulting speech is smoother. The paper concerns the Persian language, but the method can be extended to other languages. The method was evaluated with a MOS test, and intelligibility, naturalness and fluidity improved.
Keywords: TTS, SBS, Syllable, Diphone.
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models | IRJET Journal
The document describes a proposed system to translate Tamil speech to Indian Sign Language (ISL) using speech recognition and natural language processing algorithms. It aims to help hearing-impaired people communicate independently. The system would use the CMU Sphinx speech recognition tool to convert spoken Tamil to text, then apply grammar rules and machine learning to translate the text to ISL displayed through video or animated avatars. The document reviews similar existing systems and research on speech recognition and sign language translation to inform the design and implementation of the proposed Tamil-ISL system.
Myanmar named entity corpus and its use in syllable-based neural named entity... | IJECEIAES
This document describes the development of the first manually annotated named entity corpus for the Myanmar language. It contains approximately 170,000 named entities tagged with types like person, location, organization, race, time and number. The document also discusses experiments using various deep neural network architectures for named entity recognition on Myanmar text, without additional feature engineering. Results showed that syllable-based neural models outperformed the baseline conditional random field model. This research aims to apply neural networks to Myanmar natural language processing and promote future work on this under-resourced language.
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ... | kevig
Neural machine translation is a new approach to machine translation that has shown effective results for high-resource languages. Recently, attention-based neural machine translation with large-scale parallel corpora has played an important role in achieving high translation performance. In this research, a parallel corpus for the Myanmar-English language pair is prepared, and attention-based neural machine translation models are introduced at the word-to-word, character-to-word, and syllable-to-word levels. We run experiments with the proposed models to translate long sentences and to address morphological problems. To mitigate the low-resource problem, source-side monolingual data are also used. This work thus investigates how to improve a Myanmar-to-English neural machine translation system. The experimental results show that the syllable-to-word level neural machine translation model obtains an improvement over the baseline systems.
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC... | kevig
In this paper, phoneme sequences are used as language information to perform code-switched language
identification (LID). With the one-pass recognition system, the spoken sounds are converted into
phonetically arranged sequences of sounds. The acoustic models are robust enough to handle multiple
languages when emulating multiple hidden Markov models (HMMs). To determine the phoneme similarity
among our target languages, we reported two methods of phoneme mapping. Statistical phoneme-based
bigram language models (LM) are integrated into speech decoding to eliminate possible phone
mismatches. The supervised support vector machine (SVM) is used to learn to recognize the phonetic
information of mixed-language speech based on recognized phone sequences. As the back-end decision is
taken by an SVM, the likelihood scores of segments with monolingual phone occurrence are used to
classify language identity. The speech corpus was tested on Sepedi and English languages that are often
mixed. Our system is evaluated by measuring both the ASR performance and the LID performance
separately. The systems have obtained a promising ASR accuracy with data-driven phone merging
approach modelled using 16 Gaussian mixtures per state. In code-switched speech and monolingual
speech segments respectively, the proposed systems achieved an acceptable ASR and LID accuracy.
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION | kevig
Phonetic typing using the English alphabet has become widely popular nowadays for social media and chat services. As a result, a text containing various English and Bangla words and phrases has become increasingly common. Existing transliteration tools display poor performance for such texts. This paper proposes a robust Three-stage Hybrid Transliteration (THT) framework that can transliterate both English words and phonetic typed Bangla words satisfactorily. This is achieved by adopting a hybrid approach of dictionary-based and rule-based techniques. Experimental results confirm superiority of THT as it significantly outperforms the benchmark transliteration tool.
A Marathi Hidden-Markov Model Based Speech Synthesis System | iosrjce
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ... | kevig
This study extracted and analyzed the linguistic speech patterns that characterize Japanese anime or game characters. Conventional morphological analyzers, such as MeCab, segment words with high performance, but they are unable to segment broken expressions or utterance endings that are not listed in the dictionary, which often appears in lines of anime or game characters. To overcome this challenge, we propose segmenting lines of Japanese anime or game characters using subword units that were proposed mainly for deep learning, and extracting frequently occurring strings to obtain expressions that characterize their utterances. We analyzed the subword units weighted by TF/IDF according to gender, age, and each anime character and show that they are linguistic speech patterns that are specific for each feature. Additionally, a classification experiment shows that the model with subword units outperformed that with the conventional method.
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor | Waqas Tariq
In recent decades, speech interactive systems have gained increasing importance. The performance of an ASR system depends mainly on the availability of a large speech corpus. The conventional method of building a large-vocabulary speech recognizer for any language uses a top-down approach to speech. This approach requires a large speech corpus with sentence- or phoneme-level transcription of the speech utterances. The transcriptions must also include different speech orders so that the recognizer can build models for all the sounds present. For Telugu, however, because of its complex nature, a very large, well-annotated speech database is very difficult to build. It is very difficult, if not impossible, to cover all the words of any Indian language, where each word may have thousands or even millions of word forms. A significant part of the grammar that is handled by syntax in English (and other similar languages) is handled within morphology in Telugu. Phrases of several words (that is, tokens) in English would be mapped onto a single word in Telugu. Telugu is phonetic in nature in addition to being rich in morphology. That is why speech technology developed for English cannot be applied directly to Telugu. This paper highlights the work carried out in an attempt to build a voice-enabled text editor capable of automatic term suggestion. The main claim of the paper is the recognition enhancement process we developed for highly inflecting, morphologically rich languages. This method increases speech recognition accuracy while greatly reducing corpus size. It also adds Telugu words to the database dynamically, so that the corpus grows over time.
This paper presents a machine translation system that translates simple assertive English sentences to Marathi sentences. The system performs morphological analysis, part-of-speech tagging, and local word grouping to convert the meaning of the English sentence to the corresponding Marathi sentence. An English to Marathi bilingual dictionary is used for translation. The system aims to help people with primary education understand English words by providing translations to their native Marathi language.
Tamil-English Document Translation Using Statistical Machine Translation Appr... | baskaran_md
The paper presents a new method for translating a text document from Tamil to English. The method is based on the Statistical Machine Translation (SMT) approach combined with morphological analysis, because Tamil is a highly inflected language. The paper presents a slight modification to SMT that makes the approach more efficient and effective, and the experimental results show the method to be fast and accurate in the translation process.
This document presents an efficient rule-based system for morphological parsing of the Tamil language. It discusses the agglutinative nature of Tamil morphology and the need for morphological analysis in applications such as machine translation. The proposed system uses a combination of rule-based and machine learning approaches to analyze Tamil words and identify their root forms and inflections. It was implemented using resources like the EMILLE corpus and Tamil WordNet and allows for morphological parsing of Tamil texts.
Machine verification and identification of Telugu metrical poetry 1.1 | Dileep Miriyala
The document discusses machine verification and identification of Telugu metrical poetry (Chandassu). It provides an overview of key concepts in Chandassu such as gana structure, yati, prasa, and prasa yati. It then describes how the Chandam software works by extracting features from input text and matching them against known Chandassu structures to verify or identify the Chandassu. A demo of the software is shown analyzing a sample poem.
Vyaavahaarika telugu bhaasha - Spoken Telugu Language Movement - Gidugu and V... | Veturi Anandamurthy
My presentation in Mauritius on 29th August during the Telugu Language Day Celebrations, jointly organized by the Mahatma Gandhi Institute and the Government of Mauritius. It makes special reference to Gidugu and Veturi and their contribution to the spoken language movement.
This document discusses the importance of Telugu literature and language. It notes that Telugu is a Dravidian language predominantly spoken in Andhra Pradesh, India. It also provides background on the history and evolution of the Telugu language from ancient to modern times. The document outlines the basic structure and components of the Telugu language, including its alphabet, pronouns, sentence structure, and digits. It concludes by expressing concern over the decreasing number of Telugu speakers and advocates for native speakers to preserve their mother tongue.
Telugu is a language sweeter than honey, the Italian of the East. Telugu people's achievements and many more things about Telugu, presented to the world.
The presentation was created by B. Saikrishna. It likely contained information on a topic presented in a clear manner intended for an audience. Further details about the content or purpose of the presentation are not provided in the single sentence document.
This document discusses the basic tasks involved in natural language processing (NLP). It describes the different phases of NLP including phonetics, lexical analysis, syntactic analysis, semantic analysis, discourse analysis, and pragmatic analysis. It then explains some basic NLP activities like tokenization, sentence splitting, and part-of-speech tagging. The goal of NLP is to enable computers to understand and process human languages through computational modeling.
Marathi Text-To-Speech Synthesis using Natural Language Processingiosrjce
IOSR journal of VLSI and Signal Processing (IOSRJVSP) is a double blind peer reviewed International Journal that publishes articles which contribute new results in all areas of VLSI Design & Signal Processing. The goal of this journal is to bring together researchers and practitioners from academia and industry to focus on advanced VLSI Design & Signal Processing concepts and establishing new collaborations in these areas.
Design and realization of microelectronic systems using VLSI/ULSI technologies require close collaboration among scientists and engineers in the fields of systems architecture, logic and circuit design, chips and wafer fabrication, packaging, testing and systems applications. Generation of specifications, design and verification must be performed at all abstraction levels, including the system, register-transfer, logic, circuit, transistor and process levels
Development of text to speech system for yoruba languageAlexander Decker
This document describes the development of a text-to-speech (TTS) system for the Yoruba language. It begins with background on TTS systems and an overview of previous work developing TTS for other languages but not extensively for Yoruba. The authors then describe the architecture and design of the Yoruba TTS system they developed using a concatenative synthesis method. This includes analyzing the phonology and syllable structure of Yoruba, and developing components for syllable identification, prosody assignment, and speech signal processing. An evaluation of the system found 70% of respondents found it usable.
This document discusses a project to directly translate Hindi text to Tamil text without an intermediate language like English. It describes using techniques like part-of-speech tagging, statistical machine translation, word sense disambiguation using the Lesk algorithm, and morphological analysis. The goal is to build an architecture that can take Hindi input, perform the necessary NLP techniques, and output the translation in Tamil. References are provided for related work.
The document discusses natural language processing (NLP) for Tamil to Hindi conversion. It introduces the Universal Networking Language (UNL) as an intermediate representation to express information across languages. UNL allows text to be converted to different languages like converting a webpage to various natural languages. The document then discusses the advantages of developing machine translation between Tamil and other languages, particularly English and Hindi. It outlines the components needed for a Tamil-Hindi machine translation system, including morphological analyzers for Tamil and Hindi, a word mapping unit, and generators.
Segmentation Words for Speech Synthesis in Persian Language Based On Silence - paperpublications3
Abstract: In text-to-speech synthesis, words are usually broken into parts, and the recorded sound of each part is played back to produce the word. This paper uses the silences within a word's pronunciation to improve speech quality. Most algorithms divide words into syllables, and some divide them into phonemes; this paper instead exploits silences in intonation, dividing words at silent regions and assigning the corresponding sound to each part, so that joining the parts is reliable and the resulting speech is smoother. The paper concerns Persian, but the method can be extended to other languages. The method was evaluated with a MOS test, and intelligibility, naturalness and fluidity were rated better.
Keywords: TTS, SBS, Syllable, Diphone.
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models - IRJET Journal
The document describes a proposed system to translate Tamil speech to Indian Sign Language (ISL) using speech recognition and natural language processing algorithms. It aims to help hearing-impaired people communicate independently. The system would use the CMU Sphinx speech recognition tool to convert spoken Tamil to text, then apply grammar rules and machine learning to translate the text to ISL displayed through video or animated avatars. The document reviews similar existing systems and research on speech recognition and sign language translation to inform the design and implementation of the proposed Tamil-ISL system.
Myanmar named entity corpus and its use in syllable-based neural named entity... - IJECEIAES
This document describes the development of the first manually annotated named entity corpus for the Myanmar language. It contains approximately 170,000 named entities tagged with types like person, location, organization, race, time and number. The document also discusses experiments using various deep neural network architectures for named entity recognition on Myanmar text, without additional feature engineering. Results showed that syllable-based neural models outperformed the baseline conditional random field model. This research aims to apply neural networks to Myanmar natural language processing and promote future work on this under-resourced language.
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ... - kevig
Neural machine translation is a recent approach to machine translation that has shown effective results
for high-resource languages. Attention-based neural machine translation with a large-scale
parallel corpus plays an important role in achieving high translation performance. In this
research, a parallel corpus for the Myanmar-English language pair is prepared, and attention-based neural
machine translation models are introduced at the word-to-word, character-to-word, and
syllable-to-word levels. We run experiments with the proposed models on translating long sentences and
addressing morphological problems. To mitigate the low-resource problem, source-side monolingual data are
also used. This work thus investigates how to improve a Myanmar-to-English neural machine translation system.
The experimental results show that the syllable-to-word level neural machine translation model obtains an
improvement over the baseline systems.
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITC... - kevig
In this paper, phoneme sequences are used as language information to perform code-switched language
identification (LID). With the one-pass recognition system, the spoken sounds are converted into
phonetically arranged sequences of sounds. The acoustic models are robust enough to handle multiple
languages when emulating multiple hidden Markov models (HMMs). To determine the phoneme similarity
among our target languages, we reported two methods of phoneme mapping. Statistical phoneme-based
bigram language models (LM) are integrated into speech decoding to eliminate possible phone
mismatches. The supervised support vector machine (SVM) is used to learn to recognize the phonetic
information of mixed-language speech based on recognized phone sequences. As the back-end decision is
taken by an SVM, the likelihood scores of segments with monolingual phone occurrence are used to
classify language identity. The speech corpus was tested on Sepedi and English languages that are often
mixed. Our system is evaluated by measuring both the ASR performance and the LID performance
separately. The systems have obtained a promising ASR accuracy with data-driven phone merging
approach modelled using 16 Gaussian mixtures per state. In code-switched speech and monolingual
speech segments respectively, the proposed systems achieved an acceptable ASR and LID accuracy.
A ROBUST THREE-STAGE HYBRID FRAMEWORK FOR ENGLISH TO BANGLA TRANSLITERATION - kevig
Phonetic typing using the English alphabet has become widely popular nowadays for social media and chat services. As a result, a text containing various English and Bangla words and phrases has become increasingly common. Existing transliteration tools display poor performance for such texts. This paper proposes a robust Three-stage Hybrid Transliteration (THT) framework that can transliterate both English words and phonetic typed Bangla words satisfactorily. This is achieved by adopting a hybrid approach of dictionary-based and rule-based techniques. Experimental results confirm superiority of THT as it significantly outperforms the benchmark transliteration tool.
A Marathi Hidden-Markov Model Based Speech Synthesis System - iosrjce
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ... - kevig
This study extracted and analyzed the linguistic speech patterns that characterize Japanese anime or game characters. Conventional morphological analyzers, such as MeCab, segment words with high performance, but they are unable to segment broken expressions or utterance endings that are not listed in the dictionary, which often appear in lines of anime or game characters. To overcome this challenge, we propose segmenting lines of Japanese anime or game characters using subword units that were proposed mainly for deep learning, and extracting frequently occurring strings to obtain expressions that characterize their utterances. We analyzed the subword units weighted by TF/IDF according to gender, age, and each anime character and show that they are linguistic speech patterns that are specific for each feature. Additionally, a classification experiment shows that the model with subword units outperformed that with the conventional method.
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor - Waqas Tariq
In recent decades, speech interactive systems have gained increasing importance. The performance of an ASR system depends mainly on the availability of a large speech corpus. The conventional method of building a large-vocabulary speech recognizer for any language uses a top-down approach to speech. This approach requires a large speech corpus with sentence- or phoneme-level transcription of the speech utterances. The transcriptions must also include different speech order so that the recognizer can build models for all the sounds present. For Telugu, however, because of its complex nature, a very large, well-annotated speech database is very difficult to build. It is very difficult, if not impossible, to cover all the words of any Indian language, where each word may have thousands or even millions of word forms. A significant part of the grammar that is handled by syntax in English (and other similar languages) is handled within morphology in Telugu. Phrases comprising several words (that is, tokens) in English map onto a single word in Telugu. Telugu is phonetic in nature as well as rich in morphology, which is why speech technology developed for English cannot be applied to Telugu. This paper highlights the work carried out in an attempt to build a voice-enabled text editor capable of automatic term suggestion. The main claim of the paper is the recognition enhancement process developed by us for highly inflecting, morphologically rich languages. The method increases speech recognition accuracy while greatly reducing corpus size. It also adds Telugu words to the database dynamically, so that the corpus grows.
This paper presents a machine translation system that translates simple assertive English sentences to Marathi sentences. The system performs morphological analysis, part-of-speech tagging, and local word grouping to convert the meaning of the English sentence to the corresponding Marathi sentence. An English to Marathi bilingual dictionary is used for translation. The system aims to help people with primary education understand English words by providing translations to their native Marathi language.
Tamil-English Document Translation Using Statistical Machine Translation Appr... - baskaran_md
The paper presents a new method for translating a text document from Tamil to English. Our method is based on the statistical machine translation (SMT) approach, combined with morphological analysis, because Tamil is a highly inflected language. The paper presents a slight modification to SMT that makes the approach more efficient and effective, and the experimental results show the method to be fast and accurate in the translation process.
This document presents an efficient rule-based system for morphological parsing of the Tamil language. It discusses the agglutinative nature of Tamil morphology and the need for morphological analysis in applications such as machine translation. The proposed system uses a combination of rule-based and machine learning approaches to analyze Tamil words and identify their root forms and inflections. It was implemented using resources like the EMILLE corpus and Tamil WordNet and allows for morphological parsing of Tamil texts.
Machine verification and identification of telugu metrical poetry 1.1 - Dileep Miriyala
The document discusses machine verification and identification of Telugu metrical poetry (Chandassu). It provides an overview of key concepts in Chandassu such as gana structure, yati, prasa, and prasa yati. It then describes how the Chandam software works by extracting features from input text and matching them against known Chandassu structures to verify or identify the Chandassu. A demo of the software is shown analyzing a sample poem.
Vyaavahaarika telugu bhaasha - Spoken Telugu Language Movement - Gidugu and V... - Veturi Anandamurthy
My presentation at Mauritius on 29th August during the Telugu Language Day Celebrations jointly organized by the Mahatma Gandhi Institute and the Government of Mauritius. Special reference to Gidugu and Veturi and their contribution to the spoken language movement.
This document discusses the importance of Telugu literature and language. It notes that Telugu is a Dravidian language predominantly spoken in Andhra Pradesh, India. It also provides background on the history and evolution of the Telugu language from ancient to modern times. The document outlines the basic structure and components of the Telugu language, including its alphabet, pronouns, sentence structure, and digits. It concludes by expressing concern over the decreasing number of Telugu speakers and advocates for native speakers to preserve their mother tongue.
Telugu Language is sweeter than Honey. Telugu Language Italian of the East. Telugu People's Achievements and many more things about Telugu to the World
The presentation was created by B. Saikrishna. It likely contained information on a topic presented in a clear manner intended for an audience. Further details about the content or purpose of the presentation are not provided in the single sentence document.
This document discusses using a literature-based research methodology for student dissertations. It outlines:
1) A constructionist approach to teaching and learning that encourages students to actively engage with research.
2) Key aspects of a literature-based dissertation including defining a research problem, reviewing relevant literature, using a structured methodology, and engaging students in the research process.
3) Components of a literature-based methodology such as sampling strategies, data collection tools for reviewing literature, and analyzing data thematically while considering disciplinary and professional contexts.
This document provides an overview of the literature review process. It defines a literature review, discusses the types and purposes of literature reviews. It outlines the key steps in conducting a literature review including defining the research problem, searching relevant literature, planning the review, taking notes, and summarizing findings. Sources of literature are primary sources like reports and theses, secondary sources like books and journals, and tertiary sources like indexes and abstracts. The document provides examples of formatting literature in bibliographies and taking structured notes on index cards to organize the review.
This 3 sentence document discusses presenting information. It mentions creating a presentation, listing important points, and organizing the content in a clear manner. The document calls for structuring the presentation effectively and focusing on key details.
The document provides an overview of a presentation on types of research given by Manoj Patel. It defines research and lists its main objectives as extending knowledge, revealing hidden facts, generalizing laws, and verifying existing theories and facts. The presentation then describes several common types of research, including descriptive and analytical research, applied and fundamental research, quantitative and qualitative research, conceptual and empirical research, and others. It provides examples to illustrate the differences between each type.
CLASSIFICATION OF RESEARCH BY PURPOSE & METHOD - Dr.Shazia Zamir
This document classifies research by purpose and method. For purpose, it discusses basic vs applied research, research and development, and evaluative research. For method, it discusses historical research which describes past conditions, descriptive research which describes present data and characteristics, and experimental research which manipulates variables to discern effects.
There are many ways to classify research, including by purpose, goal, level of investigation, type of analysis, scope, choice of answers to problems, statistical content, and time element. Some of the main classifications are basic/pure research conducted for intellectual purposes versus applied research which tests theories in practice, quantitative research which uses statistics versus non-quantitative, and historical research which describes the past versus descriptive or experimental.
The document outlines nine basic research methods and their typical steps:
1. Historical Research aims to reconstruct the past objectively. Key steps include defining the problem, collecting primary and secondary data, and evaluating findings.
2. Descriptive Research systematically describes facts and characteristics of a population. Key steps include defining objectives, designing data collection, collecting data, and reporting results.
3. Developmental Research investigates patterns of growth over time. Key steps include defining the problem, reviewing literature, designing the approach, collecting data, and evaluating results.
The document outlines key aspects of research methodology including:
1. The objectives of research such as defining problems, formulating hypotheses, collecting and evaluating data, making deductions, and testing conclusions.
2. The different types of research including descriptive, applied, quantitative, conceptual, empirical, qualitative, fundamental, and analytical research.
3. The methods of collecting data including primary methods like questionnaires, observations, interviews, and schedules and secondary methods of collecting published and unpublished data from various sources.
This document discusses research methodology. It defines research and describes key aspects of conducting research including defining problems, formulating hypotheses, collecting and analyzing data, reaching conclusions, and testing conclusions. It also discusses different types of research based on their application, objectives, and inquiry mode. Finally, it outlines important qualities of a good researcher including having an analytical mind, being able to engage people, and staying calm under pressure.
Research is the systematic and objective analysis and recording of controlled observations that may lead to the development of generalizations, principles, or theories, resulting in prediction and possible control of events.
The document discusses various types of research including applied research, basic research, correlational research, descriptive research, ethnographic research, experimental research, and exploratory research. Applied research seeks practical solutions to problems, while basic research expands knowledge without a direct application. Correlational research examines relationships between variables without determining cause and effect. Descriptive research provides accurate portrayals of characteristics, and ethnographic research involves in-depth study of cultures. Experimental research establishes cause-and-effect through controlled manipulation of variables.
This document provides an overview of key concepts in research methodology, including:
1. It defines research as an organized and systematic process of finding answers to questions through a defined set of steps and procedures.
2. It discusses different types of research including quantitative, qualitative, basic, applied, longitudinal, descriptive, classification, comparative, exploratory, explanatory, causal, theory testing, and theory building research.
3. It also discusses alternatives to research-based knowledge such as relying on authority, tradition, common sense, media, and personal experience.
This document describes the development of a text-to-speech synthesizer for the Pali language. It discusses previous work on speech synthesis systems for Indian languages. It then outlines the methodology used, including developing a phone set and speech database for Pali, and using a unit selection approach for speech synthesis. The system was evaluated based on the naturalness of the synthesized speech output. Results showed smooth spectral changes at concatenation points and uniform spectral changes across syllable boundaries, indicating the system produces intelligible synthetic Pali speech.
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes... - IJERA Editor
Marathi is one of the oldest languages in India. This research paper describes the development of a Marathi
Text-to-Speech (TTS) system. In the Marathi TTS, the input is Marathi text in Unicode. The voices are sampled from real
recorded speech. The objective of a text-to-speech system is to convert an arbitrary text into its corresponding
spoken waveform. Speech synthesis is the process of building machinery that can generate human-like speech
from any text input to imitate human speakers. Text processing and speech generation are the two main components
of a text-to-speech system. To build a natural-sounding speech synthesis system, it is essential that the text
processing component produces an appropriate sequence of phonemic units. Generating the sequence of phonetic
units for a given standard word is referred to as a letter-to-phoneme or text-to-phoneme rule. The
complexity of these rules and their derivation depends on the nature of the language. The quality of a speech
synthesizer is judged by its closeness to the natural human voice and by its understandability. In this research paper we
describe an approach to building a Marathi TTS system using the concatenative synthesis method with the syllable as the
basic unit of concatenation.
This document describes a rule-based machine translation system for translating English text to Telugu. It discusses the challenges of developing such a system, including differences in grammar between the two languages. An algorithm is proposed that uses rules, probabilities, and rough sets to classify sentences and select the best word translations. The system works by tokenizing English sentences, tagging the words with parts of speech, looking up word translations in a bilingual dictionary, and concatenating the Telugu words to form the output sentence.
The document describes the implementation of a natural sounding speech synthesizer for the Marathi language using English text input. It discusses concatenative speech synthesis using a unit selection approach. Over 28,580 syllables, words and sentences recorded from a female speaker were used to create an inventory of speech units. The synthesizer was tested and able to generate natural sounding output and waveforms. Formant frequencies were analyzed using MATLAB and PRAAT tools to evaluate the quality of the synthesized speech.
A Corpus-Based Concatenative Speech Synthesis System for Marathi - iosrjce
Role of language engineering to preserve endangered languages - Dr. Amit Kumar Jha
Role of Language Engineering to Preserve Endangered Languages discusses how language engineering can help preserve endangered languages through documentation and digitization. Language engineering is the application of computer science to develop language-related software and hardware. It involves techniques like speech and text processing to develop systems that can understand, interpret, and generate human language. Documenting endangered languages through recording speech samples and collecting texts is important for preservation. Language engineering makes this documentation process easier through tools like speech-to-text, text-to-speech, and transcription tools. It also allows for digital storage of language data, which helps preserve languages for longer as digital data is more durable than other forms of storage. Developing applications that use endangered languages, like translation systems,
Natural Language Processing: State of The Art, Current Trends and Challenges - antonellarose
Diksha Khurana (1), Aditya Koli (1), Kiran Khatter (1,2) and Sukhdev Singh (1,2)
(1) Department of Computer Science and Engineering, Manav Rachna International University, Faridabad-121004, India
(2) Accendere Knowledge Management Services Pvt. Ltd., India
Language engineering applies knowledge of human language to develop computer systems that can understand, interpret, and generate human language. It uses techniques implemented in software and language resources stored in repositories. Key areas of language engineering include speech recognition, natural language processing, language translation, and more. It relies on resources like lexicons, grammars, and corpora to analyze different aspects of language.
Phrase identification is one of the most critical and widely studied tasks in natural language processing (NLP). Verb phrase identification within a sentence is useful for a variety of NLP applications. One of the core enabling technologies required in NLP applications is morphological analysis. This paper presents a Myanmar verb phrase identification and translation algorithm and develops a Markov model with morphological analysis. The system is based on a rule-based maximum matching approach. In machine translation, a large amount of information is needed to guide the translation process. Myanmar is an inflected language, and few lexicons have been created or studied for it compared with languages such as English, French and Czech. The paper therefore proposes a Myanmar verb phrase identification and translation model based on the syntactic structure and morphology of the Myanmar language, using a Myanmar-English bilingual lexicon. A Markov model is also used to reformulate the translation probability of phrase pairs. Experimental results showed that the proposed system can improve translation quality by applying morphological analysis to the Myanmar language.
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework - IRJET Journal
This document describes a text-to-speech synthesis system for the Hindi language developed using the Festival framework. The system takes Hindi text as input and outputs synthesized speech. It uses a syllable-based concatenative approach where Hindi words are segmented into syllables which are then matched to recorded audio files and concatenated to generate speech. Challenges in developing text-to-speech for Hindi include accurate pronunciation rules and producing natural prosody. The system aims to improve the naturalness of synthesized Hindi speech output.
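The syllable-based concatenative idea summarized above can be sketched in a few lines: split a word into units for which a recording exists, then join the recordings in order. The sketch below is only an illustration under stated assumptions. It uses greedy longest-match segmentation (not necessarily the rule set used with Festival), Latin transliteration in place of Devanagari, and short lists of numbers standing in for recorded audio; `segment`, `synthesize`, and the inventory contents are hypothetical.

```python
def segment(word, inventory):
    """Greedy longest-match split of `word` into units present in `inventory`."""
    units, i = [], 0
    longest = max(map(len, inventory))
    while i < len(word):
        # Try the longest candidate first, then progressively shorter ones.
        for size in range(min(longest, len(word) - i), 0, -1):
            piece = word[i:i + size]
            if piece in inventory:
                units.append(piece)
                i += size
                break
        else:
            raise KeyError(f"no recorded unit covers {word[i:]!r}")
    return units


def synthesize(word, inventory):
    """Concatenate the stored samples of each unit, in order."""
    samples = []
    for unit in segment(word, inventory):
        samples.extend(inventory[unit])
    return samples


# Hypothetical inventory: unit -> placeholder "audio samples".
inventory = {"na": [1, 2], "ma": [3], "ste": [4, 5], "n": [9]}
units = segment("namaste", inventory)
audio = synthesize("namaste", inventory)
```

Plain concatenation like this leaves audible discontinuities at the joins, which is one reason the summary lists natural prosody as a challenge.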
Implementation of Marathi Language Speech Databases for Large Dictionary - iosrjce
This document presents a mini project report on developing a text-to-speech synthesizer using .NET Framework. The objectives are to help visually impaired people read text and enable machines to communicate verbally. It describes the theoretical background of speech synthesis, the front-end and back-end processes, and details the use of concatenative synthesis using a recorded speech database. The document outlines the code, demonstrates the text-to-speech conversion, and discusses applications, advantages, limitations and future enhancements.
Emotional telugu speech signals classification based on k nn classifier - eSAT Journals
Abstract: Speech processing is the study of speech signals and the methods used to process them. It is employed in applications such as speech coding, speech synthesis, speech recognition and speaker recognition. In speech classification, computing prosody effects from speech signals plays a major role. In emotional speech signals, pitch and frequency are the most important parameters. Normally, the pitch values of sad and happy speech signals differ greatly, and the frequency of happy speech is higher than that of sad speech. In some cases, however, the frequency of happy speech is nearly the same as that of sad speech, and it is then difficult to recognize the speech signal correctly. To reduce this drawback, this paper proposes a Telugu speech emotion classification system that uses three features (Energy Entropy, Short Time Energy and Zero Crossing Rate) and a K-NN classifier. Features are extracted from the speech signals and passed to the K-NN classifier. The implementation results show the effectiveness of the proposed system in classifying Telugu speech signals based on their prosody effects. The performance of the proposed system is evaluated by cross-validation on the Telugu speech database. Keywords: Emotion Classification, K-NN classifier, Energy Entropy, Short Time Energy, Zero Crossing Rate.
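The three features named in the abstract and the K-NN vote can be sketched as follows. This is a minimal illustration using common textbook definitions, not the paper's code: the frame length, the number of sub-blocks used for energy entropy, the Euclidean distance, and the function names are all assumptions.

```python
import math
from collections import Counter


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def energy_entropy(frame, n_blocks=4):
    """Shannon entropy (bits) of the energy distribution over equal sub-blocks."""
    size = len(frame) // n_blocks
    energies = [sum(x * x for x in frame[i * size:(i + 1) * size])
                for i in range(n_blocks)]
    total = sum(energies)
    if total == 0:
        return 0.0
    probs = [e / total for e in energies if e > 0]
    return -sum(p * math.log2(p) for p in probs)


def features(frame):
    """Feature vector in the order listed in the abstract."""
    return (energy_entropy(frame), short_time_energy(frame), zero_crossing_rate(frame))


def knn_predict(train, query, k=3):
    """Majority label among the k training vectors nearest to `query`.

    train: list of (feature_tuple, label) pairs.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]
```

In practice the features would be computed per frame over a whole utterance and then summarized (for example, averaged) before classification; the abstract does not say which summary is used, so that step is left out here.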
This document discusses improving the word accuracy of an automatic speech recognition (ASR) system for the Telugu language. It analyzes the substitution errors in the system using two different lexical models - one based on stress-timed English phonemes (CMU lexicon) and one handcrafted lexicon for syllable-timed Telugu (UOH lexicon). The UOH lexicon improves word accuracy by 20-30% compared to the CMU lexicon by better modeling the phonetic characteristics of Telugu. The paper also examines the effect of gender, accents, and non-native speakers on substitution errors and the resulting confusion matrices provide insight into the most commonly substituted phonemes.
An expert system for automatic reading of a text written in standard arabic - ijnlc
In this work we present our expert system for automatic reading, or speech synthesis, of text
written in Standard Arabic. Our work is carried out in two major stages: the creation of the sound
database, and the transformation of the written text into speech (Text To Speech, TTS). This transformation is
done first by a Phonetic Orthographical Transcription (POT) of any written Standard Arabic text, with
the aim of transforming it into its corresponding phonetic sequence, and second by the generation of
the voice signal that corresponds to the transcribed chain. We describe the design of
the system, as well as the results obtained compared with other works on TTS for
Standard Arabic.
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ... - Syeful Islam
Hundreds of millions of people of almost all levels of education and attitudes, from different countries, communicate with each other for different purposes using various languages. Machine translation is in high demand because of the growing use of web-based communication. One of the major problems in Bengali translation is identifying a naming word in a sentence, which is relatively simple in English because such entities start with a capital letter. Bangla has no concept of small or capital letters, and a huge number of different named entities exist in Bangla, so it is difficult to tell whether a word is a naming word or not. Here we introduce a new approach to identifying naming words in a Bengali sentence for a machine translation system without storing a huge number of named entities in a word dictionary. The goal is to make Bangla sentence conversion possible while storing a minimal number of words in the dictionary.
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESLinda Garcia
This document summarizes several existing grammar checkers for various natural languages. It discusses rule-based, statistical, and hybrid approaches to grammar checking. Grammar checkers described include those for Afan Oromo, Amharic, Swedish, Icelandic, Nepali, and Portuguese. The document analyzes the approaches, methodologies, advantages, and limitations of each grammar checker.
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGEScsandit
This document summarizes and reviews various grammar checkers for natural languages. It begins by defining key concepts in natural language processing like computational linguistics and grammar checking. It then describes the general working of grammar checkers, which involves preprocessing text, analyzing morphology and syntax, and identifying grammatical errors. The document surveys grammar checking approaches for several languages like rule-based, statistical, and hybrid methods. Specific grammar checkers are discussed for languages like Afan Oromo, Amharic, Swedish, Icelandic, Nepali, and Portuguese. The review concludes by analyzing the features and limitations of existing grammar checking systems.
A Context-based Numeral Reading Technique for Text to Speech Systems IJECEIAES
This paper presents a novel technique for context based numeral reading in Indian language text to speech systems. The model uses a set of rules to determine the context of the numeral pronunciation and is being integrated with the waveform concatenation technique to produce speech out of the input text in Indian languages. For this purpose, the three Indian languages Odia, Hindi and Bengali are considered. To analyze the performance of the proposed technique, a set of experiments are performed considering different context of numeral pronunciations and the results are compared with existing syllable-based technique. The results obtained from different experiments shows the effectiveness of the proposed technique in producing intelligible speech out of the entered text utterances compared to the existing technique even with very less storage and execution time.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
This document summarizes a research paper that proposes using an artificial neural network tuned by a simulated annealing algorithm for real-time credit card fraud detection. The paper describes how simulated annealing can be used to train the weights of a neural network model to classify credit card transactions as fraudulent or non-fraudulent based on attributes of past transactions. The algorithm is tested on a real-world credit card transaction dataset and is found to effectively classify most transactions correctly, though some misclassifications still occur.
Wireless sensor networks (WSN) have been widely used in various applications.
In these networks nodes collect data from the attached sensors and send their data to a base
station. However, nodes in WSN have limited power supply in form of battery so the nodes
are expected to minimize energy consumption in order to maximize the lifetime of WSN. A
number of techniques have been proposed in the literature to reduce the energy
consumption significantly. In this paper, we propose a new clustering based technique
which is a modification of the popular LEACH algorithm. In this technique, first cluster
heads are elected using the improved LEACH algorithm as usual, and then a cluster of
nodes is formed based on the distance between node and cluster head. Finally, data from
node is transferred to cluster head. Cluster heads forward data, after applying aggregation,
to the cluster head that is closer to it than sink in forward direction or directly to the sink.
This reduction in distance travelled improves the performance over LEACH algorithm
significantly.
This document provides an overview of vertical handover decision strategies in heterogeneous wireless networks. It begins with an introduction to always best connectivity requirements in next generation networks that allow users to move between different network technologies. It then discusses the key aspects of handover management, including the three phases of initiation, decision, and execution. Various criteria for the handover decision process are described, such as received signal strength, network connection time, available bandwidth, power consumption, cost, security, and user preferences. Different types of handover decision strategies are categorized, including those based on network conditions, user preferences, multiple attributes, fuzzy logic/neural networks, and context awareness. The strategies are analyzed and their advantages/disadvantages compared.
This paper presents the design and performance comparison of a two stage
operational amplifier topology using CMOS and BiCMOS technology. This conventional op
amp circuit was designed by using RF model of BSIM3V3 in 0.6 μm CMOS technology and
0.35 μm BiCMOS technology. Both the op amp circuits were designed and simulated,
analyzed and performance parameters are compared. The performance parameters such as
gain, phase margin, CMRR, PSRR, power consumption etc achieved are compared. Finally,
we conclude the suitability of CMOS technology over BiCMOS technology for low power
RF design.
In Cognitive Radio Networks (CRN), Cooperative Spectrum Sensing (CSS) is
used to improve performance of spectrum sensing techniques used for detection of licensed
(Primary) user’s signal. In CSS, the spectrum sensing information from multiple unlicensed
(Secondary) users are combined to take final decision about presence of primary signal. The
mixing techniques used to generate final decision about presence of PU’s signal are also
called as Fusion techniques / rules. The fusion techniques are further classified as data
fusion and decision fusion techniques. In data fusion technique all the secondary users
(SUs) share their raw information of spectrum detection like detected energy or other
statistical information, while in decision fusion technique all the SUs take their local
decisions and share the decision by sending ‘0’ or ‘1’ corresponding to absence and presence
of PU’s signal respectively. The rules used in decision fusion techniques are OR rule, AND
rule and K-out-of-N rule. The CSS is further classified as distributed CSS and centralized
CSS. In distributed CSS all the SUs share the spectrum detection information with each
other and by mixing the shared information; all the SUs take final decision individually. In
centralized CSS all the SUs send their detected information to a secondary base station /
central unit which combines the shared information and takes final decision. The secondary
base station shares the final decision with all the SUs in the CRN. This paper covers
overview of information fusion methods used for CSS and analysis of decision fusion rules
with simulation results.
This paper analyzes the impact of network scalability on various physical attributes of Zigbee networks. Simulations were conducted using Qualnet to evaluate the performance of the Zigbee physical layer based on energy consumption and throughput. Energy consumption was analyzed for different modulation schemes (ASK, BPSK, OQPSK), network sizes (2-50 nodes), and clear channel assessment modes. The results showed that OQPSK and ASK had lower energy consumption than BPSK. Throughput was highest for OQPSK. While carrier sense had slightly higher throughput than other CCA modes, the energy consumption differences between CCA modes were minor.
This paper gives a brief idea of the moving objects tracking and its application.
In sport it is challenging to track and detect motion of players in video frames. Task
represents optical flow analysis to do motion detection and particle filter to track players
and taking consideration of regions with movement of players in sports video. Optical flow
vector calculation gives motion of players in video frame. This paper presents improved
Luacs Kanade algorithm explained for optical flow computation for large displacement and
more accuracy in motion estimation.
A rapid progress is seen in the field of robotics both in educational and industrial
automation sectors. The Robotics education in particular is gaining technological advances
and providing more learning opportunities. In automotive sector, there is a necessity and
demand to automate daily human activities by robot. With such an advancement and
demand for robotics, the realization of a popular computer game will help students to learn
and acquire skills in the field of robotics. The computer game such as Pacman offers
challenges on both software and hardware fronts. In software, it provides challenges in
developing algorithms for a robot to escape from the pool of attacking robots and to develop
algorithms for multiple ghost robots to attack the Pacman. On the hardware front, it
provides a challenge to integrate various systems to realize the game. This project aims to
demonstrate the pacman game in real world as well as in simulation. For simulation
purpose Player/Stage is used to develop single-client and multi-client architectures. The
multi- client architecture in player/stage uses one global simulation proxy to which all the
robot models are connected. This reduces the overhead to manage multiple robots proxy.
The single-client architecture enables only two robot models to connect to the simulation
proxy. Multi-client approach offers flexibility to add sensors to each port which will be used
distinctly by the client attached to the respective robot. The robots are named as Pacman
and Ghosts, which try to escape and attack respectively. Use of Network Camera has been
done to detect the global positions of the robots and data is shared through inter-process
communication.
In Content-Based Image Retrieval (CBIR) systems, the visual contents of the
images in the database are took out and represented by multi-dimensional characteristic
vectors. A well known CBIR system that retrieves images by unsupervised method known
as cluster based image retrieval system. For enhancing the performance and retrieval rate
of CBIR system, we fuse the visual contents of an image. Recently, we developed two
cluster-based CBIR systems by fusing the scores of two visual contents of an image. In this
paper, we analyzed the performance of the two recommended CBIR systems at different
levels of precision using images of varying sizes and resolutions. We also compared the
performance of the recommended systems with that of the other two existing CBIR systems
namely UFM and CLUE. Experimentally, we find that the recommended systems
outperform the other two existing systems and one recommended system also comparatively
performed better in every resolution of image.
Information Systems and Networks are subjected to electronic attacks. When
network attacks hit, organizations are thrown into crisis mode. From the IT department to
call centers, to the board room and beyond, all are fraught with danger until the situation is
under control. Traditional methods which are used to overcome these threats (e.g. firewall,
antivirus software, password protection etc.) do not provide complete security to the system.
This encourages the researchers to develop an Intrusion Detection System which is capable
of detecting and responding to such events. This review paper presents a comprehensive
study of Genetic Algorithm (GA) based Intrusion Detection System (IDS). It provides a
brief overview of rule-based IDS, elaborates the implementation issues of Genetic Algorithm
and also presents a comparative analysis of existing studies.
Step by step operations by which we make a group of objects in which attributes
of all the objects are nearly similar, known as clustering. So, a cluster is a collection of
objects that acquire nearly same attribute values. The property of an object in a cluster is
similar to other objects in same cluster but different with objects of other clusters.
Clustering is used in wide range of applications like pattern recognition, image processing,
data analysis, machine learning etc. Nowadays, more attention has been put on categorical
data rather than numerical data. Where, the range of numerical attributes organizes in a
class like small, medium, high, and so on. There is wide range of algorithm that used to
make clusters of given categorical data. Our approach is to enhance the working on well-
known clustering algorithm k-modes to improve accuracy of algorithm. We proposed a new
approach named “High Accuracy Clustering Algorithm for Categorical datasets”.
Brain tumor is a malformed growth of cells within brain which may be
cancerous or non-cancerous. The term ‘malformed’ indicates the existence of tumor. The
tumor may be benign or malignant and it needs medical support for further classification.
Brain tumor must be detected, diagnosed and evaluated in earliest stage. The medical
problems become grave if tumor is detected at the later stage. Out of various technologies
available for diagnosis of brain tumor, MRI is the preferred technology which enables the
diagnosis and evaluation of brain tumor. The current work presents various clustering
techniques that are employed to detect brain tumor. The classification involves classification
of images into normal and malformed (if detected the tumor). The algorithm deals with
steps such as preprocessing, segmentation, feature extraction and classification of MR brain
images. Finally, the confirmatory step is specifying the tumor area by technique called
region of interest.
A Proxy signature scheme enables a proxy signer to sign a message on behalf of
the original signer. In this paper, we propose ECDLP based solution for chen et. al [1]
scheme. We describe efficient and secure Proxy multi signature scheme that satisfy all the
proxy requirements and require only elliptic curve multiplication and elliptic curve addition
which needs less computation overhead compared to modular exponentiations also our
scheme is withstand against original signer forgery and public key substitution attack.
This document proposes a digital watermarking technique using LSB replacement with secret key insertion for enhanced data security. The technique works by inserting a watermark into the least significant bits of pixels in an image. A secret key is also inserted during transmission for additional security. The watermarked image is generated without noticeably impacting image quality. The proposed method was tested on sample images and successfully embedded watermarks while maintaining visual quality. The technique aims to provide copyright protection and authentication of digital images and documents.
Today among various medium of data transmission or storage our sensitive data
are not secured with a third-party, that we used to take help of. Cryptography plays an
important role in securing our data from malicious attack. This paper present a partial
image encryption based on bit-planes permutation using Peter De Jong chaotic map for
secure image transmission and storage. The proposed partial image encryption is a raw data
encryption method where bits of some bit-planes are shuffled among other bit-planes based
on chaotic maps proposed by Peter De Jong. By using the chaotic behavior of the Peter De
Jong map the position of all the bit-planes are permuted. The result of the several
experimental, correlation analysis and sensitivity test shows that the proposed image
encryption scheme provides an efficient and secure way for real-time image encryption and
decryption.
This paper presents a survey of Dependency Analysis of Service Oriented
Architecture (SOA) based systems. SOA presents newer aspects of dependency analysis due
to its different architectural style and programming paradigm. This paper surveys the
previous work taken on dependency analysis of service oriented systems. This study shows
the strengths and weaknesses of current approaches and tools available for dependency
analysis task in context of SOA. The main motivation of this work is to summarize the
recent approaches in this field of research, identify major issue and challenges in
dependency analysis of SOA based systems and motivate further research on this topic.
In this paper, proposed a novel implementation of a Soft-Core system using
micro-blaze processor with virtex-5 FPGA. Till now Hard-Core processors are used in
FPGA processor cores. Hard cores are a fixed gate-level IP functions within the FPGA
fabrics. Now the proposed processor is Soft-Core Processor, this is a microprocessor fully
described in software, usually in an HDL. This can be implemented by using EDK tool. In
this paper, developed a system which is having a micro-blaze processor is the combination
of both hardware & Software. By using this system, user can control and communicate all
the peripherals which are in the supported board by using Xilinx platform to develop an
embedded system. Implementing of Soft-Core process system with different peripherals like
UART interface, SPA flash interface, SRAM interface has to be designed using Xilinx
Embedded Development Kit (EDK) tools.
The article presents a simple algorithm to construct minimum spanning tree and
to find shortest path between pair of vertices in a graph. Our illustration includes the proof
of termination. The complexity analysis and simulation results have also been included.
Wimax technology has reshaped the framework of broadband wireless internet
service. It provides the internet service to unconnected or detached areas such as east South
Africa, rural areas of America and Asia region. Full duplex helpers employed with one of
the relay stations selection and indexing method that is Randomized Distributed Space Time
are used to expand the coverage area of primary Wimax station. The basic problem was
identified at cell edge due to weather conditions (rain, fog), insertion of destruction because
of multiple paths in the same communication channel and due to interference created by
other users in that communication. It is impractical task for the receiver station to decode
the transmitted signal successfully at the cell edges, which increases the high packet loss and
retransmissions. But Wimax is a outstanding technology which is used for improving the
quality of internet service and also it offers various services like Voice over Internet
Protocol, Video conferencing and Multimedia broadcast etc where a little delay in packet
transmission can cause a big loss in the communication. Even setup and initialization of
another Wimax station nearer to each other is not a good alternate, where any mobile
station can easily handover to another base station if it gets a strong signal from other one.
But in rural areas, for few numbers of customers, installation of base station nearer to each
other is costlier task. In this review article, we present a scheme using R-DSTC technique to
choose and select helpers (relay nodes) randomly to expand the coverage area and help to
mobile station as a helper to provide secure communication with base station. In this work,
we use full duplex helpers for better utilization of bandwidth.
Radio Frequency identification (RFID) technology has become emerging
technique for tracking and items identification. Depend upon the function; various RFID
technologies could be used. Drawback of passive RFID technology, associated to the range
of reading tags and assurance in difficult environmental condition, puts boundaries on
performance in the real life situation [1]. To improve the range of reading tags and
assurance, we consider implementing active backscattering tag technology. For making
mobiles of multiple radio standards in 4G network; the Software Defined Radio (SDR)
technology is used. Restrictions in Existing RFID technologies and SDR technology, can be
eliminated by the development and implementation of the Software Defined Radio (SDR)
active backscattering tag compatible with the EPC global UHF Class 1 Generation 2 (Gen2)
RFID standard. Such technology can be used for many of applications and services.
text to a spoken waveform. Telugu TTS is mainly intended for people who cannot read, and it also serves as an aid
for the visually impaired and for language education. It can also be used in other applications such as talking
books and toys, games, telecommunication and multimedia.
Synthesized speech can be produced by different methods, which are classified into three groups [2]:
articulatory synthesis, which attempts to model the human speech production system directly through
articulators such as the tongue and jaw; formant synthesis, which excites a set of resonators with a voicing
source or noise generator to achieve the desired speech spectrum; and concatenative synthesis, which uses
pre-recorded samples derived from natural speech. Most synthesis systems use the formant and
concatenative methods. The articulatory method is too difficult for high-quality implementations [3], but may
emerge as a viable method in the future. In this work a Telugu text to speech system has been implemented
using concatenative synthesis to produce natural-sounding Telugu speech.
II. CONCATENATIVE SYNTHESIS
The naturalness of synthetic speech produced by state-of-the-art speech synthesis systems is mainly attributed
to the use of concatenative speech synthesis, which uses phonemes, diphones, syllables, words or sentences as
basic speech units. Text is synthesized by selecting appropriate units from a speech database and
concatenating them. This concatenation of segments of recorded speech is known as concatenative
synthesis. Connecting pre-recorded natural utterances is the easiest way to produce intelligible and
natural-sounding speech.
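The select-and-concatenate idea can be sketched as follows. This is a minimal illustration, not the implementation used in this work: the unit labels, the sample values and the names UNIT_DB and synthesize are all hypothetical, and a real system would store recorded speech and smooth the joins between units.

```python
# Hypothetical unit database: unit label -> list of PCM samples.
# A real database holds recorded speech; these short lists are placeholders.
UNIT_DB = {
    "ka": [1, 2, 3],
    "ma": [4, 5],
    "la": [6, 7, 8, 9],
}


def synthesize(units, db=UNIT_DB):
    """Concatenate the stored samples of each requested unit, in order."""
    waveform = []
    for unit in units:
        if unit not in db:
            # No recording exists for this unit, so it cannot be synthesized.
            raise KeyError(f"no recording for unit {unit!r}")
        waveform.extend(db[unit])
    return waveform


if __name__ == "__main__":
    print(synthesize(["ka", "ma", "la"]))
```

The sketch joins units end to end without any smoothing, which is the simplest possible form of concatenation.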
Concatenative synthesis is classified into three main sub-types.
A. Unit selection synthesis
Unit selection synthesis uses large databases of recorded speech.
B. Domain-specific synthesis
Domain-specific synthesis concatenates pre-recorded words and phrases to create complete utterances. It
is used in applications such as transit schedule announcements, weather reports and railway station
announcements, where most of the text remains the same and the output is limited to a specific domain.
C. Diphone synthesis
Diphone synthesis considers only the diphones occurring in a language and maintains a minimal speech
database, which contains only one example of each diphone. The quality of
the resulting speech is high and natural [4]. In the present work, diphone synthesis has been adopted to
develop the Telugu TTS.
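As a rough illustration of how a phoneme sequence maps onto diphone units, the sketch below pairs each phoneme with its successor. The function name to_diphones, the "_" silence marker and the "a-b" label format are assumptions made for this example and are not taken from the system described here.

```python
def to_diphones(phonemes, silence="_"):
    """Return the diphone labels covering a phoneme sequence.

    Each diphone spans the transition from one phoneme to the next.
    The sequence is padded with a silence marker so that the first and
    last phonemes also get a transition unit.
    """
    padded = [silence] + list(phonemes) + [silence]
    return [f"{a}-{b}" for a, b in zip(padded, padded[1:])]


if __name__ == "__main__":
    # Hypothetical phoneme sequence, for illustration only.
    print(to_diphones(["k", "a", "m"]))
```

Because the database holds one recording per diphone label, each label produced this way identifies exactly one stored segment.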
III. FRAME WORK OF TELUGU TEXT TO SPEECH SYSTEM
Telugu is now one of the 5 classical languages of India. It ranks third by the number
of native speakers in India. The block diagram of the Telugu Text To Speech (TTS) system is shown in Fig. 1. The
explanation of each block is as follows.
A. Telugu Text Input
The Telugu text to speech system accepts Telugu Unicode text [5] (in UTF-8 encoding) as input and speaks
it out.
B. Text Analysis
Text analysis amounts to text normalization [6]. It converts raw text into the equivalent written-out
words and isolates the words present in the text. Text normalization searches for numbers, times, dates,
and other symbolic representations, which are analysed and converted into words. Text analysis comprises
tokenization, token identification and token to word conversion.
1. Tokenization: This process converts the string of characters into a list of tokens; that is, the
original text is split at the whitespace between tokens.
2. Token Identification: This step identifies the general type of each token, for example whether a string
of digits is a year, a date or a number.
3. Token to word mapping: This module provides the rules that map the tokens in an utterance to Telugu
words. The database contains a default list of Telugu dotted abbreviations.
Examples: ("ఉ" "ఉదయం"), ("%" "శాతం"); fractions such as "1/4" and digits such as "2" are likewise mapped to Telugu words.
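The three text analysis steps can be sketched in Python as follows. This is only an illustrative outline under stated assumptions: the function names, the tiny abbreviation table and the digit-by-digit reading of numbers are hypothetical simplifications, not the rules used in the actual system.

```python
import re

# Telugu words for the digits 0-9.
DIGIT_WORDS = ["సున్నా", "ఒకటి", "రెండు", "మూడు", "నాలుగు",
               "ఐదు", "ఆరు", "ఏడు", "ఎనిమిది", "తొమ్మిది"]

# Hypothetical token-to-word table; a real system would hold many more entries.
TOKEN_WORDS = {"ఉ": "ఉదయం", "%": "శాతం"}


def tokenize(text):
    """Step 1: split the raw text at whitespace."""
    return text.split()


def identify(token):
    """Step 2: assign a coarse type to a token."""
    if re.fullmatch(r"[0-9]+", token):
        return "number"
    if token in TOKEN_WORDS:
        return "abbreviation"
    return "word"


def to_words(token):
    """Step 3: map a token to one or more written-out words."""
    kind = identify(token)
    if kind == "number":
        # Simplification: read the number digit by digit.
        return [DIGIT_WORDS[int(digit)] for digit in token]
    if kind == "abbreviation":
        return [TOKEN_WORDS[token]]
    return [token]


def normalize(text):
    """Run all three steps and return the list of words to be spoken."""
    words = []
    for token in tokenize(text):
        words.extend(to_words(token))
    return words


if __name__ == "__main__":
    print(normalize("2 %"))
```

A fuller token identifier would also distinguish years, dates and times, since each of these is read out differently.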
Figure 1: Block Diagram of Telugu Text To Speech (TTS) System
The text pre-processing flow for an example input beginning with the digit “2” is shown in Fig. 2.
Figure 2: Flow diagram of Text Pre-Processing
C. Pronunciation Generation
The pronunciation generation module generates the sequence of basic units using a lexicon of units and
letter-to-sound rules.
1. Lexicon: The lexicon is a subsystem that provides pronunciations for words. It is a list of all speech units
such as monosyllables, bisyllables and trisyllables. A lexical entry consists of three basic elements: a head
word, a part of speech and a pronunciation. Each entry has an internal format identifying syllable structure,
stress markings and phones. Some examples of lexical entries are shown in Table I.
TABLE I. EXAMPLES OF LEXICAL ENTRIES
Head word    Part of speech    Pronunciation
walkers      n-noun            ((( w oo ) 1) (( k @ z ) 0))
monument     n-noun            ((( m o ) 1) (( n y u ) 0) (( m @ n t ) 0))
present      v-verb            ((( p r e ) 0) (( z @ n t ) 1))
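To make the structure of a lexical entry concrete, the sketch below stores two of the entries from the table as Python data and extracts the phone sequence and the stress pattern. The dictionary layout and the function names are assumptions chosen for this illustration; they do not describe the storage format of the actual lexicon.

```python
# Each entry is keyed by (head word, part of speech) and holds a list of
# syllables; a syllable is (list of phones, stress mark).
LEXICON = {
    ("present", "v"): [(["p", "r", "e"], 0), (["z", "@", "n", "t"], 1)],
    ("monument", "n"): [(["m", "o"], 1), (["n", "y", "u"], 0),
                        (["m", "@", "n", "t"], 0)],
}


def lookup(word, pos):
    """Return the syllable list for a word, or None if it is not listed."""
    return LEXICON.get((word, pos))


def phones(entry):
    """Flatten the syllables of an entry into one phone sequence."""
    return [phone for syllable, _stress in entry for phone in syllable]


def stress_pattern(entry):
    """Return the stress mark of each syllable, in order."""
    return [stress for _syllable, stress in entry]


if __name__ == "__main__":
    entry = lookup("present", "v")
    print(phones(entry), stress_pattern(entry))
```

Keying on the part of speech as well as the head word lets one spelling carry different pronunciations, as with the verb and noun readings of "present".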
2. Letter to Sound Rules: It is practically impossible to list every word and assign its pronunciation in a
lexicon, so words missing from the lexicon are pronounced using letter-to-sound rules. The basic letter-to-sound
rule is very simple but powerful enough to build reasonably complex letter-to-sound rule sets. The basic form
of a rule is as follows [6]:
(LEFT CONTEXT [ITEMS] RIGHT CONTEXT = NEWITEMS)
( # [ c h ] C = k ) ; # - a word boundary, C - the set of all consonants.
E.g.: 1. christmas: #[ch]r = k, 2. champion: #[ch]a = ch.
In these examples, ch followed by a consonant is pronounced as ‘K’ and ch followed by a vowel is pronounced
as ‘Cha’.
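The way such context-dependent rules are applied can be sketched in Python. The matcher below is a simplified illustration written for this example, not Festival's rule engine: it supports only a single-symbol left and right context, the word-boundary symbol `#`, the consonant class `C`, and a fallback that copies any letter no rule covers. The second rule is a hypothetical default added so that the "champion" case is handled.

```python
VOWELS = set("aeiou")
CONSONANTS = set("abcdefghijklmnopqrstuvwxyz") - VOWELS

# (left context, items, right context, new items); "" means "any context".
RULES = [
    ("#", "ch", "C", ["k"]),   # ( # [ c h ] C = k )
    ("#", "ch", "", ["ch"]),   # hypothetical default for word-initial ch
]


def context_matches(symbol, char):
    """Check one context symbol against one character of the word."""
    if symbol == "":
        return True
    if symbol == "C":
        return char in CONSONANTS
    return symbol == char      # literal match, including the boundary "#"


def apply_rules(word, rules=RULES):
    """Rewrite a word left to right; the first matching rule wins."""
    text = "#" + word.lower() + "#"
    output = []
    i = 1
    while i < len(text) - 1:
        for left, items, right, new_items in rules:
            end = i + len(items)
            if (text.startswith(items, i)
                    and context_matches(left, text[i - 1])
                    and context_matches(right, text[end])):
                output.extend(new_items)
                i = end
                break
        else:
            output.append(text[i])   # no rule fired: copy the letter
            i += 1
    return output


if __name__ == "__main__":
    print(apply_rules("christmas"))
    print(apply_rules("champion"))
```

Rule order matters in this sketch: the more specific rule with the consonant context is listed first, and the default is tried only when it fails.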
D. Prosodic Phrasing
In natural speech, humans tend to group words together with noticeable breaks or disjunctions between
them. These groups can be identified as prosodic phrases [7]. Prosodic phrasing plays an important role in
structuring utterances by dividing them into meaningful chunks of information. Text-to-speech systems
should be able to identify these prosodic phrases to produce intelligible and natural-sounding speech. In
highly inflective languages like Telugu, most words in running text occur in inflected forms. In an effort to
identify linguistically meaningful features that affect prosodic phrasing, a new feature, namely the morpheme
tag, is defined for the Telugu language.
A morpheme is a meaningful linguistic unit consisting of a word or word element that cannot be divided into
smaller meaningful parts. A set of 19 ‘morpheme tags’ that occur at word boundaries (word endings) has been
identified and is shown in Table II.
TABLE II: LIST OF MORPHEME TAGS IN TELUGU
Telugu Morpheme Name    Example word
lO                      DhEsamlO
ThO                     PattudhalathO
Aru                     AnnAru
Ndhi                    Cheppindhi
Ani                     ChEyAlani
Lu                      VisEshAlu
Nni                     PrabhuthvAnni
Nna                     ChErukunna
Oni                     RAshtramlOni
Chi                     nu.nchi
Na                      Jarigina
Ki                      AdhupulOki
Ini                     PurOgathini
Ga                      Sandharbh.ngA
Ku                      PrAnthAlaku
Nu                      LakshyAlanu
Pai                     Charyapai
La                      Charyala
.n                      Prabhuthv.n
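One simple way to attach a morpheme tag to a transliterated word is a longest-suffix match against the tag list, sketched below in Python. This is an assumption made for illustration: the paper does not state how tags are assigned, and the case-insensitive comparison used here ignores any distinction that capital letters may carry in the transliteration.

```python
# The 19 tags from the table, written as transliterated word endings.
MORPHEME_TAGS = ["lO", "ThO", "Aru", "Ndhi", "Ani", "Lu", "Nni", "Nna",
                 "Oni", "Chi", "Na", "Ki", "Ini", "Ga", "Ku", "Nu",
                 "Pai", "La", ".n"]


def morpheme_tag(word):
    """Return the longest tag that the word ends with, or None."""
    lowered = word.lower()
    matches = [tag for tag in MORPHEME_TAGS if lowered.endswith(tag.lower())]
    return max(matches, key=len) if matches else None


if __name__ == "__main__":
    for example in ["PattudhalathO", "ChErukunna", "Jarigina"]:
        print(example, morpheme_tag(example))
```

Preferring the longest match keeps "ChErukunna" from being tagged with the shorter ending "Na" when "Nna" also fits.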
E. Segmental Duration Generation
TTS systems need to generate speech units with appropriate durations in order to produce natural-sounding
synthetic speech. The duration value for each segment of speech is predefined, and it can be changed according
to the application [8]. Classification and regression tree (CART) based duration models [9] are used for
segmental duration prediction for Telugu. The CART method builds a decision tree in which the branches
correspond to questions that minimize the impurity of the sub-clusters.
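The core step of CART training, choosing the question that minimizes the impurity of the resulting sub-clusters, can be sketched in Python. The feature names and duration values below are invented toy data, and the impurity measure (sum of squared deviations from the mean) is one common choice for regression trees, assumed here for illustration. Only a single split is shown; a full tree would apply the same step recursively to each sub-cluster.

```python
from statistics import pvariance

# Hypothetical training rows: (segment features, duration in milliseconds).
SEGMENTS = [
    ({"type": "vowel", "long": True}, 140.0),
    ({"type": "vowel", "long": True}, 150.0),
    ({"type": "vowel", "long": False}, 80.0),
    ({"type": "consonant", "long": False}, 60.0),
    ({"type": "consonant", "long": False}, 70.0),
]


def impurity(durations):
    """Sum of squared deviations from the mean of a cluster."""
    if not durations:
        return 0.0
    return pvariance(durations) * len(durations)


def best_question(rows):
    """Return (impurity, feature, value) for the best yes/no question."""
    questions = {(k, v) for features, _ in rows for k, v in features.items()}
    best = None
    for feature, value in sorted(questions, key=repr):
        yes = [d for f, d in rows if f.get(feature) == value]
        no = [d for f, d in rows if f.get(feature) != value]
        if not yes or not no:
            continue   # the question does not split the data
        score = impurity(yes) + impurity(no)
        if best is None or score < best[0]:
            best = (score, feature, value)
    return best


if __name__ == "__main__":
    print(best_question(SEGMENTS))
```

On this toy data the vowel-length question separates the durations far better than the vowel/consonant question, so it would become the root of the tree.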
F. Database
The database defines a Telugu diphone set by considering phone features such as whether the phone is a vowel or
a consonant, vowel length, vowel height, vowel frontness, lip rounding, consonant type and place of articulation [9].
G. Waveform generation
The waveform generation component takes as input the phonetic and prosodic information generated by the
various components described above, and generates the speech output through speakers.
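As a rough picture of the final concatenation step, the Python sketch below joins unit waveforms with a short linear cross-fade at each joint. The cross-fade is an assumption chosen for simplicity; the paper does not specify the joining or signal-processing method used by the waveform generator, and the sample values are toy data.

```python
def crossfade_concat(units, overlap=2):
    """Join waveforms (lists of samples), blending `overlap` samples per joint."""
    if not units:
        return []
    output = list(units[0])
    for unit in units[1:]:
        n = min(overlap, len(output), len(unit))
        for k in range(n):
            weight = (k + 1) / (n + 1)        # share of the incoming unit
            index = len(output) - n + k
            output[index] = (1 - weight) * output[index] + weight * unit[k]
        output.extend(unit[n:])               # append the non-overlapping rest
    return output


if __name__ == "__main__":
    first = [1.0, 1.0, 1.0, 1.0]
    second = [0.0, 0.0, 0.0, 0.0]
    print(crossfade_concat([first, second], overlap=2))
```

Blending a few samples at each joint avoids an abrupt jump in amplitude where two recorded units meet; with `overlap=0` the function reduces to plain concatenation.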
IV. IMPLEMENTATION
We have developed the Telugu text to speech synthesizer on a mobile device. The mobile device is a Beagle
Board, which consists of an OMAP3530 processor with the Angstrom mobile operating system ported to it, together
with a programming environment whose supporting components are implemented in C++. The flow chart of the Telugu
text to speech system on the mobile device is shown in Fig. 3.
A. Components of Telugu TTS
1. Mobile Based Device: The mobile device is a Beagle Board, an OMAP3530 platform designed
specifically to address the open source community. It uses the OMAP3530 DCBB72 device, which is the
720 MHz version of the OMAP3530. The board has many features that are useful to Open
Embedded developers; however, this project uses only a few of them. The board has been equipped with a
minimum set of features to allow the user to experience the power of the OMAP3530 [10]. By utilizing
standard interfaces, the Beagle Board is highly extensible, so many features and interfaces can be added.
Figure 3: Flow chart of Telugu TTS
The high level block diagram consists of the OMAP3530 [10] processor with S-Video, Touch Screen, Stereo In &
Out, USB Host, SD/MMC, JTAG, LCD, Expansion pins, and Reset & User buttons. The Beagle Board high level
block diagram is shown in Figure 4.
2. Software
• Linux on the Beagle Board
• Angstrom (a mobile operating system that is a Linux distribution)
Figure 4: Beagle Board High Level Block Diagram
3. Porting Angstrom OS: Create two partitions on the SD/MMC card: a FAT partition (holding MLO, u-boot and
uImage) and an Ext2 partition (holding the root file system).
The five (5) boot phases are:
• ROM loads x-load (MLO)
• X-load loads u-boot
• U-boot reads commands
• Commands load the kernel (uImage)
• Kernel reads the root file system.
V. RESULTS AND DISCUSSIONS
The results show that the Telugu text to speech system is capable of real-time operation and has been
successfully developed on a mobile device, the Beagle Board with OMAP3530. The conversion of Telugu text into
Telugu speech proceeds through various stages. The Telugu TTS system flow is shown in Fig. 5.
Figure 5: Telugu TTS system flow
To obtain English speech, the SayText command is given with the text enclosed in double quotes. The
Beagle Board terminal that utters the Telugu speech is connected to speakers. To obtain Telugu
speech, the command (voice_telugu_NSK_diphone) is given; it selects the Telugu diphone voice and makes the
Telugu diphones in the database available. The input is Telugu text saved in the file vnrtelugu.txt, whose
path is given in the command. The output is natural-sounding and clear Telugu speech.
VI. CONCLUSION
The full process of converting Telugu text to speech has been analyzed, and various methods used for storing
sound and generating voice have been studied. The system also provides the facility to save the speech file of
the input text and can play any previously saved audio file. The intermediate stages of converting Telugu text
to speech, namely text normalization, prosodic phrasing, pronunciation generation and generation of segment
durations, have been analyzed. The system follows the method of diphone concatenation and has a male voice
database with the diphone as the storage unit. A Telugu text to speech system producing natural and clear-sounding
Telugu speech has been successfully developed on a mobile device, the Beagle Board with OMAP3530; it will be
useful as an assistive tool for visually impaired and illiterate users and can be used in many other applications.
Developing text to speech systems for other Indian languages by adding prosody, and handling multilingual
text (e.g., text that mixes web addresses such as “www.eenadupratibha.net, www.bscacademy.com” with Telugu
words), is our future work. A web-based application that converts text in any Indian language
into speech can also be designed.
ACKNOWLEDGEMENTS
The authors thank Dr. C.D. Naidu, Principal, and the Management of VNR VJIET for
their constant technical and financial support and encouragement. The research work of developing a TTS for
the Telugu language is part of ITRA.
REFERENCES
[1] B. Yegnanarayana, S. Rajendran, V.R. Ramachandran, and A.S. Madhukumar, “Significance of
knowledge sources for a text-to-speech system for Indian languages”, Sadhana, pages 147–169, 1994.
[2] S. Lemmetty, “Review of Speech Synthesis Technology”, Master’s thesis, Helsinki University of
Technology, March 30, 1999.
[3] A. Chauhan, V. Singh, S.P. Tomar, and A.K. Chauhan, “A Text to Speech System for Hindi using English Language”,
International Journal of Computer Science and Technology, Vol. 2, Issue 3, 2011.
[4] G.V. Mantena, S. Rajendran, S.V. Gangashetty, B. Yegnanarayana, and K. Prahallad, “Development of a spoken dialogue
system for accessing agricultural information in Telugu language”, in Proceedings of the International Conference on
Natural Language Processing (ICON), Kharagpur, India, 2011.
[5] UTF-8 encoding table and Unicode characters, http://www.utf8-chartable.de/unicode-utf8-table.pl?start=3072&number=128.
[6] A. Black, P. Taylor, and R. Caley, “The Festival Speech Synthesis System: system documentation, for Festival version
1.4.1”, CSTR webpage, University of Edinburgh, 2001.
[7] Alan W. Black and Kevin Lenzo, “Optimal data selection for unit selection synthesis”, in ISCA
4th Speech Synthesis Workshop, 2001.
[8] S.R. Rajeshkumar, “Significance of Durational Knowledge for a Text-to-Speech System in an Indian Language”, MS
dissertation, Indian Institute of Technology, Department of Computer Science and Engg., Madras, 1990.
[9] Alan W. Black and P. Taylor, “Automatically clustering similar units for unit selection in
speech synthesis”, in Proceedings of EUROSPEECH ’97, pages 601–604, 1997.
[10] Instruction manual provided by the TI vendor: OMAP3530 Applications Processor, Texas Instruments, 2010.
AUTHORS
Dr. Y. Padma Sai obtained her B.Tech from Nagarjuna University, Guntur, and her M.E in Systems and
Signal Processing and Ph.D in Electronics and Communication Engineering from Osmania
University, Hyderabad. She started her career as a Quality Control Engineer and served for 5 years at
M/S. Suchitra Electronics Pvt. Ltd, Hyderabad. She later joined the Department of ECE at
Deccan College of Engineering and Tech, Hyderabad, as a Lecturer and served for one year. She then joined
the Department of ECE at VNRVJIET in July 1999 and has held various positions. Presently, she is the
Head of the Department. She has presented 23 research papers in national and international
conferences and journals. Her areas of research interest are Bio-Medical, Signal and Image Processing.
She has received grants from AICTE, DIT and DST to carry out research activities in the
department. She is a member of IEEE, ISTE and ISOI, and a Fellow of IETE. She is an executive member of the
ISTE A.P Section. Her main objective is to impart quality education and learn new technologies, with the aim of
bridging the gap between industry and academia.
Safia Shaik received the B.E degree in electronics and communication engineering from Deccan
College of Engineering & Technology, affiliated to Osmania University, Hyderabad, AP, India, in
2011. She is pursuing the M.Tech in Embedded Systems at VNR Vignana Jyothi Institute of
Engineering & Technology, Bachupally, Hyderabad, India. Her research interests include Signal
Processing and Embedded Systems.
V. Priyanka Brahmaiah obtained her B.Tech. degree from JNT University, Hyderabad, in
2007, and her M.Tech in VLSI System Design from JNT University, Hyderabad, in 2010. She
started her career as an Assistant Professor and served for 2 years at MLR Institute of Technology
& Management, Dundigal, Hyderabad, from June 2010 to June 2012. She was an Assistant Professor in the
Department of ECE at Gokaraju Rangaraju Institute of Engineering & Technology from July
2012 to November 2012, and has been an Assistant Professor in the Department of ECE at VNR Vignana Jyothi
Institute of Engineering & Technology since December 2012. She is a Life Member of
ISTE and IETE. She has presented four research papers in international journals. Her areas of
research interest are Bio-Medical, Signal and Image Processing, and Human Computer Interface.