SlideShare a Scribd company logo
1 of 3
Download to read offline
Mrs. Chhaya S. Patil. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 7, Issue 3, ( Part -5) March 2017, pp.34-36
www.ijera.com DOI: 10.9790/9622- 0703053436 34 | P a g e
A Review on Marathi Language Speech Database Development
for Automatic Speech Recognition (ASR) System.
Mrs. Chhaya S. Patil1
, Prof.Dr.Vaishali B.Patil2
,
1
Assistant Professor, R.C.P.E.T.’s Institute of Management Research & Development, Shirpur.
2
Associate Professor, R.C.P.E.T.’s Institute of Management Research & Development, Shirpur.
ABSTRACT
Speech is a natural means of communication between humans. Human being tried to develop computer that can
understand & talk like human. Digital content can research to the masses & facilitate the exchange of
information across peoples speaking different language in form of natural interface provided by language
technologies. Developing certain recognition system standard, speech database is a prerequisite. There is lot of
scope to develop automatic speech recognition (ASR) system using Indian languages which are of different
variations. This paper present review on speech database developed for Marathi language.
Keywords: Automatic speech Recognition, corpus, speech database, Isolated Words, Marathi
I. INTROUDUCTION
Information in digital world is accessible
to a few who can read or understand a particular
language. India has about 1652 dialects/ native
languages, in such multi-lingual society language
technologies play a crucial role. Speech
recognition, machine translation & speech
synthesis system could facilitate the exchange of
information between two people speaking two
different languages [1] The paper gives the review
on speech database developed in Marathi language.
This review would be helpful for researches that
are willing working in domain of automatic speech
recognition (ASR) system & speech processing in
Marathi language & various dialects of it. The
paper is organized as follows: Section II describe
about Marathi language Section III describe the
various speech database developed in Marathi
language section IV gives conclusion & future
work followed by references.
II. MARATHI LANGUAGE
Indian constitution recognizes 23
languages Marathi is one of them. Marathi is
written in Devanagri script. These languages have
been derived from Sanskrit which is also written in
Devanagri script. Standard Marathi is as the official
language in 35 different districts of Maharashtra
state. Standard Marathi & Warahadi Marathi is
major dialect of Marathi. Academics & print media
use Standard Marathi dialects. 42 characters of
spoken Marathi were distinguished by the Indic
scholars [5]
III. MARATHI SPEECH DATABASE
IIIT - Hyderabad with co-ordination of HP
lab Bangalore developed speech database for
various Indian languages including Marathi. This
database developed for large vocabulary speech
Recognition systems. CIIL corpus of Marathi
language was used for text corpus collection [12].
The speech data was recorded over calculated
number of landline & cellular phones using a
multi-channel computer telephony interface card
.Four different cell phones were use to capture
different micro phonic variations. Following table 1
shows number of speakers were selected.
Table 1: Number of speaker
Language Landline Cell
phone
Total
Marathi 92 84 176
Table 2 & 3 Shows age wise & gender wise distribution of speaker
Table 3: Age wise Speaker distribution
Language 18-30 30-40 40-50 50-60 >60
Marathi 77 56 26 12 5
Language Male Female
Marathi 91 85
RESEARCH ARTICLE OPEN ACCESS
Mrs. Chhaya S. Patil. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 7, Issue 3, ( Part -5) March 2017, pp.34-36
www.ijera.com DOI: 10.9790/9622- 0703053436 35 | P a g e
52 sentences of the optimal text was recorded by
each speaker .The transcriptions were manually
edited collected data & ranked base on the
goodness of the speech recorded “ Good “ ,”with
channel distorting”,” with background noise” &
“useless” these were categories of utterances [1].
IIIT-Hyderabad also developed speech database for
major Indian languages, Marathi is one of them [6].
The purpose of developing the IIT-H Indic speech
databases was to have speech & text corpora made
available in public domain, without copyright
restrictions for non- commercial & commercial use
[6]. Text corpora were selected from Wikipedia
dump of Indian languages released in 2008. Native
speaker were selected for recording purpose .The
age of speakers were 20-30 years. 1000
phonetically balanced sentences were selected form
text corpus. A single wave file has 50 utterances
[6]
TIFR (Mumbai) & IIT Bombay together
developed a speech database for agriculture
purpose. This Project sanctioned by the technology
development for Indian languages (TDIL) for the
development of speech recognition system for
using cell phones & landline. TIFR Mumbai & IIT
Bombay collected the speech data using two
dedicated phone line. Two volunteers were
appointed for the development of database.
Volunteers visited the various districts of
Maharashtra & speech sample collected the by
calling thru the dedicated phone line. Speech
sample were collected from 1600 speakers
approximately. As recording was done using phone
line having narrow band & speech along with
background noise, to overcome this difficulty
Volunteers also used digital voice recorder. [2]
IIT-Khargpur also developed speech
database for text independent speaker identification
in ASR. Recording frequency was 22,500 Hz,
mono channel, 16 bit resolution & 10 utterances.
The recording was done in various environment
conditions like road home, office, slums, college,
train, hills, valleys, remote villages, research labs
&farms. The total numbers of speakers were 60
[13]. A noticeable works also done at Dr. B.A.M
university Aurangabad by department of computer
science & IT the speech database developed for
limited vocabulary continuous speech database &
two different isolated speech database for
agricultures & travel purpose of Aurangabad &
speech database of numbers in Marathi language.
Following table shows summarized information
about these speech databases.
Table 4: Gender wise Speaker distribution
The Isolated words & Sentences were
selected from various blogs, news articles, forums
& websites over the internet. The speech sample
was collected from four districts i.e. Aurangabad
Jalana, Beed & Osmanabad of Marathwada region
[5]. PRAAT software was used for recording
speech & speech sample captured using Sennheiser
headsets.
The speech database for Marathi isolated word
recognition was developed at Dr. B.A.M.
University supported by DST under fast track
scheme entitle is “Design & development of
Marathi speech interface system”.
The vocabulary size of the database consist of
1) Marathi Vowels : 105 samples
2) Isolated words starting with each vowel : 420
samples
3) Sentences : 175 sample
The total no of speaker was 35 out of
which 17 were Females & 18 were males having
age from 22 to 35 years. High quality of audio
recording achieved by noise free & echo free room
having size 10x10. The sampling frequency was
11025 Hz & the recording was done using
Computerized speech laboratory (CSL) using the
single Channel [4] Isolated Marathi words
Mrs. Chhaya S. Patil. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 7, Issue 3, ( Part -5) March 2017, pp.34-36
www.ijera.com DOI: 10.9790/9622- 0703053436 36 | P a g e
emotional speech database developed for robust
automatic speech recognition (ASR) system & for
robotics. It also developed at Dr B.A.M University
supported by UGC as Major research project. Basic
& most commonly observed emotional like happy,
sad & angry were selected. Speech sample
collected from 25 male & 25 female speakers
having age group of 21 to 40 .The total no of words
were 24 with 3 utterances each. The total size of
utterance was 3600. In normal environment data
was recorded .PRAAT is used for recording &
speech in captured using Sennheiser headset. The
sampling rate was 1600 Hz & five is in 16 bit more
audio format [11].
Speech database of medicinal plants
names was developed by Dr. B.A.M University
supported by UGC for ASR of isolated Marathi
words .This database helpful in agriculture field as
well to medical student. Number of words was 100,
category of words were Medicinal plants which
growth in India, number of speaker were 100,
speakers age between 20 to 50 & Speaker from
Aurangabad region. Numbers of Utterances were 3,
total utterances were 30,000. Recording software
was PRAAT & Sampling frequency rate was16
kHz. File format was .wav .Recording done using
Sennheiser Headset [9]
IV. CONCLUSION
In this paper we have discussed some of
the speech databases developed in Marathi
language for various applications. After studying
the developed speech database we have been
motivated to developed isolated words speech
database for agriculture purpose in Ahirani dialect.
REFERENCES
[1]. Rohit kumar, S.P. Kishore, Anuman
chipalli Gopalkrishna, Rahul chitturi,
sachin joshi, satinder singh, R.N.V
sitaram, Development of Indian language
speech database for large vocabulary
speech recognition system proceedings of
International conference on speech &
computer (SPECOM) , Patras, Grees
oct 2005.
[2]. Pukhraj P. Shrishrimal, Ratnadep R.
Deshmukh, Vishal B.Waghmare “ Indian
Language speech database : Review “
Internal journal of computer application
(0925-888), volume 47-No. 5 june 2012
[3]. Yogesh K. Gedam, Paras V.Kothare,
Ratnadeep R. Deshmukh “Design &
development Speech database of Marathi
Numerals “Innternational journal of
Adanced Research In Maerch 2013
.Computer science s/w Engineering
volume 4 Issue-3
[4]. Bharati W. Gawali, santosh Gaikwad ,
Pravin annawar, suresh c, Mehrotra “
Marathi Isolated word recognition system
using MFCC & DTW Features “Proc of
Inc conf. on advances in computer science
2010
[5]. Pukhraj P shrishrimal , Ratnadeep R
Deshmukh Ganesh B. Janvave,
“Development of Marathi” Language
speech database from Marathwada
Region”
[6]. Kishore Prahllad, E naresh kumar,
Venketesh keri, S Rajendran, Alan W
Black “ The IIT-H indic speech
databases”.
[7]. Pooja v janse al, Design & development of
speech database for Travel purpose in
Marathi International journal of computer
science & mobile computing, vol3 issue
7,May 2014, pg.872-875.
[8]. [8] V.B.Waghmare, R.R Deshmukh , P.P
Shrishmal G.B. Janvale, “Development of
Isolated Marathi international journal of
computer applications. (0975-8887)
volume 94 no. 4 May 2014
[9]. Kishori R Ghule Ratnadeep R. Deshmukh,
“Automatic speech reconition of Marathi
isolated journal of computer science &
Information Technology vol.6 65), 2015
4296-4298.
[10]. P.P. Shrishrimal ,Ratnadep R. Deshmukh
,vishal B Loaghmane, “Marathi Isolated
words speech database for Agriculture
purpose”, International Journal of
Engineering Innovation & Research
volume-3 Issue 3 ,ISSN : 2277-5668
[11]. V.B Waghmare, R.R.Deshmukh, P.P.
Shrishrimal, G.B. Janvale “Development
of Isolated Marathi words Emotional
speech database”,Intemational journal of
Computer Application (0975-8887)
volume 94-No.4,May 2014
[12]. “http://tdil.mit.gov.in/corpora/ach-
corpora.htm#tech” Marathi CIIL corpus
[13]. https://www.ldc.upenn.edu/sites/www.ldc.
upenn.edu/files/agrawal2008.pdf

More Related Content

What's hot

ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITION
ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITIONISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITION
ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITIONijnlc
 
NAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATA
NAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATANAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATA
NAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATAijnlc
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...kevig
 
Summer Research Project (Anusaaraka) Report
Summer Research Project (Anusaaraka) ReportSummer Research Project (Anusaaraka) Report
Summer Research Project (Anusaaraka) ReportAnwar Jameel
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...ijnlc
 
Driving cycle development for Kuala Terengganu city using k-means method
Driving cycle development for Kuala Terengganu city using k-means methodDriving cycle development for Kuala Terengganu city using k-means method
Driving cycle development for Kuala Terengganu city using k-means methodIJECEIAES
 
An Optical Character Recognition for Handwritten Devanagari Script
An Optical Character Recognition for Handwritten Devanagari ScriptAn Optical Character Recognition for Handwritten Devanagari Script
An Optical Character Recognition for Handwritten Devanagari ScriptIJERA Editor
 
Design and Development of a Malayalam to English Translator- A Transfer Based...
Design and Development of a Malayalam to English Translator- A Transfer Based...Design and Development of a Malayalam to English Translator- A Transfer Based...
Design and Development of a Malayalam to English Translator- A Transfer Based...Waqas Tariq
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Marathi-English CLIR using detailed user query and unsupervised corpus-based WSD
Marathi-English CLIR using detailed user query and unsupervised corpus-based WSDMarathi-English CLIR using detailed user query and unsupervised corpus-based WSD
Marathi-English CLIR using detailed user query and unsupervised corpus-based WSDIJERA Editor
 
A language independent approach to develop urduir system
A language independent approach to develop urduir systemA language independent approach to develop urduir system
A language independent approach to develop urduir systemcsandit
 
A LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEM
A LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEMA LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEM
A LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEMcscpconf
 
SCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGES
SCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGESSCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGES
SCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGEScscpconf
 
A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models IJECEIAES
 
Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...
Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...
Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...CSCJournals
 

What's hot (18)

Cl35491494
Cl35491494Cl35491494
Cl35491494
 
ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITION
ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITIONISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITION
ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITION
 
NAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATA
NAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATANAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATA
NAMED ENTITY RECOGNITION FROM BENGALI NEWSPAPER DATA
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
A SURVEY ON VARIOUS CLIR TECHNIQUES
A SURVEY ON VARIOUS CLIR TECHNIQUESA SURVEY ON VARIOUS CLIR TECHNIQUES
A SURVEY ON VARIOUS CLIR TECHNIQUES
 
Summer Research Project (Anusaaraka) Report
Summer Research Project (Anusaaraka) ReportSummer Research Project (Anusaaraka) Report
Summer Research Project (Anusaaraka) Report
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
Driving cycle development for Kuala Terengganu city using k-means method
Driving cycle development for Kuala Terengganu city using k-means methodDriving cycle development for Kuala Terengganu city using k-means method
Driving cycle development for Kuala Terengganu city using k-means method
 
An Optical Character Recognition for Handwritten Devanagari Script
An Optical Character Recognition for Handwritten Devanagari ScriptAn Optical Character Recognition for Handwritten Devanagari Script
An Optical Character Recognition for Handwritten Devanagari Script
 
Design and Development of a Malayalam to English Translator- A Transfer Based...
Design and Development of a Malayalam to English Translator- A Transfer Based...Design and Development of a Malayalam to English Translator- A Transfer Based...
Design and Development of a Malayalam to English Translator- A Transfer Based...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Marathi-English CLIR using detailed user query and unsupervised corpus-based WSD
Marathi-English CLIR using detailed user query and unsupervised corpus-based WSDMarathi-English CLIR using detailed user query and unsupervised corpus-based WSD
Marathi-English CLIR using detailed user query and unsupervised corpus-based WSD
 
A language independent approach to develop urduir system
A language independent approach to develop urduir systemA language independent approach to develop urduir system
A language independent approach to develop urduir system
 
A LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEM
A LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEMA LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEM
A LANGUAGE INDEPENDENT APPROACH TO DEVELOP URDUIR SYSTEM
 
SCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGES
SCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGESSCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGES
SCRIPTS AND NUMERALS IDENTIFICATION FROM PRINTED MULTILINGUAL DOCUMENT IMAGES
 
G1803013542
G1803013542G1803013542
G1803013542
 
A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models
 
Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...
Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...
Robust Text Watermarking Technique for Authorship Protection of Hindi Languag...
 

Viewers also liked

A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...
A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...
A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...IJERA Editor
 
Navigation Tools and Equipment and How They Have Improved Aviation Safety
Navigation Tools and Equipment and How They Have Improved Aviation SafetyNavigation Tools and Equipment and How They Have Improved Aviation Safety
Navigation Tools and Equipment and How They Have Improved Aviation SafetyIJERA Editor
 
Snapping During Gas Welding
Snapping During Gas WeldingSnapping During Gas Welding
Snapping During Gas WeldingIJERA Editor
 
The Effects of Marine Simulators on Training
The Effects of Marine Simulators on TrainingThe Effects of Marine Simulators on Training
The Effects of Marine Simulators on TrainingIJERA Editor
 
Quantitative Review Techniques of Edge Detection Operators.
Quantitative Review Techniques of Edge Detection Operators.Quantitative Review Techniques of Edge Detection Operators.
Quantitative Review Techniques of Edge Detection Operators.IJERA Editor
 
Design of Secured Ground Vehicle Event Data Recorder for Data Analysis
Design of Secured Ground Vehicle Event Data Recorder for Data AnalysisDesign of Secured Ground Vehicle Event Data Recorder for Data Analysis
Design of Secured Ground Vehicle Event Data Recorder for Data AnalysisIJERA Editor
 
Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...
Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...
Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...IJERA Editor
 
Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...
Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...
Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...IJERA Editor
 
Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...
Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...
Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...IJERA Editor
 
Seismic Study of Building with Roof Top Telecommunication Towers
Seismic Study of Building with Roof Top Telecommunication TowersSeismic Study of Building with Roof Top Telecommunication Towers
Seismic Study of Building with Roof Top Telecommunication TowersIJERA Editor
 
Microstructural Analysis of the Influence of Ecap on the Pure Nb Rolling Plane
Microstructural Analysis of the Influence of Ecap on the Pure Nb Rolling PlaneMicrostructural Analysis of the Influence of Ecap on the Pure Nb Rolling Plane
Microstructural Analysis of the Influence of Ecap on the Pure Nb Rolling PlaneIJERA Editor
 
A Study on Project Planning Using the Deterministic and Probabilistic Models ...
A Study on Project Planning Using the Deterministic and Probabilistic Models ...A Study on Project Planning Using the Deterministic and Probabilistic Models ...
A Study on Project Planning Using the Deterministic and Probabilistic Models ...IJERA Editor
 
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...IJERA Editor
 
Enhancing the Efficiency of Solar Panel Using Cooling Systems
Enhancing the Efficiency of Solar Panel Using Cooling SystemsEnhancing the Efficiency of Solar Panel Using Cooling Systems
Enhancing the Efficiency of Solar Panel Using Cooling SystemsIJERA Editor
 
MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...
MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...
MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...IJERA Editor
 
Multi-Purpose Robot using Raspberry Pi & Controlled by Smartphone
Multi-Purpose Robot using Raspberry Pi & Controlled by SmartphoneMulti-Purpose Robot using Raspberry Pi & Controlled by Smartphone
Multi-Purpose Robot using Raspberry Pi & Controlled by SmartphoneIJERA Editor
 
Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...
Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...
Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...IJERA Editor
 
Projet Villard: Cuisine connectée 2025
Projet Villard: Cuisine connectée 2025Projet Villard: Cuisine connectée 2025
Projet Villard: Cuisine connectée 2025Digital Mis UPEM
 
OW revue de presse drones juillet 2016
OW revue de presse drones juillet 2016OW revue de presse drones juillet 2016
OW revue de presse drones juillet 2016Guillaume Thibault
 

Viewers also liked (20)

A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...
A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...
A Study on Atomic Spectroscopic Term Symbols for Nonequivalent Electrons of (...
 
Navigation Tools and Equipment and How They Have Improved Aviation Safety
Navigation Tools and Equipment and How They Have Improved Aviation SafetyNavigation Tools and Equipment and How They Have Improved Aviation Safety
Navigation Tools and Equipment and How They Have Improved Aviation Safety
 
Snapping During Gas Welding
Snapping During Gas WeldingSnapping During Gas Welding
Snapping During Gas Welding
 
The Effects of Marine Simulators on Training
The Effects of Marine Simulators on TrainingThe Effects of Marine Simulators on Training
The Effects of Marine Simulators on Training
 
Quantitative Review Techniques of Edge Detection Operators.
Quantitative Review Techniques of Edge Detection Operators.Quantitative Review Techniques of Edge Detection Operators.
Quantitative Review Techniques of Edge Detection Operators.
 
Design of Secured Ground Vehicle Event Data Recorder for Data Analysis
Design of Secured Ground Vehicle Event Data Recorder for Data AnalysisDesign of Secured Ground Vehicle Event Data Recorder for Data Analysis
Design of Secured Ground Vehicle Event Data Recorder for Data Analysis
 
Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...
Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...
Performance Evaluation of Self-Excited Cage and Cageless Three Phase Synchron...
 
Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...
Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...
Improved Thermal Performance of Solar Air Heater Using V-Rib with Symmetrical...
 
Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...
Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...
Data Security and Data Dissemination of Distributed Data in Wireless Sensor N...
 
Seismic Study of Building with Roof Top Telecommunication Towers
Seismic Study of Building with Roof Top Telecommunication TowersSeismic Study of Building with Roof Top Telecommunication Towers
Seismic Study of Building with Roof Top Telecommunication Towers
 
Microstructural Analysis of the Influence of Ecap on the Pure Nb Rolling Plane
Microstructural Analysis of the Influence of Ecap on the Pure Nb Rolling PlaneMicrostructural Analysis of the Influence of Ecap on the Pure Nb Rolling Plane
Microstructural Analysis of the Influence of Ecap on the Pure Nb Rolling Plane
 
A Study on Project Planning Using the Deterministic and Probabilistic Models ...
A Study on Project Planning Using the Deterministic and Probabilistic Models ...A Study on Project Planning Using the Deterministic and Probabilistic Models ...
A Study on Project Planning Using the Deterministic and Probabilistic Models ...
 
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
 
Enhancing the Efficiency of Solar Panel Using Cooling Systems
Enhancing the Efficiency of Solar Panel Using Cooling SystemsEnhancing the Efficiency of Solar Panel Using Cooling Systems
Enhancing the Efficiency of Solar Panel Using Cooling Systems
 
MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...
MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...
MHD Mixed Convection Flow from a Vertical Plate Embedded in a Saturated Porou...
 
Multi-Purpose Robot using Raspberry Pi & Controlled by Smartphone
Multi-Purpose Robot using Raspberry Pi & Controlled by SmartphoneMulti-Purpose Robot using Raspberry Pi & Controlled by Smartphone
Multi-Purpose Robot using Raspberry Pi & Controlled by Smartphone
 
Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...
Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...
Does a Hybrid Approach of Agile and Plan-Driven Methods Work Better for IT Sy...
 
Projet Villard: Cuisine connectée 2025
Projet Villard: Cuisine connectée 2025Projet Villard: Cuisine connectée 2025
Projet Villard: Cuisine connectée 2025
 
OW revue de presse drones juillet 2016
OW revue de presse drones juillet 2016OW revue de presse drones juillet 2016
OW revue de presse drones juillet 2016
 
5 realisation
5 realisation5 realisation
5 realisation
 

Similar to A Review on Marathi Language Speech Database Development for Automatic Speech Recognition (ASR) System.

DEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDI
DEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDIDEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDI
DEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDIijaia
 
Marathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language ProcessingMarathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language Processingiosrjce
 
Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...
Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...
Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...University of Southern Denmark
 
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) SynthesizerImplementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) SynthesizerIOSR Journals
 
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017kevig
 
Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...IJECEIAES
 
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKSTUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKSIJCI JOURNAL
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogrikevig
 
Top 10 cited articles in nlp
Top 10 cited articles in nlpTop 10 cited articles in nlp
Top 10 cited articles in nlpkevig
 
Hindi –tamil text translation
Hindi –tamil text translationHindi –tamil text translation
Hindi –tamil text translationVaibhav Agarwal
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
Survey of machine translation systems in india
Survey of machine translation systems in indiaSurvey of machine translation systems in india
Survey of machine translation systems in indiaijnlc
 
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKSTUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKSkevig
 
Tuning Dari Speech Classification Employing Deep Neural Networks
Tuning Dari Speech Classification Employing Deep Neural NetworksTuning Dari Speech Classification Employing Deep Neural Networks
Tuning Dari Speech Classification Employing Deep Neural Networkskevig
 
September 2021: Top10 Cited Articles in Natural Language Computing
September 2021: Top10 Cited Articles in Natural Language ComputingSeptember 2021: Top10 Cited Articles in Natural Language Computing
September 2021: Top10 Cited Articles in Natural Language Computingkevig
 
February 2024 - Top 10 cited articles.pdf
February 2024 - Top 10 cited articles.pdfFebruary 2024 - Top 10 cited articles.pdf
February 2024 - Top 10 cited articles.pdfkevig
 

Similar to A Review on Marathi Language Speech Database Development for Automatic Speech Recognition (ASR) System. (20)

DEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDI
DEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDIDEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDI
DEVELOPMENT OF PHONEME DOMINATED DATABASE FOR LIMITED DOMAIN T-T-S IN HINDI
 
Marathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language ProcessingMarathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language Processing
 
G1802024651
G1802024651G1802024651
G1802024651
 
Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...
Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...
Text To Speech Synthesis System For Marathi Language Using Concatenation Tech...
 
F017163443
F017163443F017163443
F017163443
 
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) SynthesizerImplementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
 
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017
 
Speech-Recognition.pptx
Speech-Recognition.pptxSpeech-Recognition.pptx
Speech-Recognition.pptx
 
Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...Creation of speech corpus for emotion analysis in Gujarati language and its e...
Creation of speech corpus for emotion analysis in Gujarati language and its e...
 
Cf32516518
Cf32516518Cf32516518
Cf32516518
 
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKSTUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
 
Top 10 cited articles in nlp
Top 10 cited articles in nlpTop 10 cited articles in nlp
Top 10 cited articles in nlp
 
Hindi –tamil text translation
Hindi –tamil text translationHindi –tamil text translation
Hindi –tamil text translation
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
Survey of machine translation systems in india
Survey of machine translation systems in indiaSurvey of machine translation systems in india
Survey of machine translation systems in india
 
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKSTUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
TUNING DARI SPEECH CLASSIFICATION EMPLOYING DEEP NEURAL NETWORKS
 
Tuning Dari Speech Classification Employing Deep Neural Networks
Tuning Dari Speech Classification Employing Deep Neural NetworksTuning Dari Speech Classification Employing Deep Neural Networks
Tuning Dari Speech Classification Employing Deep Neural Networks
 
September 2021: Top10 Cited Articles in Natural Language Computing
September 2021: Top10 Cited Articles in Natural Language ComputingSeptember 2021: Top10 Cited Articles in Natural Language Computing
September 2021: Top10 Cited Articles in Natural Language Computing
 
February 2024 - Top 10 cited articles.pdf
February 2024 - Top 10 cited articles.pdfFebruary 2024 - Top 10 cited articles.pdf
February 2024 - Top 10 cited articles.pdf
 

Recently uploaded

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998
 
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...soginsider
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxmaisarahman1
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxJuliansyahHarahap1
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadhamedmustafa094
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesMayuraD1
 
Air Compressor reciprocating single stage
Air Compressor reciprocating single stageAir Compressor reciprocating single stage
Air Compressor reciprocating single stageAbc194748
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARKOUSTAV SARKAR
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwaitjaanualu31
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdfKamal Acharya
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationBhangaleSonal
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersMairaAshraf6
 

Recently uploaded (20)

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
Air Compressor reciprocating single stage
Air Compressor reciprocating single stageAir Compressor reciprocating single stage
Air Compressor reciprocating single stage
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to Computers
 

A Review on Marathi Language Speech Database Development for Automatic Speech Recognition (ASR) System.

  • 1. Mrs. Chhaya S. Patil. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 7, Issue 3, ( Part -5) March 2017, pp.34-36 www.ijera.com DOI: 10.9790/9622- 0703053436 34 | P a g e A Review on Marathi Language Speech Database Development for Automatic Speech Recognition (ASR) System. Mrs. Chhaya S. Patil1 , Prof.Dr.Vaishali B.Patil2 , 1 Assistant Professor, R.C.P.E.T.’s Institute of Management Research & Development, Shirpur. 2 Associate Professor, R.C.P.E.T.’s Institute of Management Research & Development, Shirpur. ABSTRACT Speech is a natural means of communication between humans. Human being tried to develop computer that can understand & talk like human. Digital content can research to the masses & facilitate the exchange of information across peoples speaking different language in form of natural interface provided by language technologies. Developing certain recognition system standard, speech database is a prerequisite. There is lot of scope to develop automatic speech recognition (ASR) system using Indian languages which are of different variations. This paper present review on speech database developed for Marathi language. Keywords: Automatic speech Recognition, corpus, speech database, Isolated Words, Marathi I. INTROUDUCTION Information in digital world is accessible to a few who can read or understand a particular language. India has about 1652 dialects/ native languages, in such multi-lingual society language technologies play a crucial role. Speech recognition, machine translation & speech synthesis system could facilitate the exchange of information between two people speaking two different languages [1] The paper gives the review on speech database developed in Marathi language. This review would be helpful for researches that are willing working in domain of automatic speech recognition (ASR) system & speech processing in Marathi language & various dialects of it. The paper is organized as follows: Section II describe about Marathi language Section III describe the various speech database developed in Marathi language section IV gives conclusion & future work followed by references. II. MARATHI LANGUAGE Indian constitution recognizes 23 languages Marathi is one of them. Marathi is written in Devanagri script. These languages have been derived from Sanskrit which is also written in Devanagri script. Standard Marathi is as the official language in 35 different districts of Maharashtra state. Standard Marathi & Warahadi Marathi is major dialect of Marathi. Academics & print media use Standard Marathi dialects. 42 characters of spoken Marathi were distinguished by the Indic scholars [5] III. MARATHI SPEECH DATABASE IIIT - Hyderabad with co-ordination of HP lab Bangalore developed speech database for various Indian languages including Marathi. This database developed for large vocabulary speech Recognition systems. CIIL corpus of Marathi language was used for text corpus collection [12]. The speech data was recorded over calculated number of landline & cellular phones using a multi-channel computer telephony interface card .Four different cell phones were use to capture different micro phonic variations. Following table 1 shows number of speakers were selected. Table 1: Number of speaker Language Landline Cell phone Total Marathi 92 84 176 Table 2 & 3 Shows age wise & gender wise distribution of speaker Table 3: Age wise Speaker distribution Language 18-30 30-40 40-50 50-60 >60 Marathi 77 56 26 12 5 Language Male Female Marathi 91 85 RESEARCH ARTICLE OPEN ACCESS
  • 2. Mrs. Chhaya S. Patil. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 7, Issue 3, ( Part -5) March 2017, pp.34-36 www.ijera.com DOI: 10.9790/9622- 0703053436 35 | P a g e 52 sentences of the optimal text was recorded by each speaker .The transcriptions were manually edited collected data & ranked base on the goodness of the speech recorded “ Good “ ,”with channel distorting”,” with background noise” & “useless” these were categories of utterances [1]. IIIT-Hyderabad also developed speech database for major Indian languages, Marathi is one of them [6]. The purpose of developing the IIT-H Indic speech databases was to have speech & text corpora made available in public domain, without copyright restrictions for non- commercial & commercial use [6]. Text corpora were selected from Wikipedia dump of Indian languages released in 2008. Native speaker were selected for recording purpose .The age of speakers were 20-30 years. 1000 phonetically balanced sentences were selected form text corpus. A single wave file has 50 utterances [6] TIFR (Mumbai) & IIT Bombay together developed a speech database for agriculture purpose. This Project sanctioned by the technology development for Indian languages (TDIL) for the development of speech recognition system for using cell phones & landline. TIFR Mumbai & IIT Bombay collected the speech data using two dedicated phone line. Two volunteers were appointed for the development of database. Volunteers visited the various districts of Maharashtra & speech sample collected the by calling thru the dedicated phone line. Speech sample were collected from 1600 speakers approximately. As recording was done using phone line having narrow band & speech along with background noise, to overcome this difficulty Volunteers also used digital voice recorder. [2] IIT-Khargpur also developed speech database for text independent speaker identification in ASR. Recording frequency was 22,500 Hz, mono channel, 16 bit resolution & 10 utterances. The recording was done in various environment conditions like road home, office, slums, college, train, hills, valleys, remote villages, research labs &farms. The total numbers of speakers were 60 [13]. A noticeable works also done at Dr. B.A.M university Aurangabad by department of computer science & IT the speech database developed for limited vocabulary continuous speech database & two different isolated speech database for agricultures & travel purpose of Aurangabad & speech database of numbers in Marathi language. Following table shows summarized information about these speech databases. Table 4: Gender wise Speaker distribution The Isolated words & Sentences were selected from various blogs, news articles, forums & websites over the internet. The speech sample was collected from four districts i.e. Aurangabad Jalana, Beed & Osmanabad of Marathwada region [5]. PRAAT software was used for recording speech & speech sample captured using Sennheiser headsets. The speech database for Marathi isolated word recognition was developed at Dr. B.A.M. University supported by DST under fast track scheme entitle is “Design & development of Marathi speech interface system”. The vocabulary size of the database consist of 1) Marathi Vowels : 105 samples 2) Isolated words starting with each vowel : 420 samples 3) Sentences : 175 sample The total no of speaker was 35 out of which 17 were Females & 18 were males having age from 22 to 35 years. High quality of audio recording achieved by noise free & echo free room having size 10x10. The sampling frequency was 11025 Hz & the recording was done using Computerized speech laboratory (CSL) using the single Channel [4] Isolated Marathi words
  • 3. Mrs. Chhaya S. Patil. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 7, Issue 3, ( Part -5) March 2017, pp.34-36 www.ijera.com DOI: 10.9790/9622- 0703053436 36 | P a g e emotional speech database developed for robust automatic speech recognition (ASR) system & for robotics. It also developed at Dr B.A.M University supported by UGC as Major research project. Basic & most commonly observed emotional like happy, sad & angry were selected. Speech sample collected from 25 male & 25 female speakers having age group of 21 to 40 .The total no of words were 24 with 3 utterances each. The total size of utterance was 3600. In normal environment data was recorded .PRAAT is used for recording & speech in captured using Sennheiser headset. The sampling rate was 1600 Hz & five is in 16 bit more audio format [11]. Speech database of medicinal plants names was developed by Dr. B.A.M University supported by UGC for ASR of isolated Marathi words .This database helpful in agriculture field as well to medical student. Number of words was 100, category of words were Medicinal plants which growth in India, number of speaker were 100, speakers age between 20 to 50 & Speaker from Aurangabad region. Numbers of Utterances were 3, total utterances were 30,000. Recording software was PRAAT & Sampling frequency rate was16 kHz. File format was .wav .Recording done using Sennheiser Headset [9] IV. CONCLUSION In this paper we have discussed some of the speech databases developed in Marathi language for various applications. After studying the developed speech database we have been motivated to developed isolated words speech database for agriculture purpose in Ahirani dialect. REFERENCES [1]. Rohit kumar, S.P. Kishore, Anuman chipalli Gopalkrishna, Rahul chitturi, sachin joshi, satinder singh, R.N.V sitaram, Development of Indian language speech database for large vocabulary speech recognition system proceedings of International conference on speech & computer (SPECOM) , Patras, Grees oct 2005. [2]. Pukhraj P. Shrishrimal, Ratnadep R. Deshmukh, Vishal B.Waghmare “ Indian Language speech database : Review “ Internal journal of computer application (0925-888), volume 47-No. 5 june 2012 [3]. Yogesh K. Gedam, Paras V.Kothare, Ratnadeep R. Deshmukh “Design & development Speech database of Marathi Numerals “Innternational journal of Adanced Research In Maerch 2013 .Computer science s/w Engineering volume 4 Issue-3 [4]. Bharati W. Gawali, santosh Gaikwad , Pravin annawar, suresh c, Mehrotra “ Marathi Isolated word recognition system using MFCC & DTW Features “Proc of Inc conf. on advances in computer science 2010 [5]. Pukhraj P shrishrimal , Ratnadeep R Deshmukh Ganesh B. Janvave, “Development of Marathi” Language speech database from Marathwada Region” [6]. Kishore Prahllad, E naresh kumar, Venketesh keri, S Rajendran, Alan W Black “ The IIT-H indic speech databases”. [7]. Pooja v janse al, Design & development of speech database for Travel purpose in Marathi International journal of computer science & mobile computing, vol3 issue 7,May 2014, pg.872-875. [8]. [8] V.B.Waghmare, R.R Deshmukh , P.P Shrishmal G.B. Janvale, “Development of Isolated Marathi international journal of computer applications. (0975-8887) volume 94 no. 4 May 2014 [9]. Kishori R Ghule Ratnadeep R. Deshmukh, “Automatic speech reconition of Marathi isolated journal of computer science & Information Technology vol.6 65), 2015 4296-4298. [10]. P.P. Shrishrimal ,Ratnadep R. Deshmukh ,vishal B Loaghmane, “Marathi Isolated words speech database for Agriculture purpose”, International Journal of Engineering Innovation & Research volume-3 Issue 3 ,ISSN : 2277-5668 [11]. V.B Waghmare, R.R.Deshmukh, P.P. Shrishrimal, G.B. Janvale “Development of Isolated Marathi words Emotional speech database”,Intemational journal of Computer Application (0975-8887) volume 94-No.4,May 2014 [12]. “http://tdil.mit.gov.in/corpora/ach- corpora.htm#tech” Marathi CIIL corpus [13]. https://www.ldc.upenn.edu/sites/www.ldc. upenn.edu/files/agrawal2008.pdf