SlideShare a Scribd company logo
1 of 7
Download to read offline
International Journal of Trend in Scientific Research and Development (IJTSRD)
Volume 3 Issue 5, August 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470
@ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1904
Voice Command Mobile Phone Dialer
Aye Thida, Yee Wai Khaing
Artificial Intelligence Lab, University of Computer Studies, Mandalay, Myanmar
How to cite this paper: Aye Thida | Yee
Wai Khaing "Voice Command Mobile
Phone Dialer"
Published in
International
Journal of Trend in
Scientific Research
and Development
(ijtsrd), ISSN: 2456-
6470, Volume-3 |
Issue-5, August 2019, pp.1904-1909,
https://doi.org/10.31142/ijtsrd26814
Copyright © 2019 by author(s) and
International Journal ofTrend inScientific
Research and Development Journal. This
is an Open Access article distributed
under the terms of
the Creative
Commons Attribution
License (CC BY 4.0)
(http://creativecommons.org/licenses/by
/4.0)
ABSTRACT
Speech recognition is an advanced technology that uses desired equipment
and a service which can be controlled through voice without touching the
screen of the smart phone. In current century, there are manyresearches with
the help of speech recognition on mobile devices. In this system,mobilephone
users can command with their voice to easily make phone call. Google’s cloud
speech API is used to recognize the incoming user voice. The speech API
recognizes over 120 languages but it cannot correctly provide Myanmar
Language still now. The system will classify the Myanmar proper name
recognized by the Google's speech API to get the correct name with thehelpof
Naïve Bayesian Classifier. The contact name classified by NaiveBayes canonly
meet user’s desired one just written in English script and itcannotprovidethe
name written in Myanmar script. This system uses hybrid transliteration
approach to solve the contact name recorded by Myanmar script. Therefore
the system can make phone call to the contact name typed with not only
English script but also Myanmar script. The system applies Jaro-Winker
distance measure to outperform the accuracy of systemoutput.Successrateis
used to measure the performance of each process contained in the system.
This system is implemented with Android programming language.
KEYWORDS: speech recognigiotn; voice command; mobile phone dialer; styling;
I. INTRODUCTION
Speech is the easiest and most common way for people to
communicate. Speech is also faster than typing on a keypad
and more expressive than clicking on a menuitem.Therefore
speechapplications that recognize the human’snaturalvoice
arebecoming more popular in human life. From theresearch
point of view, there are many researches with the help of
speech recognition. Generally the meaning of speech
recognition is that it is an advanced technology that
translates spoken words into text. Speech recognition is one
of the NLP’s major tasks. Some NLP’s tasks are information
retrieval, question and answering, machine translation,
speech segmentation, transliteration and so on. Nowadays
there are plenty of applications and areas where speech
recognition is used. Most mobileinternet devices aremaking
interesting use of speech recognition. The iPhone and
Android devices are examples of that. Examples of mobile
applications, which implement recognize user’s speech in
these smartphone devices, are Siri (only iPhone) and Google
Now (both iPhone and Android). In this system, it will
implement phone calling with speech on android mobile
devices with the help of Google’s speech recognition engine.
The speech APIcancorrectlyrecognizeEnglishspokenwords
but not Myanmar language words. For the speech input of
Myanmar words, it can only recognize English words which
pronunciation is similar to the input Myanmar word.
Classification andsimilarity methods are needed in order to
correctly recognize Myanmar spoken words.
There are different kinds of systems that use Google speech
recognition engine or Google Speech API. Some of these
systems are as following:
B. Raghavendhar Reddy, E. Mahender [2] proposed an
application for sending SMS messages which uses Google’s
speech recognition on. TheVoiceSMSapplicationallowsuser
to input spoken information and send voice message as
desired text message. The user is able to manipulate text
message fast and easy without using keyboard, reducing
spent time and effort. In this case speech recognition
provides alternative to standard use of key board for text
input, creating another dimension in modern
communications.
Sagarjit Dash [3] introduced voice detection capability in the
quiz application for Android platform smart mobile phones
with the help of Google Speech API. Herethedeviceproposed
is an interactive android smart phone, which is capable of
recognizing spoken words. He proposed to develop
interactive application which can run on the tablet or any
android based phone. The application helps the user to give
the answer of a question using voice and through voice user
can go to the next question. Users can command a mobile
device to do something via speech.Thesecommandsarethen
immediately executed.
Ms. Anuja Jadhav Prof. ArvindPatil[4]demonstratedandroid
speech to text converter for SMS application and also the
requirement of speech to text conversion system. This
converter based on evaluating voice versus keypad as a
means for entry and editing of texts. In other words, mobile
SMS users can say messages can be voice/speech typed. A
speech to text converter is developed to send SMS. It is found
that large-vocabulary speech recognition can offer a very
competitive alternative to traditional text entry. This SMS
IJTSRD26814
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1905
application also implements speech recognition process at
Google’s speech server.
Hae-Duck J. Jeong, Sang-Kug Ye, Jiyoung Lim, Ilsun You, and
WooSeok Hyun [5] had proposed a computer remote control
system using voice recognition technologies of mobile
devices and wireless communication technologies for the
blind and physically disabled population as assistive
technology. These people experience difficulty and
inconvenience using computers through a keyboard and/or
mouse. The purpose of this system is to provide a way that
the blind and physically disabled population can easily
control many functions of a computer via voice. The
configuration of the system consists of a mobile device such
as a smartphone, a PC server, and a Google server that are
connected to each other. Users can commandamobiledevice
to do something via voice; such aswriting emails, checking
the weather forecast, or managing a schedule. These
commands are then immediately executed.
The proposed system also provided blind people with a
function via TTS (Text to Voice) of the Google server if they
want to receive contents of a document stored inacomputer.
People are interested in mobile phones because they can
actually stay in touch wherever they are. Now multi-touch
surface have been widely used as the main input methods on
mobile devices. But for visually impaired, it is difficult to use
these methods.Moreovernowadayspeopleareverybusyand
so they do their works in a timely fashion. For example,atthe
time of driving, a user wants to phone a call in his or her
contact list and so he or she searches it in many contacts.
Consequently it makes wasting time and even unexpected
accidents may occur. Therefore a speech recognition
application for mobile device is being developed to avoid
harmful accidents and it can save user’s valuable time.
Moreover voicecontrolisaneffectiveandefficientalternative
non-visual interaction mode which does not require target
locating and pointing.
II. Speech Recogntion
Speech recognition is the ability of a machine or program to
identify words and phrases in spoken language and convert
them to a machine-readable format. Speech recognition,also
known as speech-to-text or automatic speech recognition
(ASR) has been studied for more than two decades and
recentlybeenusedinvariouscommercialproducts.Thereare
plenty of applications and areas where speech recognition is
used. This technology has been more popular and successful
in the following:
 Device control. For individualswithdisabilitiesandbirth
defects that leave them unable to use their hands, NLP
technology can be used to control a mobile device. For
example, just saying "OK Google" to an Android phone
fires up a system that is all ears to your voice commands.
 Car Bluetooth systems. Many cars are equipped with a
system that connects its radio mechanism to the
smartphone through Bluetooth. Phone callscanbemade
and received without touching the smartphone, and can
even dial numbers by just saying them.
 Voice transcription. In areas where peoplehavetotypea
lot, some intelligent software captures their spoken
words and transcribes them into text. This is current in
certain word processing software.
III. Architecture of the System
When the system starts, it transliterates Myanmar contact
name that is recorded by Myanmar script in the contact list
to English script. And the system recognizes user speech as
an input by using Google’s Cloud Speech API. The output
English text of Google Speech API is classified by using
training data with the help of Naïve Bayesian Classifier to
meet user desired text (i.e. contact name). Then the system
calculates the similarity scores of the classified contact
name, the transliterated name and English contact name in
the contact list according to Jaro-Winkler similaritymethod.
And it compares the similarity scores of two strings (string1
is the classified name and string2 is the combination of
transliterated name and English contact name) and it
chooses the highest similarity score. If the highest similarity
score is greater than or equal to threshold value 8.4, the
system will make phone call to that contact name. The Fig 1
shows architecture of the system.
Fig.1. Architecture of the system
IV. Myanmar to English Transliteration
In this step, to get the contact names spelled with English
script the system transliterates the contact names written
with Myanmar script with the following steps as shown in
Figure 2.
 Normalization
 Syllable Segmentation
 Transliteration with Hybrid Approach
Fig.2. Process of Myanmar to English Transliteration
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1906
A. Text Normalization
Myanmar words can be classified into standard words (i.e.,
words with standard syllable structure)andirregularwords
(i.e., words with abbreviated characters or words written in
special traditional writing forms etc.). Since these irregular
words are written in different forms, it complicates the
syllabication process. To identifycorrectsyllableboundaries
in the given text, it must have standard syllable structure.
Therefore the irregular Myanmar words are needed to
normalize to outperform the syllable segmentation result.
Table 1 shows some examples for normalization ofirregular
words.
Table 1 Normalization of Myanmar Irregular Words
Irregular Word Normalization Output
တက္ကသိုလ တက်ကသိုလ်[te’ ka- tho]
အင်္ဂါ အင်ဂါ[in ga]
ယောက်ျား ယောက်ကျား[jau’ kya:]
ပြဿနာ ပြသ်သနာ[pja’ tha- na]
B. Syllable Segmentation
Myanmar is a syllabic script and syllable is a smallest unit in
Myanmar language. The combination of one or more
characters but not more than eight characters will become
one syllable; combination of one or more syllables becomes
one word. Syllable segmentation is the process of
determining the syllable boundaries in a sentence or a
document. In this system, syllable segmentation is done
based on a Myanmar word segmentation tool [6]. For
example – သန်တာချယ်ရီ  သန်[than] + တာ[da]+ချယ်[che] +
ရီ[ji].
C. Transliteration with Hybrid Approach
The system accepts the syllable segments and transliterates
them with the help of rules from BGN/PCGN 1970 + KNAB
modification 2002 and Transliteration: KNAB 2003
Agreement to accurate our measurement standard [6, 7].
Although these rules solve almost the transliteration
problems, they cannot solve in special case such as the
spelling of loan word “ချယ်ရီ[cherry]”. Therefore the system
applies direct mapping transliterationapproachtosolvethis
special case. The system uses a dictionaryfordirectmapping
transliteration approach. Table 2 and Table 3 describe
transliterated output of rule based approach and hybrid
approach respectively.
Table 2 Transliterated Output with Only Rule Based
Approach
Splitting
Word
သန်[than
]
တာ[da
]
ချယ်[che
]
ရီ[ji
]
Transliterate
d Word
than dar Chal yi
Table 3 Transliterated Outputs with Hybrid Approach
Splitting
Word
သန်[than] တာ[da] ချယ်[che] ရီ[ji]
Transliterated
Word
than dar Cherry
V. Training Data
In this system, training data is required to get user desired
contact name because speech API can only produce output
text that is most similar pronunciation of input Myanmar
proper name. According to Myanmar Orthography,thereare
1864 unique Myanmar syllables in which some syllables are
used to spell proper name but some are not (e.g. “ကိစ်[kei’]”,
“ဂိဇ်[gei’]”, “ဇောဋ်[zau’]”, etc. ).These syllables are trained
with the help of Google speech API in this system. Before
speech to text conversion of speech API,MyanmartoEnglish
transliteration for the above Myanmar syllables is required
because the training data, English script output of speech
API, is stored in the database. According to BGN/PCGN 1970
+ KNAB modification 2002 and Transliteration: KNAB 2003
Agreement for the Transliteration of Myanmar into English,
Myanmar syllables that are similar pronunciation, use the
same rules to transliterate English scripts. Table 4 shows
example of these transliteration rules.
Table 4 Example of Myanmar to English Transliteration
Myanmar Script Romanization
ဒ,ဓ, ဎ, ဍ D
ယ,ရ Y
ဗ,ဘ B
ကျမ်း, ကြမ်း Kyan
ညီ,ညီး ညင် Nyi
ဗန်,ဗမ်,ဗဏ် Ban
ယင်,ရင်,ယဉ်, ယာဉ် Yin
လတ်,လပ်,လာဘ် Lat
ဒန်,ဒံ,ဒဏ်,ဓမ် dan
Although there are 1864 unique Myanmar syllables, the
training dataset used in this system contains only 878
syllables in which each syllable is defined as one class label.
In other word, the dataset holds 878 unique class labels in
which each class label is trained 5 times. After each training
time for one syllable, the number and value of resulted
training data are different according to the accent of input
speech. Therefore there are totally 13,748 training data in
this system. For example “kyaw” class label (syllable) is
trained withGoogle’sspeechRecognitionAPIasthefollowing
Table 5.
Table 5 Training Data of Class Label “kyaw”
Output of Speech API Class Label
joel kyaw
jole kyaw
jo kyaw
jojo kyaw
joe kyaw
joh kyaw
joel kyaw
jo kyaw
jojo kyaw
joe kyaw
dole kyaw
jo kyaw
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1907
jojo kyaw
joel kyaw
joe kyaw
This system trains the syllables with unigram as described
above. The combinations of syllables frequently used in
Myanmar proper names can be trained with bigrams. The
Table 6 describes examples of training data of Myanmar
syllables with bigrams. If syllables are trained with bigrams,
contact name classified by Naïve Bayes and user’s desired
contact names are more similar than training with unigram.
The similarity values of unigram and bigrams arenot easyto
train all possible bigram combinations becausetherearetoo
many possible bigram combinations. But, this system is a
mobile phone based application. It stores trainingdata inthe
client and memory allocation is very important. Therefore
the system trains syllables with unigram and it uses
threshold value for smoothing of system output.
Table 6 Training Data of Bigrams for Class Label “su khin”
Output of Speech API Class Label
soup can sukhin
suitcase sukhin
soup kitchen sukhin
suki sukhin
soup king sukhin
soup can sukhin
suit cane sukhin
sukan sukhin
sukin sukhin
sue cain sukhin
soup can sukhin
soup kitchen sukhin
suit can sukhin
soupcon sukhin
soup canned sukhin
VI. Google’s Speech Recognition Engine or API
The main requirement of every speech to text conversion
system is a database which will compare pitches with
frequencies. If we develop the system which will convert the
speech into text that is for any user, it is very difficult job
because the frequency of a user is different from thatofother
user. If the system is global hence creating the speech
database for the mobile user is very much difficult because
comparison of sound frequency and pitch for millions and
billions mobile users is difficult again. To solve this issue of
speech recognition (ASR) systems on mobile devices, Google
Speech recognition engine or API which use a huge large
speech database regarding possible different pitches and
frequencies of a person's voice, is used to recognize user
speech in this system. The Cloud Speech-to-Text uses a
speech recognition engine that can understand one of a wide
variety of languages (e.g. English, Mandarin, Chinese,
Japanese and so on). This system uses English language
which is one of the languages supported by Google Speech
API. In some cases (e.g. Myanmar propername),althoughthe
API returns output text to the client, these texts are not
correct but similar pronunciation. The following Figure 2 is
an example for the output text of Google’s speech API.
Fig 2 Output of Google’s Speech API for Myanmar Proper
Name
VII. Classification with Naïve Bayesian Classifier
The Google’s speech API cannot correctly recognize
Myanmar proper name but can produce English word thatis
similar pronunciation of that input Myanmar proper name.
For example, if speech input is “yee/wai/khaing”, API
recognizes it as “sea/red/khine” this is not exact output. To
get the correct contact name, this system requires some
training to be done on the data input. Therefore Naïve
Bayesian Classifier will classify to get the correct name for
output name recognized with speech API by using the above
described training data. To calculate the probability of a
hypothesis(C) being true, given the prior knowledge(X),
Bayes’ Theorem is used as follows:
P(C|X)=(P(X|C)P(C))/(P(X))
The following Figure 3 is an example for classifying the
speech API output text by using Naïve Bayesian Classifier.
Fig. 3 Classified Text of Naïve Bayes Classifier
VIII. Making Phone Call
In this step, the system compares the similarity scores
calculated by Jaro-Winker method and chooses the highest
similarity score. If the highest similarity valueis greater than
or equal threshold value8.4,thesystem will make phone call
to the contact name. Decision of making phone call to the
contact name depends on threshold value. As a result the
system needs a good threshold value to make phone call to
the user desired contact name. Therefore the system was
tested with 463 contact names (311 non-nasalized contact
names plus 152 nasalized contact names) to get good
threshold value. These contact names are categorized into
nasalizedandnon-nasalizednamesbasedonMyanmarvowel
sounds. From threshold value 8.2 to 8.6 are set to know
which threshold value is optimized for this system as
described in Table 7. According to Table 7, if threshold value
8.5 is set, the success rate of correct phone call to non-
nasalized contact name is below80%,ifthresholdvalue8.3is
set, the success rate of correct phone call to non-nasalized
contact name is equal to that of correct phone call to non-
nasalized contact name when threshold value is 8.4, but the
success rates of incorrectly phone call to nasalized contact
name are significantly different. Therefore the system
specifies the good value of threshold value is 8.4.
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1908
Table 7 Success Rates of Each Threshold Value
IX. Evaluation of the System
This section reports the experimental results of Myanmar to English Transliteration, speech to text (STT) conversion of
Google’s speech API and making phone call processes.The performanceof each process appliedinthissystemis measuredwith
success rate. A success rate is one of many metrics used for measuring/quantifying usability and is the simplest accuracy
measurement method. To get the value of success rate for each process, the following formula is used.
A. Experimental Result of Myanmar to English Transliteration
Two types of Myanmar words (standard and irregular word) have been discussed in Section4.2.Sincethewordsare writtenin
different orthographic ways, it may cause problem to the transliteration system. Therefore this system has been tested with
348 distinct Myanmar proper names for transliteration process performance as shown in Table 8.
Table 8 Performance of Myanmar to English Transliteration
No Type of Word No. of Contact Name
No. of Correctly
Transliterated Name
% of Correct Translitern
for Each Word Type
1. Standard word 286 279 97%
2. Irregular word 62 57 91%
TOTAL 348 336 96%
B. Consequences of Google’s Speech API
Google’s speech API can recognize 120 languages and variants to support global user base. The Google’s speech API cannot
correctly recognize some languages such as Myanmar Language but it can produce output text which pronunciation is similar
to the input Myanmar word. Therefore the consonants in each consonant cluster have been analyzed to measure the
performance of Google’s Speech API. The accuracy of Google’s cloud speech API is shown in Table 9
Table 9 Accuracy of Google’s Speech API for Each Consonant Cluster
No Type of Consonant Cluster Usage Level % of Correct STT Conversion of API
1 0C20
Highest 87.5%
Medium 81.82%
Lowest 83.33%
2 0C2C3
Highest 81.82%
Lowest 66.67%
3 C1C20
Highest 66.67%
Lowest 50%
4 C1C2C3 25%
TOTAL 71.67%
C. Experimental Result of Overall System
The contact name consists of one or more syllables. The syllable may be only vowel or combination of vowel and consonant.
There are different types of consonants in Myanmar language already explained in Section 2.15. The system has been tested
with contact names based on Myanmar consonant types. The following Table 10 showstheperformanceofthesystemforeach
consonant type.
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1909
Table 10: Performance of the Overall System
No Type of Consot No. of Contact Name
No. of Making Phone
Call to Contact Name
% of Correctly Making
Phone Call
1 Nasal 548 366 66.78%
2 Stop 563 409 72.64%
3 Fricative 523 354 67.68%
4 Affricate 186 143 76.88%
5 Central Approximant 280 212 75.71%
6 Lateral Approximant 183 136 74.31%
According to the above Table 10, performance of the system especially decreases when a contactnamecontains eithernasalor
fricative consonant. Therefore the contact names are categorized intotwogroups:contactnamethatcontains nasal orfricative
consonants and contact name that does not contain both nasal and fricative consonants and the system was analyzed with
these two groups. The following Table 11 concludes the accuracy result of the system.
Table 11: Accuracy Result of the Overall System
No Type of Consonant
No. of Contact
Name
No. of Making Phone
Call to Contact Name
% of Correctly
Making Phone Call
1 With nasal or fricative 781 533 68.24%
2 Without nasal and fricative 119 108 90.75%
Total 900 641 71.22%
X. Result Discussion
The performance of Google’s speech API, transliteration
process and overall system have been analyzed. Firstly,
Myanmar to English transliteration process was tested with
348Myanmar proper names. It observed thatonly12proper
names out of 348 proper names are erroneous. Therefore it
received 96% of overall accuracy covering both types of
standard and irregular words. By doing error analysis, the
errors are caused by those words with different styles in
written and spoken format (e.g. “ပုသိမ်ကြီး [pa- thein gyi:]”,
“မေစံပယ်ညို [may sa- be nyo]”). Secondly, Google’s speech
API was tested with different kinds of Myanmar consonant
clusters. There are four consonant clusters (0C20, 0C2C3,
C1C20 and C1C2C3). The success rate of Google’s speech API
for Myanmar consonants is 71.67%. As a result, it is found
that the speech API cannot correctly recognize the
consonants which are not widely used such as “နွှ [nhwa.]”,
“လွှ [lhwa.]”, “မျှ [mjha.]” and so on. In this system, the
performance of making phone call process totally depends
on speech API. The Myanmar contact names contain one or
more syllables. Each syllable consists of atleastonevowel or
combination of consonant and vowel. The performance of
the overall system was analyzed with 900 testing contact
names. These contact names were categorizedintodifferent.
We conclude that the success rate values of these contact
names are from 66.78 % to 90.75% respectively. As a result,
it was examined that there are contact name recognition
errors when all syllables of the contact namearevowels (e.g.
“အေးအေးအောင်[ei: ei: aun]”, “အိအိ[ei ei]”), contact name
contains nasal or fricative consonants only (e.g.
“မြင့်မြတ်မွန်[mjin. mja’ mun]”, “ဆုဇော်ဇော် ဇင်[hsu. zo zo
zin]”) and combination of nasal andfricativeconsonants(e.g.
“ငုဝါစိုး [ngu. wa so:]”, “ညီဇေယျာလှိုင် [nyi zei ja lhain]”).
XI. Conclusion
Natural language processing techniques are becoming
widely popular scientific research areas as well as
Information Technology industry. Language technology
together with Information Technology can enhancethelives
of people with differentcapabilities.This systemimplements
voice command mobile phone dialer application. The
strength of the system is that it can make phone call to the
contact name written in either English script or Myanmar
scripts.
REFERENCES
[1] G. Eason, B. Noble, and I. N. Sneddon, “On certain
integrals of Lipschitz-Hankel typeinvolvingproducts of
Bessel functions,” Phil. Trans. Roy. Soc. London, vol.
A247, pp. 529-551, April 1955.
[2] B. R. Reddy, E. Mahender, “Speech to Text Conversion
using Android Platform” , International Journal of
Engineering Research and Applications (IJERA) ISSN:
2248-9622 www.ijera.com Vol. 3, Issue 1, January -
February 2013, pp.253-258.
[3] S. Dash, Master of Engineering , Department of
Computer Science & Engineering, Thapar University,
Patiala, Punjab, India,“Kuiz AppsUsingVoiceDetection
in Android Platform”, International Journal of
Computer Engineering and Applications,
[4] Ms. A. Jadhav, Prof. A. Patil, “Android Speech to Text
Converter for SMS Application”, IOSR Journal of
Engineering Mar. 2012, Vol. 2(3) pp: 420-423.
[5] H. D. J. Jeong, S. K.Ye, J. Lim, I. You, W. S. Hyun,
Department of Computer Software, Korean Bible
University Seoul, South Korea, “A Computer Remote
Control System Based on Speech Recognition
Technologies of Mobile Devices and Wireless
Communication Technologies”, IEEE Conference
Publication,2013,page no. 595-600.
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1910
[6] A. M. Mon, S. L. Phyue, M. M. Thein, S. S. Htay, T. T. Win,
"Analysis of Myanmar Word Boundary and
Segmentation by Using Statistical Approach",
International Conference on Advanced Computer
Theory and Engineering (ICACTE), Vol.5, Augest 2010.
[7] Romanization System for Burmese: BCG/PCGN 1970
Agreement, Office of the Superintendent, Government
Printing, Rangoon, Burma.
[8] T. H. Hlaing, Yoshiki MIKAMI, “Automatic
Syllabification of Myanmar Texts using Finite State
Transducer”, International Journal on Advances in ICT
for Emerging Regions, Volume 6, Number 2 2013.

More Related Content

What's hot

IRJET - Sign Language Recognition System
IRJET -  	  Sign Language Recognition SystemIRJET -  	  Sign Language Recognition System
IRJET - Sign Language Recognition SystemIRJET Journal
 
IRJET- Text Extraction from Text Based Image using Android
IRJET- Text Extraction from Text Based Image using AndroidIRJET- Text Extraction from Text Based Image using Android
IRJET- Text Extraction from Text Based Image using AndroidIRJET Journal
 
An Intelligent Career Counselling Bot A System for Counselling
An Intelligent Career Counselling Bot A System for CounsellingAn Intelligent Career Counselling Bot A System for Counselling
An Intelligent Career Counselling Bot A System for CounsellingIRJET Journal
 
IRJET - Voice based Email for Visually Challenged People
IRJET - Voice based Email for Visually Challenged PeopleIRJET - Voice based Email for Visually Challenged People
IRJET - Voice based Email for Visually Challenged PeopleIRJET Journal
 
Authenticated Access of ATM Cards
Authenticated Access of ATM CardsAuthenticated Access of ATM Cards
Authenticated Access of ATM Cardsijtsrd
 
Bershca: bringing chatbot into hotel industry in Indonesia
Bershca: bringing chatbot into hotel industry in IndonesiaBershca: bringing chatbot into hotel industry in Indonesia
Bershca: bringing chatbot into hotel industry in IndonesiaTELKOMNIKA JOURNAL
 
Chat-Bot for College Management System using A.I
Chat-Bot for College Management System using A.IChat-Bot for College Management System using A.I
Chat-Bot for College Management System using A.IIRJET Journal
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET Journal
 
Detailed Study on Natural Language Processing Services.
Detailed Study on Natural Language Processing Services.Detailed Study on Natural Language Processing Services.
Detailed Study on Natural Language Processing Services.IRJET Journal
 
Why artificial intelligence matters in i os app development
Why artificial intelligence matters in i os app developmentWhy artificial intelligence matters in i os app development
Why artificial intelligence matters in i os app developmentConcetto Labs
 
Predicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learningPredicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learningMirXahid1
 
VOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMVOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMNikhil Ravi
 
ACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for Development
ACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for DevelopmentACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for Development
ACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for DevelopmentElsa Friscira
 
IRJET - Number/Text Translate from Image
IRJET -  	  Number/Text Translate from ImageIRJET -  	  Number/Text Translate from Image
IRJET - Number/Text Translate from ImageIRJET Journal
 
User stories collection via interactive chatbot to support requirements gathe...
User stories collection via interactive chatbot to support requirements gathe...User stories collection via interactive chatbot to support requirements gathe...
User stories collection via interactive chatbot to support requirements gathe...TELKOMNIKA JOURNAL
 
Presentationgroup
PresentationgroupPresentationgroup
Presentationgrouplax8055
 
An Improved Explicit Profile Matching In Mobile Social Networks
An Improved Explicit Profile Matching In Mobile Social NetworksAn Improved Explicit Profile Matching In Mobile Social Networks
An Improved Explicit Profile Matching In Mobile Social NetworksIJERA Editor
 
Detection of cyber-bullying
Detection of cyber-bullying Detection of cyber-bullying
Detection of cyber-bullying Ziar Khan
 

What's hot (20)

IRJET - Sign Language Recognition System
IRJET -  	  Sign Language Recognition SystemIRJET -  	  Sign Language Recognition System
IRJET - Sign Language Recognition System
 
IRJET- Text Extraction from Text Based Image using Android
IRJET- Text Extraction from Text Based Image using AndroidIRJET- Text Extraction from Text Based Image using Android
IRJET- Text Extraction from Text Based Image using Android
 
An Intelligent Career Counselling Bot A System for Counselling
An Intelligent Career Counselling Bot A System for CounsellingAn Intelligent Career Counselling Bot A System for Counselling
An Intelligent Career Counselling Bot A System for Counselling
 
IRJET - Voice based Email for Visually Challenged People
IRJET - Voice based Email for Visually Challenged PeopleIRJET - Voice based Email for Visually Challenged People
IRJET - Voice based Email for Visually Challenged People
 
Authenticated Access of ATM Cards
Authenticated Access of ATM CardsAuthenticated Access of ATM Cards
Authenticated Access of ATM Cards
 
Bershca: bringing chatbot into hotel industry in Indonesia
Bershca: bringing chatbot into hotel industry in IndonesiaBershca: bringing chatbot into hotel industry in Indonesia
Bershca: bringing chatbot into hotel industry in Indonesia
 
Chat-Bot for College Management System using A.I
Chat-Bot for College Management System using A.IChat-Bot for College Management System using A.I
Chat-Bot for College Management System using A.I
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
 
Detailed Study on Natural Language Processing Services.
Detailed Study on Natural Language Processing Services.Detailed Study on Natural Language Processing Services.
Detailed Study on Natural Language Processing Services.
 
Voice browser
Voice browserVoice browser
Voice browser
 
Why artificial intelligence matters in i os app development
Why artificial intelligence matters in i os app developmentWhy artificial intelligence matters in i os app development
Why artificial intelligence matters in i os app development
 
Predicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learningPredicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learning
 
Application of qr 2-3-4
Application of qr 2-3-4Application of qr 2-3-4
Application of qr 2-3-4
 
VOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMVOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEM
 
ACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for Development
ACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for DevelopmentACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for Development
ACM DEV '12 Proceedings of the 2nd ACM Symposium on Computing for Development
 
IRJET - Number/Text Translate from Image
IRJET -  	  Number/Text Translate from ImageIRJET -  	  Number/Text Translate from Image
IRJET - Number/Text Translate from Image
 
User stories collection via interactive chatbot to support requirements gathe...
User stories collection via interactive chatbot to support requirements gathe...User stories collection via interactive chatbot to support requirements gathe...
User stories collection via interactive chatbot to support requirements gathe...
 
Presentationgroup
PresentationgroupPresentationgroup
Presentationgroup
 
An Improved Explicit Profile Matching In Mobile Social Networks
An Improved Explicit Profile Matching In Mobile Social NetworksAn Improved Explicit Profile Matching In Mobile Social Networks
An Improved Explicit Profile Matching In Mobile Social Networks
 
Detection of cyber-bullying
Detection of cyber-bullying Detection of cyber-bullying
Detection of cyber-bullying
 

Similar to Voice Command Mobile Phone Dialer

Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneIJERA Editor
 
Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...TELKOMNIKA JOURNAL
 
How does speech recognition AI work.pdf
How does speech recognition AI work.pdfHow does speech recognition AI work.pdf
How does speech recognition AI work.pdfCiente
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing SystemIRJET Journal
 
IRJET - Speech Recognition using Android
IRJET -  	  Speech Recognition using AndroidIRJET -  	  Speech Recognition using Android
IRJET - Speech Recognition using AndroidIRJET Journal
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOMsathiyaseelanm
 
Voice Assistant Using Python and AI
Voice Assistant Using Python and AIVoice Assistant Using Python and AI
Voice Assistant Using Python and AIIRJET Journal
 
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”IRJET Journal
 
Review On Speech Recognition using Deep Learning
Review On Speech Recognition using Deep LearningReview On Speech Recognition using Deep Learning
Review On Speech Recognition using Deep LearningIRJET Journal
 
Assistive Examination System for Visually Impaired
Assistive Examination System for Visually ImpairedAssistive Examination System for Visually Impaired
Assistive Examination System for Visually ImpairedEditor IJCATR
 
How to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdfHow to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdfgirijalakshmi2
 
Importance of Programming Language in Day to Day Life
Importance of Programming Language in Day to Day LifeImportance of Programming Language in Day to Day Life
Importance of Programming Language in Day to Day Lifeijtsrd
 
A Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningA Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningEmily Smith
 
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docx
 Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docxaryan532920
 
A SURVEY ON AI POWERED PERSONAL ASSISTANT
A SURVEY ON AI POWERED PERSONAL ASSISTANTA SURVEY ON AI POWERED PERSONAL ASSISTANT
A SURVEY ON AI POWERED PERSONAL ASSISTANTIRJET Journal
 

Similar to Voice Command Mobile Phone Dialer (20)

Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phone
 
D1803041822
D1803041822D1803041822
D1803041822
 
Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...
 
How does speech recognition AI work.pdf
How does speech recognition AI work.pdfHow does speech recognition AI work.pdf
How does speech recognition AI work.pdf
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
 
IRJET - Speech Recognition using Android
IRJET -  	  Speech Recognition using AndroidIRJET -  	  Speech Recognition using Android
IRJET - Speech Recognition using Android
 
Aj31253258
Aj31253258Aj31253258
Aj31253258
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOM
 
Voice Assistant Using Python and AI
Voice Assistant Using Python and AIVoice Assistant Using Python and AI
Voice Assistant Using Python and AI
 
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”
 
Review On Speech Recognition using Deep Learning
Review On Speech Recognition using Deep LearningReview On Speech Recognition using Deep Learning
Review On Speech Recognition using Deep Learning
 
Assistive Examination System for Visually Impaired
Assistive Examination System for Visually ImpairedAssistive Examination System for Visually Impaired
Assistive Examination System for Visually Impaired
 
Uses of speech recognition system
Uses of speech recognition systemUses of speech recognition system
Uses of speech recognition system
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 
How to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdfHow to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdf
 
Importance of Programming Language in Day to Day Life
Importance of Programming Language in Day to Day LifeImportance of Programming Language in Day to Day Life
Importance of Programming Language in Day to Day Life
 
voice browser
voice browservoice browser
voice browser
 
A Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningA Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine Learning
 
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docx
 Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docx
 
A SURVEY ON AI POWERED PERSONAL ASSISTANT
A SURVEY ON AI POWERED PERSONAL ASSISTANTA SURVEY ON AI POWERED PERSONAL ASSISTANT
A SURVEY ON AI POWERED PERSONAL ASSISTANT
 

More from ijtsrd

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementationijtsrd
 
Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...ijtsrd
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospectsijtsrd
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...ijtsrd
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...ijtsrd
 
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...ijtsrd
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Studyijtsrd
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...ijtsrd
 
The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...ijtsrd
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...ijtsrd
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...ijtsrd
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...ijtsrd
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadikuijtsrd
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...ijtsrd
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...ijtsrd
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Mapijtsrd
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Societyijtsrd
 
Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...ijtsrd
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...ijtsrd
 
Streamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine LearningStreamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine Learningijtsrd
 

More from ijtsrd (20)

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation
 
Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
 
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Study
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
 
The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Map
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Society
 
Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
 
Streamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine LearningStreamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine Learning
 

Recently uploaded

Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 

Recently uploaded (20)

Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 

Voice Command Mobile Phone Dialer

  • 1. International Journal of Trend in Scientific Research and Development (IJTSRD) Volume 3 Issue 5, August 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470 @ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1904 Voice Command Mobile Phone Dialer Aye Thida, Yee Wai Khaing Artificial Intelligence Lab, University of Computer Studies, Mandalay, Myanmar How to cite this paper: Aye Thida | Yee Wai Khaing "Voice Command Mobile Phone Dialer" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456- 6470, Volume-3 | Issue-5, August 2019, pp.1904-1909, https://doi.org/10.31142/ijtsrd26814 Copyright © 2019 by author(s) and International Journal ofTrend inScientific Research and Development Journal. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0) (http://creativecommons.org/licenses/by /4.0) ABSTRACT Speech recognition is an advanced technology that uses desired equipment and a service which can be controlled through voice without touching the screen of the smart phone. In current century, there are manyresearches with the help of speech recognition on mobile devices. In this system,mobilephone users can command with their voice to easily make phone call. Google’s cloud speech API is used to recognize the incoming user voice. The speech API recognizes over 120 languages but it cannot correctly provide Myanmar Language still now. The system will classify the Myanmar proper name recognized by the Google's speech API to get the correct name with thehelpof Naïve Bayesian Classifier. The contact name classified by NaiveBayes canonly meet user’s desired one just written in English script and itcannotprovidethe name written in Myanmar script. This system uses hybrid transliteration approach to solve the contact name recorded by Myanmar script. Therefore the system can make phone call to the contact name typed with not only English script but also Myanmar script. The system applies Jaro-Winker distance measure to outperform the accuracy of systemoutput.Successrateis used to measure the performance of each process contained in the system. This system is implemented with Android programming language. KEYWORDS: speech recognigiotn; voice command; mobile phone dialer; styling; I. INTRODUCTION Speech is the easiest and most common way for people to communicate. Speech is also faster than typing on a keypad and more expressive than clicking on a menuitem.Therefore speechapplications that recognize the human’snaturalvoice arebecoming more popular in human life. From theresearch point of view, there are many researches with the help of speech recognition. Generally the meaning of speech recognition is that it is an advanced technology that translates spoken words into text. Speech recognition is one of the NLP’s major tasks. Some NLP’s tasks are information retrieval, question and answering, machine translation, speech segmentation, transliteration and so on. Nowadays there are plenty of applications and areas where speech recognition is used. Most mobileinternet devices aremaking interesting use of speech recognition. The iPhone and Android devices are examples of that. Examples of mobile applications, which implement recognize user’s speech in these smartphone devices, are Siri (only iPhone) and Google Now (both iPhone and Android). In this system, it will implement phone calling with speech on android mobile devices with the help of Google’s speech recognition engine. The speech APIcancorrectlyrecognizeEnglishspokenwords but not Myanmar language words. For the speech input of Myanmar words, it can only recognize English words which pronunciation is similar to the input Myanmar word. Classification andsimilarity methods are needed in order to correctly recognize Myanmar spoken words. There are different kinds of systems that use Google speech recognition engine or Google Speech API. Some of these systems are as following: B. Raghavendhar Reddy, E. Mahender [2] proposed an application for sending SMS messages which uses Google’s speech recognition on. TheVoiceSMSapplicationallowsuser to input spoken information and send voice message as desired text message. The user is able to manipulate text message fast and easy without using keyboard, reducing spent time and effort. In this case speech recognition provides alternative to standard use of key board for text input, creating another dimension in modern communications. Sagarjit Dash [3] introduced voice detection capability in the quiz application for Android platform smart mobile phones with the help of Google Speech API. Herethedeviceproposed is an interactive android smart phone, which is capable of recognizing spoken words. He proposed to develop interactive application which can run on the tablet or any android based phone. The application helps the user to give the answer of a question using voice and through voice user can go to the next question. Users can command a mobile device to do something via speech.Thesecommandsarethen immediately executed. Ms. Anuja Jadhav Prof. ArvindPatil[4]demonstratedandroid speech to text converter for SMS application and also the requirement of speech to text conversion system. This converter based on evaluating voice versus keypad as a means for entry and editing of texts. In other words, mobile SMS users can say messages can be voice/speech typed. A speech to text converter is developed to send SMS. It is found that large-vocabulary speech recognition can offer a very competitive alternative to traditional text entry. This SMS IJTSRD26814
  • 2. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1905 application also implements speech recognition process at Google’s speech server. Hae-Duck J. Jeong, Sang-Kug Ye, Jiyoung Lim, Ilsun You, and WooSeok Hyun [5] had proposed a computer remote control system using voice recognition technologies of mobile devices and wireless communication technologies for the blind and physically disabled population as assistive technology. These people experience difficulty and inconvenience using computers through a keyboard and/or mouse. The purpose of this system is to provide a way that the blind and physically disabled population can easily control many functions of a computer via voice. The configuration of the system consists of a mobile device such as a smartphone, a PC server, and a Google server that are connected to each other. Users can commandamobiledevice to do something via voice; such aswriting emails, checking the weather forecast, or managing a schedule. These commands are then immediately executed. The proposed system also provided blind people with a function via TTS (Text to Voice) of the Google server if they want to receive contents of a document stored inacomputer. People are interested in mobile phones because they can actually stay in touch wherever they are. Now multi-touch surface have been widely used as the main input methods on mobile devices. But for visually impaired, it is difficult to use these methods.Moreovernowadayspeopleareverybusyand so they do their works in a timely fashion. For example,atthe time of driving, a user wants to phone a call in his or her contact list and so he or she searches it in many contacts. Consequently it makes wasting time and even unexpected accidents may occur. Therefore a speech recognition application for mobile device is being developed to avoid harmful accidents and it can save user’s valuable time. Moreover voicecontrolisaneffectiveandefficientalternative non-visual interaction mode which does not require target locating and pointing. II. Speech Recogntion Speech recognition is the ability of a machine or program to identify words and phrases in spoken language and convert them to a machine-readable format. Speech recognition,also known as speech-to-text or automatic speech recognition (ASR) has been studied for more than two decades and recentlybeenusedinvariouscommercialproducts.Thereare plenty of applications and areas where speech recognition is used. This technology has been more popular and successful in the following:  Device control. For individualswithdisabilitiesandbirth defects that leave them unable to use their hands, NLP technology can be used to control a mobile device. For example, just saying "OK Google" to an Android phone fires up a system that is all ears to your voice commands.  Car Bluetooth systems. Many cars are equipped with a system that connects its radio mechanism to the smartphone through Bluetooth. Phone callscanbemade and received without touching the smartphone, and can even dial numbers by just saying them.  Voice transcription. In areas where peoplehavetotypea lot, some intelligent software captures their spoken words and transcribes them into text. This is current in certain word processing software. III. Architecture of the System When the system starts, it transliterates Myanmar contact name that is recorded by Myanmar script in the contact list to English script. And the system recognizes user speech as an input by using Google’s Cloud Speech API. The output English text of Google Speech API is classified by using training data with the help of Naïve Bayesian Classifier to meet user desired text (i.e. contact name). Then the system calculates the similarity scores of the classified contact name, the transliterated name and English contact name in the contact list according to Jaro-Winkler similaritymethod. And it compares the similarity scores of two strings (string1 is the classified name and string2 is the combination of transliterated name and English contact name) and it chooses the highest similarity score. If the highest similarity score is greater than or equal to threshold value 8.4, the system will make phone call to that contact name. The Fig 1 shows architecture of the system. Fig.1. Architecture of the system IV. Myanmar to English Transliteration In this step, to get the contact names spelled with English script the system transliterates the contact names written with Myanmar script with the following steps as shown in Figure 2.  Normalization  Syllable Segmentation  Transliteration with Hybrid Approach Fig.2. Process of Myanmar to English Transliteration
  • 3. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1906 A. Text Normalization Myanmar words can be classified into standard words (i.e., words with standard syllable structure)andirregularwords (i.e., words with abbreviated characters or words written in special traditional writing forms etc.). Since these irregular words are written in different forms, it complicates the syllabication process. To identifycorrectsyllableboundaries in the given text, it must have standard syllable structure. Therefore the irregular Myanmar words are needed to normalize to outperform the syllable segmentation result. Table 1 shows some examples for normalization ofirregular words. Table 1 Normalization of Myanmar Irregular Words Irregular Word Normalization Output တက္ကသိုလ တက်ကသိုလ်[te’ ka- tho] အင်္ဂါ အင်ဂါ[in ga] ယောက်ျား ယောက်ကျား[jau’ kya:] ပြဿနာ ပြသ်သနာ[pja’ tha- na] B. Syllable Segmentation Myanmar is a syllabic script and syllable is a smallest unit in Myanmar language. The combination of one or more characters but not more than eight characters will become one syllable; combination of one or more syllables becomes one word. Syllable segmentation is the process of determining the syllable boundaries in a sentence or a document. In this system, syllable segmentation is done based on a Myanmar word segmentation tool [6]. For example – သန်တာချယ်ရီ  သန်[than] + တာ[da]+ချယ်[che] + ရီ[ji]. C. Transliteration with Hybrid Approach The system accepts the syllable segments and transliterates them with the help of rules from BGN/PCGN 1970 + KNAB modification 2002 and Transliteration: KNAB 2003 Agreement to accurate our measurement standard [6, 7]. Although these rules solve almost the transliteration problems, they cannot solve in special case such as the spelling of loan word “ချယ်ရီ[cherry]”. Therefore the system applies direct mapping transliterationapproachtosolvethis special case. The system uses a dictionaryfordirectmapping transliteration approach. Table 2 and Table 3 describe transliterated output of rule based approach and hybrid approach respectively. Table 2 Transliterated Output with Only Rule Based Approach Splitting Word သန်[than ] တာ[da ] ချယ်[che ] ရီ[ji ] Transliterate d Word than dar Chal yi Table 3 Transliterated Outputs with Hybrid Approach Splitting Word သန်[than] တာ[da] ချယ်[che] ရီ[ji] Transliterated Word than dar Cherry V. Training Data In this system, training data is required to get user desired contact name because speech API can only produce output text that is most similar pronunciation of input Myanmar proper name. According to Myanmar Orthography,thereare 1864 unique Myanmar syllables in which some syllables are used to spell proper name but some are not (e.g. “ကိစ်[kei’]”, “ဂိဇ်[gei’]”, “ဇောဋ်[zau’]”, etc. ).These syllables are trained with the help of Google speech API in this system. Before speech to text conversion of speech API,MyanmartoEnglish transliteration for the above Myanmar syllables is required because the training data, English script output of speech API, is stored in the database. According to BGN/PCGN 1970 + KNAB modification 2002 and Transliteration: KNAB 2003 Agreement for the Transliteration of Myanmar into English, Myanmar syllables that are similar pronunciation, use the same rules to transliterate English scripts. Table 4 shows example of these transliteration rules. Table 4 Example of Myanmar to English Transliteration Myanmar Script Romanization ဒ,ဓ, ဎ, ဍ D ယ,ရ Y ဗ,ဘ B ကျမ်း, ကြမ်း Kyan ညီ,ညီး ညင် Nyi ဗန်,ဗမ်,ဗဏ် Ban ယင်,ရင်,ယဉ်, ယာဉ် Yin လတ်,လပ်,လာဘ် Lat ဒန်,ဒံ,ဒဏ်,ဓမ် dan Although there are 1864 unique Myanmar syllables, the training dataset used in this system contains only 878 syllables in which each syllable is defined as one class label. In other word, the dataset holds 878 unique class labels in which each class label is trained 5 times. After each training time for one syllable, the number and value of resulted training data are different according to the accent of input speech. Therefore there are totally 13,748 training data in this system. For example “kyaw” class label (syllable) is trained withGoogle’sspeechRecognitionAPIasthefollowing Table 5. Table 5 Training Data of Class Label “kyaw” Output of Speech API Class Label joel kyaw jole kyaw jo kyaw jojo kyaw joe kyaw joh kyaw joel kyaw jo kyaw jojo kyaw joe kyaw dole kyaw jo kyaw
  • 4. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1907 jojo kyaw joel kyaw joe kyaw This system trains the syllables with unigram as described above. The combinations of syllables frequently used in Myanmar proper names can be trained with bigrams. The Table 6 describes examples of training data of Myanmar syllables with bigrams. If syllables are trained with bigrams, contact name classified by Naïve Bayes and user’s desired contact names are more similar than training with unigram. The similarity values of unigram and bigrams arenot easyto train all possible bigram combinations becausetherearetoo many possible bigram combinations. But, this system is a mobile phone based application. It stores trainingdata inthe client and memory allocation is very important. Therefore the system trains syllables with unigram and it uses threshold value for smoothing of system output. Table 6 Training Data of Bigrams for Class Label “su khin” Output of Speech API Class Label soup can sukhin suitcase sukhin soup kitchen sukhin suki sukhin soup king sukhin soup can sukhin suit cane sukhin sukan sukhin sukin sukhin sue cain sukhin soup can sukhin soup kitchen sukhin suit can sukhin soupcon sukhin soup canned sukhin VI. Google’s Speech Recognition Engine or API The main requirement of every speech to text conversion system is a database which will compare pitches with frequencies. If we develop the system which will convert the speech into text that is for any user, it is very difficult job because the frequency of a user is different from thatofother user. If the system is global hence creating the speech database for the mobile user is very much difficult because comparison of sound frequency and pitch for millions and billions mobile users is difficult again. To solve this issue of speech recognition (ASR) systems on mobile devices, Google Speech recognition engine or API which use a huge large speech database regarding possible different pitches and frequencies of a person's voice, is used to recognize user speech in this system. The Cloud Speech-to-Text uses a speech recognition engine that can understand one of a wide variety of languages (e.g. English, Mandarin, Chinese, Japanese and so on). This system uses English language which is one of the languages supported by Google Speech API. In some cases (e.g. Myanmar propername),althoughthe API returns output text to the client, these texts are not correct but similar pronunciation. The following Figure 2 is an example for the output text of Google’s speech API. Fig 2 Output of Google’s Speech API for Myanmar Proper Name VII. Classification with Naïve Bayesian Classifier The Google’s speech API cannot correctly recognize Myanmar proper name but can produce English word thatis similar pronunciation of that input Myanmar proper name. For example, if speech input is “yee/wai/khaing”, API recognizes it as “sea/red/khine” this is not exact output. To get the correct contact name, this system requires some training to be done on the data input. Therefore Naïve Bayesian Classifier will classify to get the correct name for output name recognized with speech API by using the above described training data. To calculate the probability of a hypothesis(C) being true, given the prior knowledge(X), Bayes’ Theorem is used as follows: P(C|X)=(P(X|C)P(C))/(P(X)) The following Figure 3 is an example for classifying the speech API output text by using Naïve Bayesian Classifier. Fig. 3 Classified Text of Naïve Bayes Classifier VIII. Making Phone Call In this step, the system compares the similarity scores calculated by Jaro-Winker method and chooses the highest similarity score. If the highest similarity valueis greater than or equal threshold value8.4,thesystem will make phone call to the contact name. Decision of making phone call to the contact name depends on threshold value. As a result the system needs a good threshold value to make phone call to the user desired contact name. Therefore the system was tested with 463 contact names (311 non-nasalized contact names plus 152 nasalized contact names) to get good threshold value. These contact names are categorized into nasalizedandnon-nasalizednamesbasedonMyanmarvowel sounds. From threshold value 8.2 to 8.6 are set to know which threshold value is optimized for this system as described in Table 7. According to Table 7, if threshold value 8.5 is set, the success rate of correct phone call to non- nasalized contact name is below80%,ifthresholdvalue8.3is set, the success rate of correct phone call to non-nasalized contact name is equal to that of correct phone call to non- nasalized contact name when threshold value is 8.4, but the success rates of incorrectly phone call to nasalized contact name are significantly different. Therefore the system specifies the good value of threshold value is 8.4.
  • 5. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1908 Table 7 Success Rates of Each Threshold Value IX. Evaluation of the System This section reports the experimental results of Myanmar to English Transliteration, speech to text (STT) conversion of Google’s speech API and making phone call processes.The performanceof each process appliedinthissystemis measuredwith success rate. A success rate is one of many metrics used for measuring/quantifying usability and is the simplest accuracy measurement method. To get the value of success rate for each process, the following formula is used. A. Experimental Result of Myanmar to English Transliteration Two types of Myanmar words (standard and irregular word) have been discussed in Section4.2.Sincethewordsare writtenin different orthographic ways, it may cause problem to the transliteration system. Therefore this system has been tested with 348 distinct Myanmar proper names for transliteration process performance as shown in Table 8. Table 8 Performance of Myanmar to English Transliteration No Type of Word No. of Contact Name No. of Correctly Transliterated Name % of Correct Translitern for Each Word Type 1. Standard word 286 279 97% 2. Irregular word 62 57 91% TOTAL 348 336 96% B. Consequences of Google’s Speech API Google’s speech API can recognize 120 languages and variants to support global user base. The Google’s speech API cannot correctly recognize some languages such as Myanmar Language but it can produce output text which pronunciation is similar to the input Myanmar word. Therefore the consonants in each consonant cluster have been analyzed to measure the performance of Google’s Speech API. The accuracy of Google’s cloud speech API is shown in Table 9 Table 9 Accuracy of Google’s Speech API for Each Consonant Cluster No Type of Consonant Cluster Usage Level % of Correct STT Conversion of API 1 0C20 Highest 87.5% Medium 81.82% Lowest 83.33% 2 0C2C3 Highest 81.82% Lowest 66.67% 3 C1C20 Highest 66.67% Lowest 50% 4 C1C2C3 25% TOTAL 71.67% C. Experimental Result of Overall System The contact name consists of one or more syllables. The syllable may be only vowel or combination of vowel and consonant. There are different types of consonants in Myanmar language already explained in Section 2.15. The system has been tested with contact names based on Myanmar consonant types. The following Table 10 showstheperformanceofthesystemforeach consonant type.
  • 6. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1909 Table 10: Performance of the Overall System No Type of Consot No. of Contact Name No. of Making Phone Call to Contact Name % of Correctly Making Phone Call 1 Nasal 548 366 66.78% 2 Stop 563 409 72.64% 3 Fricative 523 354 67.68% 4 Affricate 186 143 76.88% 5 Central Approximant 280 212 75.71% 6 Lateral Approximant 183 136 74.31% According to the above Table 10, performance of the system especially decreases when a contactnamecontains eithernasalor fricative consonant. Therefore the contact names are categorized intotwogroups:contactnamethatcontains nasal orfricative consonants and contact name that does not contain both nasal and fricative consonants and the system was analyzed with these two groups. The following Table 11 concludes the accuracy result of the system. Table 11: Accuracy Result of the Overall System No Type of Consonant No. of Contact Name No. of Making Phone Call to Contact Name % of Correctly Making Phone Call 1 With nasal or fricative 781 533 68.24% 2 Without nasal and fricative 119 108 90.75% Total 900 641 71.22% X. Result Discussion The performance of Google’s speech API, transliteration process and overall system have been analyzed. Firstly, Myanmar to English transliteration process was tested with 348Myanmar proper names. It observed thatonly12proper names out of 348 proper names are erroneous. Therefore it received 96% of overall accuracy covering both types of standard and irregular words. By doing error analysis, the errors are caused by those words with different styles in written and spoken format (e.g. “ပုသိမ်ကြီး [pa- thein gyi:]”, “မေစံပယ်ညို [may sa- be nyo]”). Secondly, Google’s speech API was tested with different kinds of Myanmar consonant clusters. There are four consonant clusters (0C20, 0C2C3, C1C20 and C1C2C3). The success rate of Google’s speech API for Myanmar consonants is 71.67%. As a result, it is found that the speech API cannot correctly recognize the consonants which are not widely used such as “နွှ [nhwa.]”, “လွှ [lhwa.]”, “မျှ [mjha.]” and so on. In this system, the performance of making phone call process totally depends on speech API. The Myanmar contact names contain one or more syllables. Each syllable consists of atleastonevowel or combination of consonant and vowel. The performance of the overall system was analyzed with 900 testing contact names. These contact names were categorizedintodifferent. We conclude that the success rate values of these contact names are from 66.78 % to 90.75% respectively. As a result, it was examined that there are contact name recognition errors when all syllables of the contact namearevowels (e.g. “အေးအေးအောင်[ei: ei: aun]”, “အိအိ[ei ei]”), contact name contains nasal or fricative consonants only (e.g. “မြင့်မြတ်မွန်[mjin. mja’ mun]”, “ဆုဇော်ဇော် ဇင်[hsu. zo zo zin]”) and combination of nasal andfricativeconsonants(e.g. “ငုဝါစိုး [ngu. wa so:]”, “ညီဇေယျာလှိုင် [nyi zei ja lhain]”). XI. Conclusion Natural language processing techniques are becoming widely popular scientific research areas as well as Information Technology industry. Language technology together with Information Technology can enhancethelives of people with differentcapabilities.This systemimplements voice command mobile phone dialer application. The strength of the system is that it can make phone call to the contact name written in either English script or Myanmar scripts. REFERENCES [1] G. Eason, B. Noble, and I. N. Sneddon, “On certain integrals of Lipschitz-Hankel typeinvolvingproducts of Bessel functions,” Phil. Trans. Roy. Soc. London, vol. A247, pp. 529-551, April 1955. [2] B. R. Reddy, E. Mahender, “Speech to Text Conversion using Android Platform” , International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 1, January - February 2013, pp.253-258. [3] S. Dash, Master of Engineering , Department of Computer Science & Engineering, Thapar University, Patiala, Punjab, India,“Kuiz AppsUsingVoiceDetection in Android Platform”, International Journal of Computer Engineering and Applications, [4] Ms. A. Jadhav, Prof. A. Patil, “Android Speech to Text Converter for SMS Application”, IOSR Journal of Engineering Mar. 2012, Vol. 2(3) pp: 420-423. [5] H. D. J. Jeong, S. K.Ye, J. Lim, I. You, W. S. Hyun, Department of Computer Software, Korean Bible University Seoul, South Korea, “A Computer Remote Control System Based on Speech Recognition Technologies of Mobile Devices and Wireless Communication Technologies”, IEEE Conference Publication,2013,page no. 595-600.
  • 7. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26814 | Volume – 3 | Issue – 5 | July - August 2019 Page 1910 [6] A. M. Mon, S. L. Phyue, M. M. Thein, S. S. Htay, T. T. Win, "Analysis of Myanmar Word Boundary and Segmentation by Using Statistical Approach", International Conference on Advanced Computer Theory and Engineering (ICACTE), Vol.5, Augest 2010. [7] Romanization System for Burmese: BCG/PCGN 1970 Agreement, Office of the Superintendent, Government Printing, Rangoon, Burma. [8] T. H. Hlaing, Yoshiki MIKAMI, “Automatic Syllabification of Myanmar Texts using Finite State Transducer”, International Journal on Advances in ICT for Emerging Regions, Volume 6, Number 2 2013.