SlideShare a Scribd company logo
1 of 8
Download to read offline
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE)
e-ISSN: 2278-2834,p- ISSN: 2278-8735.Volume 6, Issue 1 (May. - Jun. 2013), PP 11-18
www.iosrjournals.org
www.iosrjournals.org 11 | Page
Speech Recognized Automation System Using Speaker
Identification through Wireless Communication
S. R. Suralkar1
, Amol C.Wani2
, Prabhakar V. Mhadse3
1
(Electronics & Telecomm Engg Dept, SSBT’s COET, North Maharashtra University, Jalgaon, India)
2
(Electronics & Telecomm Engg Dept, SSBT’s COET, North Maharashtra University, Jalgaon, India)
3
(Electronics & Telecomm Engg Dept, SSBT’s COET, North Maharashtra University, Jalgaon, India)
Abstract : This paper discusses the methodology for a project named “Speech Recognized Automation System
using Speaker Identification through wireless communication”. This project gives the design of Automation
system using wireless communication and speaker recognition using Matlab code. Straightforward
programming interface of Matlab makes it an ideal tool for speech analysis in project. This automation system
is useful for home appliances as well as in industry. This paper discusses the overall design of a wireless
automation system which is built and implemented. The speech recognition centers on recognition of speech
commands stored in data base of Matlab and it is matched with incoming voice command of speaker. Mel
Frequency Cepstral Coefficient (MFCC) algorithm is used to recognize the speech of speaker and to extract
features of speech. It uses low-power RF ZigBee transceiver wireless communication modules which are
relatively cheap. This automation system is intended to control lights, fans and other electrical appliances in a
home or office using speech commands like Light, Fan etc. Further, if security is not big issue then Speech
processor is used to control the appliances without speaker identification.
Keywords — Automation system, MATLAB code, MFCC, speaker identification, ZigBee transceiver.
I. Introduction
Development of automation systems using speaker identification began in the 1960s with exploration
into speech analysis using text matching, where characteristics of an individual's voice were thought to be able
to characterize the uniqueness of an individual much like a fingerprint. The early systems had many flaws and
research ensued to derive a more reliable method of predicting the correlation between two sets of speech
utterances. The home evolutionary developments time from the era in which man became sedentary to stop
living inside caves and start building their homes. These evolutionary trends of homes automation are focused
on several main issues such as security, culture, leisure, comfort, energy savings, management and economic
activities. Over the years much work has been done in the domain of automatic speech recognition for
automation systems. The progress made is significant for small, medium and even large vocabulary systems.
Speaker Identification based automation is one of the major growing industries that can change the way people
live with security in operation. The aim of such Automation System is to provide those with special needs with a
system that can respond to speech commands and control the on/off status of electrical devices, such as lamps,
fans, television etc, in the home and office wirelessly. The system should be reasonably cheap and easy to install
in any environment.
Generally, human voice conveys much information such as gender, emotion and identity of the
speaker. The objective of voice recognition is to determine which speaker is present based on the individual‟s
utterance [3]. Several techniques have been proposed for reducing the mismatch between the testing and training
environments. Many of these methods operate either in spectral, or in cepstral domain. Firstly, human voice is
converted into digital signal form to produce digital data representing each level of signal at every discrete time
step like zero crossing. The digitized speech samples are then processed using MFCC to produce voice features.
The performance of the speech recognition system is given in terms of error rate as measured for a specified
technology. Vocal communication between person and computer includes synthesis of speech which is in
Matlab database and matching of speech features with incoming speech commands. Speaker recognition
involves speaker identification to output the identity of speaker among the population as switching on/off of the
electrical equipment.
II. System Overview
Fig.1 show overall system confined with software and hardware with wireless units. A speaker
identification based automation system is integrated system including separate transmitter and receiver module
connected wirelessly and MATLAB software coding using MFCC algorithm to extract speech features for
Speaker Identification.
Speech Recognized Automation System Using Speaker Identification through Wireless
www.iosrjournals.org 12 | Page
Fig.1: Functional Block Diagram of Automation system for Speaker Identification
This project aims to develop an automated system capable of controlling operation of various devices
using voice command, so that respective user can handle, simply, such devices with authentication. Fig. 2 and
Fig. 3 shows the architecture used for transmitter and receiver module, for speech interpretation an
environmental microphone and communication interface, which captures and digitize voice more clearly in
Matlab database and in speech processor. Microcontroller unit is used to control every operation of speech
processor and speech commands coming from Matlab simulator.
Fig. 2: Transmitter section.
Speaker identification based automation system is an integrated system that facilitates user with an
easy-to-use home automation system that can be fully operated based on speech commands. This system is
handled by particular user only, and hence more secure system. The system is constructed in a way that is easy
to install, configure, run, and maintain. The functional blocks of the transmitter module is shown in Fig. 2.
Fig. 3 shows receiver section which converts digital signal from speech processor and Matlab simulator
to analog signal to drive appliances using microcontroller unit.
Fig. 3: Receiver section.
Speech Recognized Automation System Using Speaker Identification through Wireless
www.iosrjournals.org 13 | Page
We have selected the most appropriate speech recognition method using Processor HM2007 which will
provide instructions to activate the systems actuators. In this case the actuators will be switches or motors which
control lighting, heating and other features inside the home and office. Here, speech signal is fed to speech
processor for speech analysis and same signal is applied to MATLAB using speech detect and speech write
commands speech is saved in Matlab database. After the above we have selected the signal bus and
communication protocol between computer and actuators. The most common are: dedicated wiring, twisted pair,
ZigBee, coaxial cable, power supply installed.
III. Speech Recognition
3.1 Speech Recognition Algorithm
A speech analysis is done after taking an input through microphone from a user. The design of the
system involves processing of the input audio signal. At different levels, different operations are performed on
the input signal such as Pre-emphasis, Framing, Windowing, Mel Cepstrum analysis and Recognition
(Matching) of the spoken word [3]. The voice algorithms consist of two distinguished phases. The first one is
training sessions, where recording of speech signal takes place associated with the application or uniqueness,
while, the second one is referred to as operation session or testing phase, where particular users speech
commands matching is tested, as described in Fig. 4.
Fig. 4: Speech Recognition Algorithm.
3.2 MFCC Algorithm
The extraction of the best parametric representation of speech signals is an important task to produce a
better recognition performance. The efficiency of this phase is important for the next phase since it affects its
behavior. MFCC is based on human hearing perceptions which cannot perceive frequency over 1Khz. MFCC is
based on known variation of the human ear‟s critical bandwidth with frequency. MFCC has two types of filter
which are spaced linearly at low frequency below 1000 Hz and logarithmic spacing above 1000Hz. A subjective
pitch is present on Mel Frequency Scale to capture important characteristic of phonetic in speech [3]. It checks
zero crossing in speech signal.
The cepstrum is a common transform used to gain information from a person‟s speech signal. It can be
used to separate the excitation signal (which contains the words and the pitch) and the transfer function (which
contains the voice quality). It is the result of taking Fourier transform of decibel spectrum as if it were a signal.
We use cepstral analysis in speaker identification because the speech signal is of the particular form above, and
the "cepstral transform" of it makes analysis incredibly simple. Mathematically,
cepstrum of signal = FT[log{FT(the windowed signal)}].
MFCCs are commonly calculated by first taking the Fourier transform of a windowed function of a
signal and mapping the powers of the spectrum obtained onto the mel scale, using triangular overlapping
windows. Next the logs of the powers at each of the mel frequencies are taken, Direct Cosine Transform is
applied to it. The MFCCs are the amplitudes of the resulting spectrum.
Step-I: Pre–emphasis
This step processes the passing of signal through a filter which emphasizes higher frequencies. This
process will increase the energy of signal at higher frequency.
Step-II: Framing
The short time analysis is fundamental to speech analysis techniques. It is assumed that over a long
interval of time speech waveform is not stationary but over a sufficiently short time interval about 10-30msec, it
can be considered stationary. This is due to that fact that the rate at which the spectrums of the speech signal
changes is directly dependant on the rate of movement of the speech articulators. Most speech analysis systems
Speech Recognized Automation System Using Speaker Identification through Wireless
www.iosrjournals.org 14 | Page
operate at uniformly spaced time intervals or frames of typical duration 10-30 msec. In frame blocking, the
continuous speech signal is blocked into frames of N samples, with adjacent frames being separated by M (M <
N). The first frame consists of the first N samples and The second frame begins M samples after the first frame,
and overlaps it by N - M samples. Similarly, the third frame begins 2M samples after the first frame (or M
samples after the second frame) and overlaps it by N - 2M samples. This process continues until all the speech is
accounted for within one or more frames. Typical values for N and M are N = 256 (which is equivalent to ~
30msec windowing and facilitate the fast radix-2 FFT) and M = 100 [2].
Step-III: Windowing
The next step in the speech processing is to window each individual frame so as to minimize the signal
discontinuities at the beginning and end of each frame. When we perform Fourier Transform, it assumes that the
signal repeats, and the end of one frame does not connect smoothly with the beginning of the next one. This
adds some glitches at regular intervals. So we have to make the ends of each frame smooth enough to connect
with each other. This is possible by a processing called Windowing. This is done by Hamming windowing
method.
Fig. 5: Hamming Window.
Step-IV: Fast Fourier Transform
The Fast Fourier Transform is used to convert each frame of N samples from the time domain into the
frequency domain. The FFT is a fast algorithm to implement the Discrete Fourier Transform (DFT) which is
defined on the set of N samples {xn}, as follow:
Note that we use j here to denote the imaginary unit, i.e. j= √-1. In general Xn ‟s are complex numbers.
Step- V: Mel Filter Bank Processing
The range of frequencies in FFT is very wide and voice signal does not follow the linear scale. The
bank of filters according to Mel scale as shown in figure 6 is used [3] to smoothen scale.
Fig. 6: Mel scale filter bank.
The Mel for given frequency F in Hz:
F (mel) = [2595 * log10 [1+f] 700]
Step-VI: Discrete Cosine Transform
This is the process just reverse to FFT to convert the log Mel spectrum into time domain using Discrete
Cosine Transform (DCT). The result of the conversion is called Mel Frequency Cepstrum Coefficient.
Speech Recognized Automation System Using Speaker Identification through Wireless
www.iosrjournals.org 15 | Page
Step-VII: Delta Energy and Delta Spectrum
The levels of speech signal and the frames changes, therefore, there is a need to add features related to the
change in cepstral features over time. The energy in a frame for a signal X(t) in a window from time sample t1
to time sample t2, is represented at the equation below:
Energy= ∑X2
(t)
IV. Hardware Design
In this section we present the hardware description of the modules used for adaptation of speech
processor HM 2007 and wireless communication like Zigbee Module.
4.1. Speech Processor Module
The main element of the design consist of speech synthesizer/recognition ASIC and ZIGBEE
transceiver module. The speech recognition IC needs to be programmed for various speech commands. This is
done by using a MIC, analog speech signal from MIC are stored in the internal memory of the IC after being
digitized using ADC blocks which is internal process in processor. Once the learning/programming is
completed. The IC is ready to accept the commands. A command issued by the user through MIC will be
digitized and compared with the digitized commands already stored in the internal memory of IC. When a match
is received, microcontroller status will be updated accordingly. The microcontroller in turn will generate a
specific data which will be transmitted through RF channel using ZIGBEE transmitter.
4.2 Microcontroller Module
Fig. 7: Microcontroller Module.
Fig. 8: Microcontroller-89c52 Board
This module receives signal from speech processor and decision making regarding switching ON/OFF
appliance is made here as shown in figure 7. Either from speech processor or Matlab simulator output associated
with appliance like Light or fan is given to microcontroller. Output of microcontroller will toggle for incoming
signal from processors. Wireless module is connected to one of the input port of microcontroller and appliances
are connected to output port.
Speech Recognized Automation System Using Speaker Identification through Wireless
www.iosrjournals.org 16 | Page
4.3 Wireless (Zigbee) Module
Zigbee protocol is the communication protocol that is used in this system for wireless communication
as shown in figure 9. Zigbee offers 250 kbps as maximum baud rate. Range of wireless unit is 100meters, but
practically we get approximately about 30-40 meters.
Fig. 9: Zigbee Module.
V. Tests And Results
In this section, we need to train the system using speech commands, which are saved in database using
.wav format. Following command is used to record speech command for every appliance.
y = wavrecord (30*8000, 8000);
after recording command it is checked whether it is in given sample format using,
Speechdetect (y);
If recording is ok then it is saved in database using command,
Wavwrite (y, 8000, 'speaker.wav');
For example, for light appliance it is saved as,
Wavwrite (y, 8000, 'light.wav');
After recording all commands for various appliances, single database called MODEL file is built, which will
have all commands for specific speaker. This model file is built using following command in Matlab,
„trainscript‟.
Above steps are used to train and record the speech commands. Once commands are saved, our system is ready
to work. At the serial port we use Zigbee transmitter module, which will send on/off commands to appliance,
wirelessly. Using command „digitrecgui‟ speech commands from users are received. If there is match found
results will be shown on screen as shown in figure 10 and figure 11. Figure 10 shows matching for light
command for particular speaker, where as figure 11 shows match for fan.
Fig. 10: command Light matched results.
Speech Recognized Automation System Using Speaker Identification through Wireless
www.iosrjournals.org 17 | Page
Fig. 11: command Fan matched results.
For 10 different speakers our system is tested, where we found following satisfactory results of speaker
identification and appliances worked similarly, results are shown in Table 1.
Sr. No. SPEAKER LIGHT
(%)
FAN
(%)
1 Actual Speaker 80 85
2 Speaker 1 10 10
3 Speaker 2 0 0
4 Speaker 3 0 20
5 Speaker 4 10 0
6 Speaker 5 0 10
7 Speaker 6 20 10
8 Speaker 7 0 10
9 Speaker 8 10 0
10 Speaker 9 0 0
Table 1: Results of Speech recognized Automation system using speaker identification.
For actual speaker and other speakers, statistics is calculated for speech command matching.
VI. Conclusions And Future Work
A speech recognized automation system using speaker identification through wireless communication
is built and implemented. The prototype developed can control electrical devices in a home or office wirelessly.
The system implements Automatic Speech Recognition using speech processor and Speaker Identification
through MATLAB coding using MFCC algorithm. The system implements the wireless network using ZigBee
RF modules for their efficiency and low power consumption.
After analyzing all the information from our project, we found that our algorithm improves the speaker
recognition task. Because the MFCC uses the Mel scale, the approximation and maximum likelihood nature of
speech to the human voice behavior is good, it represent in a better way the voice. We improved the precision in
the recognition task. We found that as long as the number of speakers increases, the number of errors increases;
this is because the distance to each centroid of each speaker can be similar to another and maximum likelihood
of speech is checked by algorithm.
Future work will entail:
 Adding confirmation commands to the voice recognition system from appliance.
 Integrating appliance control functions to improve the system versatility such as providing control
commands other than ON/OFF commands such as slow down, dim, etc.
Acknowledgements
I, Prabhakar V. Mhadse acknowledge with thanks the guidance received from Prof. A. C. Wani and prof. S. R. Suralkar, Head
of the Department in preparation of this Project.
I would also give my sincere thanks to other staff of college for their warm support. The matter taken from the various sources in
preparing this report has been duly cited in reference.
Speech Recognized Automation System Using Speaker Identification through Wireless
www.iosrjournals.org 18 | Page
References
[1] Abhishek Thakur, Neeru Singla, “Design of Matlab-Based Automatic Speaker Recognition and Control System”, International
Journal Of Advanced Engineering Sciences And Technologies, Vol no. 8, issue no. 1.
[2] Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm, Ch.Srinivasa
Kumar et al. / International Journal on Computer Science and Engineering (IJCSE).
[3] Lindasalwa Muda, Mumtaj Begam and I. Elamvazuthi, “Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient
(MFCC) and Dynamic Time Warping (DTW) Techniques”, Journal Of Computing, Volume 2, Issue 3, March 2010, Issn 2151-9617.
[4] Rojas-Rodríguez Rafael, Aceves-Pérez Rita1, Cortés-Aburto Obed, García-Meneses Carlos1, Vela-Valdés Luis, “Development and
integration of automated systems aimed to the comfort of seniors and people with disabilities” International Conference on
Intelligent Environments, pp339-342,2012.
[5] Rabiner L. R. and Schafer R. W., Digital Processing of Speech Signals,New Jersey, US: Prentice Hall Inc, 1978.
[6] Augusto J. C. and Nugent C. D., “Smart homes can be smarter”, in Lecture Notes in Artificial Intelligence. Germany vol. 4008, pp
1-15, May 2006.
[7] “XBee-2.5-Manual,” ZigBee RF communication protocol. (2008).
[8] Algimantas R., Ratkevicius K. and Rudzionis V, “Voice interactive systems”, in The Engineering Handbook of Smart Technology
for Aging, Disability and Independence USA, pp 281-296, 2008.
[9] Vovos A. Kladis B.and Fakotakis N.D., "Speech operated smart-home control system for users with special needs", in Proc.
INTERSPEECH, pp.193-196, 2005.
[10] Sixmith A., “Modeling the well being of older people” in The Engineering Handbook of Smart Technology for Aging, Disability
and Independence USA, pp 569-584, 2008.
[11] Muñoz C., Arellan D, Perales F. J. and Fontaner G. “Perceptual and intelligent domotics system for disabled perople” in
proceedings of Sixth IASTED International Conference, USA pp 70-75 September 2011.
[12] Jorge MARTINEZ*, Hector PEREZ, Enrique ESCAMILLA; Masahisa Mabo SUZUKI; ‟‟Speaker recognition using Mel Frequency
Cepstral Coefficients (MFCC) and Vector Quantization (VQ) Techniques” IEEE 2012.
[13] Shannon B.J., Paliwal K.K., “A comparative study of filter bank spacing for speech recognition”,Proc. of Microelectronic engi-
neering research conference, Brisbane, Australia, Nov. 2003.
[14] T. Hill and P. Lewicki, Statistics Methods and Applications. Tulsa, OK: StatSoft Inc., 2006.
[15] S. Furui, Digital Speech Processing, Synthesis, and Recognition. New York: Marcel Dekker, 2001.
[16] Fezari, M.; Khati, A.-E, “New speech processor and ultrasonic sensors based embedded system to improve the control of a
motorized wheelchair,” Design and Test Workshop, 2008. IDT 2008.
[17] A. Spanias, T. Painter and V. Atti Audio Signal Processing and Coding. John Wiley & Sons, 2007.
[18] Mohd. Rihan, M. Salim Beg, Narayan Gehlot; “Developments in Home Automation and Networking”, Proc. National Conference
on “Computing for Nation Development”, Bharati Vidyapeeth Institute of Computer and Management, New Delhi, 23-24 Feb. ,
2007, pp. 61-64.

More Related Content

What's hot

IRJET - Threat Prediction using Speech Analysis
IRJET - Threat Prediction using Speech AnalysisIRJET - Threat Prediction using Speech Analysis
IRJET - Threat Prediction using Speech AnalysisIRJET Journal
 
Financial Transactions in ATM Machines using Speech Signals
Financial Transactions in ATM Machines using Speech SignalsFinancial Transactions in ATM Machines using Speech Signals
Financial Transactions in ATM Machines using Speech SignalsIJERA Editor
 
Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...Alexander Decker
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNIJCSEA Journal
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNIJCSEA Journal
 
Utterance based speaker identification
Utterance based speaker identificationUtterance based speaker identification
Utterance based speaker identificationIJCSEA Journal
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSijnlc
 
The proposed System for Indoor Location Tracking
The proposed System for Indoor Location TrackingThe proposed System for Indoor Location Tracking
The proposed System for Indoor Location TrackingEditor IJCATR
 
Silent call reason
Silent call reasonSilent call reason
Silent call reasonMehmet TANIS
 
Adaptive wavelet thresholding with robust hybrid features for text-independe...
Adaptive wavelet thresholding with robust hybrid features  for text-independe...Adaptive wavelet thresholding with robust hybrid features  for text-independe...
Adaptive wavelet thresholding with robust hybrid features for text-independe...IJECEIAES
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingiosrjce
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327IJMER
 
IRJET- Comparative Analysis of Emotion Recognition System
IRJET- Comparative Analysis of Emotion Recognition SystemIRJET- Comparative Analysis of Emotion Recognition System
IRJET- Comparative Analysis of Emotion Recognition SystemIRJET Journal
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationIDES Editor
 
Speaker Identification & Verification Using MFCC & SVM
Speaker Identification & Verification Using MFCC & SVMSpeaker Identification & Verification Using MFCC & SVM
Speaker Identification & Verification Using MFCC & SVMIRJET Journal
 
Speaker recognition in android
Speaker recognition in androidSpeaker recognition in android
Speaker recognition in androidAnshuli Mittal
 

What's hot (20)

Dy36749754
Dy36749754Dy36749754
Dy36749754
 
IRJET - Threat Prediction using Speech Analysis
IRJET - Threat Prediction using Speech AnalysisIRJET - Threat Prediction using Speech Analysis
IRJET - Threat Prediction using Speech Analysis
 
Financial Transactions in ATM Machines using Speech Signals
Financial Transactions in ATM Machines using Speech SignalsFinancial Transactions in ATM Machines using Speech Signals
Financial Transactions in ATM Machines using Speech Signals
 
Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANN
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANN
 
Utterance based speaker identification
Utterance based speaker identificationUtterance based speaker identification
Utterance based speaker identification
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
 
The proposed System for Indoor Location Tracking
The proposed System for Indoor Location TrackingThe proposed System for Indoor Location Tracking
The proposed System for Indoor Location Tracking
 
IRJET- Vocal Code
IRJET- Vocal CodeIRJET- Vocal Code
IRJET- Vocal Code
 
Silent call reason
Silent call reasonSilent call reason
Silent call reason
 
Adaptive wavelet thresholding with robust hybrid features for text-independe...
Adaptive wavelet thresholding with robust hybrid features  for text-independe...Adaptive wavelet thresholding with robust hybrid features  for text-independe...
Adaptive wavelet thresholding with robust hybrid features for text-independe...
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
 
IRJET- Comparative Analysis of Emotion Recognition System
IRJET- Comparative Analysis of Emotion Recognition SystemIRJET- Comparative Analysis of Emotion Recognition System
IRJET- Comparative Analysis of Emotion Recognition System
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and Validation
 
Speaker Identification & Verification Using MFCC & SVM
Speaker Identification & Verification Using MFCC & SVMSpeaker Identification & Verification Using MFCC & SVM
Speaker Identification & Verification Using MFCC & SVM
 
Speaker recognition in android
Speaker recognition in androidSpeaker recognition in android
Speaker recognition in android
 
Assignment 02
Assignment 02Assignment 02
Assignment 02
 
Participatory urbanism
Participatory urbanismParticipatory urbanism
Participatory urbanism
 

Viewers also liked

Viewers also liked (20)

Friction Stir Welding of steels: AReview Paper
Friction Stir Welding of steels: AReview PaperFriction Stir Welding of steels: AReview Paper
Friction Stir Welding of steels: AReview Paper
 
Influence of joint clearance on kinematic and dynamic parameters of mechanism
Influence of joint clearance on kinematic and dynamic parameters of mechanismInfluence of joint clearance on kinematic and dynamic parameters of mechanism
Influence of joint clearance on kinematic and dynamic parameters of mechanism
 
R01732105109
R01732105109R01732105109
R01732105109
 
C017152427
C017152427C017152427
C017152427
 
Q01821111115
Q01821111115Q01821111115
Q01821111115
 
K017655963
K017655963K017655963
K017655963
 
F017433947
F017433947F017433947
F017433947
 
U180304130141
U180304130141U180304130141
U180304130141
 
I010125361
I010125361I010125361
I010125361
 
G017653135
G017653135G017653135
G017653135
 
C010411218
C010411218C010411218
C010411218
 
I1302035763
I1302035763I1302035763
I1302035763
 
D012522433
D012522433D012522433
D012522433
 
E1303012733
E1303012733E1303012733
E1303012733
 
A012660105
A012660105A012660105
A012660105
 
D011112432
D011112432D011112432
D011112432
 
J012275461
J012275461J012275461
J012275461
 
Characteristics analysis of silicon carbide based 1-D Photonic crystal with 2...
Characteristics analysis of silicon carbide based 1-D Photonic crystal with 2...Characteristics analysis of silicon carbide based 1-D Photonic crystal with 2...
Characteristics analysis of silicon carbide based 1-D Photonic crystal with 2...
 
D010112031
D010112031D010112031
D010112031
 
G010343844
G010343844G010343844
G010343844
 

Similar to Speech Recognized Automation System Using Speaker Identification through Wireless Communication

Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approachijsrd.com
 
Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...IJECEIAES
 
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...IJCSEA Journal
 
A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...TELKOMNIKA JOURNAL
 
A Review On Speech Feature Techniques And Classification Techniques
A Review On Speech Feature Techniques And Classification TechniquesA Review On Speech Feature Techniques And Classification Techniques
A Review On Speech Feature Techniques And Classification TechniquesNicole Heredia
 
Intelligent hands free speech based sms
Intelligent hands free speech based smsIntelligent hands free speech based sms
Intelligent hands free speech based smsKamal Spring
 
Intelligent hands free speech based sms
Intelligent hands free speech based smsIntelligent hands free speech based sms
Intelligent hands free speech based smsKamal Spring
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
 
IRJET- Emotion recognition using Speech Signal: A Review
IRJET-  	  Emotion recognition using Speech Signal: A ReviewIRJET-  	  Emotion recognition using Speech Signal: A Review
IRJET- Emotion recognition using Speech Signal: A ReviewIRJET Journal
 
Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...
Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...
Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...IDES Editor
 
IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...
IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...
IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...IRJET Journal
 
AN EFFICIENT SPEECH RECOGNITION SYSTEM
AN EFFICIENT SPEECH RECOGNITION SYSTEMAN EFFICIENT SPEECH RECOGNITION SYSTEM
AN EFFICIENT SPEECH RECOGNITION SYSTEMcseij
 
ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION
ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION
ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION ijcsit
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueCSCJournals
 
Comparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: ReviewComparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: ReviewIJAEMSJORNAL
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From SpeechIJERA Editor
 
A Robust Speaker Identification System
A Robust Speaker Identification SystemA Robust Speaker Identification System
A Robust Speaker Identification Systemijtsrd
 

Similar to Speech Recognized Automation System Using Speaker Identification through Wireless Communication (20)

Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approach
 
Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...
 
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
 
A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...
 
A Review On Speech Feature Techniques And Classification Techniques
A Review On Speech Feature Techniques And Classification TechniquesA Review On Speech Feature Techniques And Classification Techniques
A Review On Speech Feature Techniques And Classification Techniques
 
Intelligent hands free speech based sms
Intelligent hands free speech based smsIntelligent hands free speech based sms
Intelligent hands free speech based sms
 
Intelligent hands free speech based sms
Intelligent hands free speech based smsIntelligent hands free speech based sms
Intelligent hands free speech based sms
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
IRJET- Emotion recognition using Speech Signal: A Review
IRJET-  	  Emotion recognition using Speech Signal: A ReviewIRJET-  	  Emotion recognition using Speech Signal: A Review
IRJET- Emotion recognition using Speech Signal: A Review
 
Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...
Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...
Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition S...
 
IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...
IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...
IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...
 
AN EFFICIENT SPEECH RECOGNITION SYSTEM
AN EFFICIENT SPEECH RECOGNITION SYSTEMAN EFFICIENT SPEECH RECOGNITION SYSTEM
AN EFFICIENT SPEECH RECOGNITION SYSTEM
 
H010625862
H010625862H010625862
H010625862
 
ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION
ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION
ADAPTIVE WATERMARKING TECHNIQUE FOR SPEECH SIGNAL AUTHENTICATION
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 
Comparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: ReviewComparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: Review
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From Speech
 
A Robust Speaker Identification System
A Robust Speaker Identification SystemA Robust Speaker Identification System
A Robust Speaker Identification System
 
Speaker Recognition Using Vocal Tract Features
Speaker Recognition Using Vocal Tract FeaturesSpeaker Recognition Using Vocal Tract Features
Speaker Recognition Using Vocal Tract Features
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacingjaychoudhary37
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 

Recently uploaded (20)

Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacing
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 

Speech Recognized Automation System Using Speaker Identification through Wireless Communication

  • 1. IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-ISSN: 2278-2834,p- ISSN: 2278-8735.Volume 6, Issue 1 (May. - Jun. 2013), PP 11-18 www.iosrjournals.org www.iosrjournals.org 11 | Page Speech Recognized Automation System Using Speaker Identification through Wireless Communication S. R. Suralkar1 , Amol C.Wani2 , Prabhakar V. Mhadse3 1 (Electronics & Telecomm Engg Dept, SSBT’s COET, North Maharashtra University, Jalgaon, India) 2 (Electronics & Telecomm Engg Dept, SSBT’s COET, North Maharashtra University, Jalgaon, India) 3 (Electronics & Telecomm Engg Dept, SSBT’s COET, North Maharashtra University, Jalgaon, India) Abstract : This paper discusses the methodology for a project named “Speech Recognized Automation System using Speaker Identification through wireless communication”. This project gives the design of Automation system using wireless communication and speaker recognition using Matlab code. Straightforward programming interface of Matlab makes it an ideal tool for speech analysis in project. This automation system is useful for home appliances as well as in industry. This paper discusses the overall design of a wireless automation system which is built and implemented. The speech recognition centers on recognition of speech commands stored in data base of Matlab and it is matched with incoming voice command of speaker. Mel Frequency Cepstral Coefficient (MFCC) algorithm is used to recognize the speech of speaker and to extract features of speech. It uses low-power RF ZigBee transceiver wireless communication modules which are relatively cheap. This automation system is intended to control lights, fans and other electrical appliances in a home or office using speech commands like Light, Fan etc. Further, if security is not big issue then Speech processor is used to control the appliances without speaker identification. Keywords — Automation system, MATLAB code, MFCC, speaker identification, ZigBee transceiver. I. Introduction Development of automation systems using speaker identification began in the 1960s with exploration into speech analysis using text matching, where characteristics of an individual's voice were thought to be able to characterize the uniqueness of an individual much like a fingerprint. The early systems had many flaws and research ensued to derive a more reliable method of predicting the correlation between two sets of speech utterances. The home evolutionary developments time from the era in which man became sedentary to stop living inside caves and start building their homes. These evolutionary trends of homes automation are focused on several main issues such as security, culture, leisure, comfort, energy savings, management and economic activities. Over the years much work has been done in the domain of automatic speech recognition for automation systems. The progress made is significant for small, medium and even large vocabulary systems. Speaker Identification based automation is one of the major growing industries that can change the way people live with security in operation. The aim of such Automation System is to provide those with special needs with a system that can respond to speech commands and control the on/off status of electrical devices, such as lamps, fans, television etc, in the home and office wirelessly. The system should be reasonably cheap and easy to install in any environment. Generally, human voice conveys much information such as gender, emotion and identity of the speaker. The objective of voice recognition is to determine which speaker is present based on the individual‟s utterance [3]. Several techniques have been proposed for reducing the mismatch between the testing and training environments. Many of these methods operate either in spectral, or in cepstral domain. Firstly, human voice is converted into digital signal form to produce digital data representing each level of signal at every discrete time step like zero crossing. The digitized speech samples are then processed using MFCC to produce voice features. The performance of the speech recognition system is given in terms of error rate as measured for a specified technology. Vocal communication between person and computer includes synthesis of speech which is in Matlab database and matching of speech features with incoming speech commands. Speaker recognition involves speaker identification to output the identity of speaker among the population as switching on/off of the electrical equipment. II. System Overview Fig.1 show overall system confined with software and hardware with wireless units. A speaker identification based automation system is integrated system including separate transmitter and receiver module connected wirelessly and MATLAB software coding using MFCC algorithm to extract speech features for Speaker Identification.
  • 2. Speech Recognized Automation System Using Speaker Identification through Wireless www.iosrjournals.org 12 | Page Fig.1: Functional Block Diagram of Automation system for Speaker Identification This project aims to develop an automated system capable of controlling operation of various devices using voice command, so that respective user can handle, simply, such devices with authentication. Fig. 2 and Fig. 3 shows the architecture used for transmitter and receiver module, for speech interpretation an environmental microphone and communication interface, which captures and digitize voice more clearly in Matlab database and in speech processor. Microcontroller unit is used to control every operation of speech processor and speech commands coming from Matlab simulator. Fig. 2: Transmitter section. Speaker identification based automation system is an integrated system that facilitates user with an easy-to-use home automation system that can be fully operated based on speech commands. This system is handled by particular user only, and hence more secure system. The system is constructed in a way that is easy to install, configure, run, and maintain. The functional blocks of the transmitter module is shown in Fig. 2. Fig. 3 shows receiver section which converts digital signal from speech processor and Matlab simulator to analog signal to drive appliances using microcontroller unit. Fig. 3: Receiver section.
  • 3. Speech Recognized Automation System Using Speaker Identification through Wireless www.iosrjournals.org 13 | Page We have selected the most appropriate speech recognition method using Processor HM2007 which will provide instructions to activate the systems actuators. In this case the actuators will be switches or motors which control lighting, heating and other features inside the home and office. Here, speech signal is fed to speech processor for speech analysis and same signal is applied to MATLAB using speech detect and speech write commands speech is saved in Matlab database. After the above we have selected the signal bus and communication protocol between computer and actuators. The most common are: dedicated wiring, twisted pair, ZigBee, coaxial cable, power supply installed. III. Speech Recognition 3.1 Speech Recognition Algorithm A speech analysis is done after taking an input through microphone from a user. The design of the system involves processing of the input audio signal. At different levels, different operations are performed on the input signal such as Pre-emphasis, Framing, Windowing, Mel Cepstrum analysis and Recognition (Matching) of the spoken word [3]. The voice algorithms consist of two distinguished phases. The first one is training sessions, where recording of speech signal takes place associated with the application or uniqueness, while, the second one is referred to as operation session or testing phase, where particular users speech commands matching is tested, as described in Fig. 4. Fig. 4: Speech Recognition Algorithm. 3.2 MFCC Algorithm The extraction of the best parametric representation of speech signals is an important task to produce a better recognition performance. The efficiency of this phase is important for the next phase since it affects its behavior. MFCC is based on human hearing perceptions which cannot perceive frequency over 1Khz. MFCC is based on known variation of the human ear‟s critical bandwidth with frequency. MFCC has two types of filter which are spaced linearly at low frequency below 1000 Hz and logarithmic spacing above 1000Hz. A subjective pitch is present on Mel Frequency Scale to capture important characteristic of phonetic in speech [3]. It checks zero crossing in speech signal. The cepstrum is a common transform used to gain information from a person‟s speech signal. It can be used to separate the excitation signal (which contains the words and the pitch) and the transfer function (which contains the voice quality). It is the result of taking Fourier transform of decibel spectrum as if it were a signal. We use cepstral analysis in speaker identification because the speech signal is of the particular form above, and the "cepstral transform" of it makes analysis incredibly simple. Mathematically, cepstrum of signal = FT[log{FT(the windowed signal)}]. MFCCs are commonly calculated by first taking the Fourier transform of a windowed function of a signal and mapping the powers of the spectrum obtained onto the mel scale, using triangular overlapping windows. Next the logs of the powers at each of the mel frequencies are taken, Direct Cosine Transform is applied to it. The MFCCs are the amplitudes of the resulting spectrum. Step-I: Pre–emphasis This step processes the passing of signal through a filter which emphasizes higher frequencies. This process will increase the energy of signal at higher frequency. Step-II: Framing The short time analysis is fundamental to speech analysis techniques. It is assumed that over a long interval of time speech waveform is not stationary but over a sufficiently short time interval about 10-30msec, it can be considered stationary. This is due to that fact that the rate at which the spectrums of the speech signal changes is directly dependant on the rate of movement of the speech articulators. Most speech analysis systems
  • 4. Speech Recognized Automation System Using Speaker Identification through Wireless www.iosrjournals.org 14 | Page operate at uniformly spaced time intervals or frames of typical duration 10-30 msec. In frame blocking, the continuous speech signal is blocked into frames of N samples, with adjacent frames being separated by M (M < N). The first frame consists of the first N samples and The second frame begins M samples after the first frame, and overlaps it by N - M samples. Similarly, the third frame begins 2M samples after the first frame (or M samples after the second frame) and overlaps it by N - 2M samples. This process continues until all the speech is accounted for within one or more frames. Typical values for N and M are N = 256 (which is equivalent to ~ 30msec windowing and facilitate the fast radix-2 FFT) and M = 100 [2]. Step-III: Windowing The next step in the speech processing is to window each individual frame so as to minimize the signal discontinuities at the beginning and end of each frame. When we perform Fourier Transform, it assumes that the signal repeats, and the end of one frame does not connect smoothly with the beginning of the next one. This adds some glitches at regular intervals. So we have to make the ends of each frame smooth enough to connect with each other. This is possible by a processing called Windowing. This is done by Hamming windowing method. Fig. 5: Hamming Window. Step-IV: Fast Fourier Transform The Fast Fourier Transform is used to convert each frame of N samples from the time domain into the frequency domain. The FFT is a fast algorithm to implement the Discrete Fourier Transform (DFT) which is defined on the set of N samples {xn}, as follow: Note that we use j here to denote the imaginary unit, i.e. j= √-1. In general Xn ‟s are complex numbers. Step- V: Mel Filter Bank Processing The range of frequencies in FFT is very wide and voice signal does not follow the linear scale. The bank of filters according to Mel scale as shown in figure 6 is used [3] to smoothen scale. Fig. 6: Mel scale filter bank. The Mel for given frequency F in Hz: F (mel) = [2595 * log10 [1+f] 700] Step-VI: Discrete Cosine Transform This is the process just reverse to FFT to convert the log Mel spectrum into time domain using Discrete Cosine Transform (DCT). The result of the conversion is called Mel Frequency Cepstrum Coefficient.
  • 5. Speech Recognized Automation System Using Speaker Identification through Wireless www.iosrjournals.org 15 | Page Step-VII: Delta Energy and Delta Spectrum The levels of speech signal and the frames changes, therefore, there is a need to add features related to the change in cepstral features over time. The energy in a frame for a signal X(t) in a window from time sample t1 to time sample t2, is represented at the equation below: Energy= ∑X2 (t) IV. Hardware Design In this section we present the hardware description of the modules used for adaptation of speech processor HM 2007 and wireless communication like Zigbee Module. 4.1. Speech Processor Module The main element of the design consist of speech synthesizer/recognition ASIC and ZIGBEE transceiver module. The speech recognition IC needs to be programmed for various speech commands. This is done by using a MIC, analog speech signal from MIC are stored in the internal memory of the IC after being digitized using ADC blocks which is internal process in processor. Once the learning/programming is completed. The IC is ready to accept the commands. A command issued by the user through MIC will be digitized and compared with the digitized commands already stored in the internal memory of IC. When a match is received, microcontroller status will be updated accordingly. The microcontroller in turn will generate a specific data which will be transmitted through RF channel using ZIGBEE transmitter. 4.2 Microcontroller Module Fig. 7: Microcontroller Module. Fig. 8: Microcontroller-89c52 Board This module receives signal from speech processor and decision making regarding switching ON/OFF appliance is made here as shown in figure 7. Either from speech processor or Matlab simulator output associated with appliance like Light or fan is given to microcontroller. Output of microcontroller will toggle for incoming signal from processors. Wireless module is connected to one of the input port of microcontroller and appliances are connected to output port.
  • 6. Speech Recognized Automation System Using Speaker Identification through Wireless www.iosrjournals.org 16 | Page 4.3 Wireless (Zigbee) Module Zigbee protocol is the communication protocol that is used in this system for wireless communication as shown in figure 9. Zigbee offers 250 kbps as maximum baud rate. Range of wireless unit is 100meters, but practically we get approximately about 30-40 meters. Fig. 9: Zigbee Module. V. Tests And Results In this section, we need to train the system using speech commands, which are saved in database using .wav format. Following command is used to record speech command for every appliance. y = wavrecord (30*8000, 8000); after recording command it is checked whether it is in given sample format using, Speechdetect (y); If recording is ok then it is saved in database using command, Wavwrite (y, 8000, 'speaker.wav'); For example, for light appliance it is saved as, Wavwrite (y, 8000, 'light.wav'); After recording all commands for various appliances, single database called MODEL file is built, which will have all commands for specific speaker. This model file is built using following command in Matlab, „trainscript‟. Above steps are used to train and record the speech commands. Once commands are saved, our system is ready to work. At the serial port we use Zigbee transmitter module, which will send on/off commands to appliance, wirelessly. Using command „digitrecgui‟ speech commands from users are received. If there is match found results will be shown on screen as shown in figure 10 and figure 11. Figure 10 shows matching for light command for particular speaker, where as figure 11 shows match for fan. Fig. 10: command Light matched results.
  • 7. Speech Recognized Automation System Using Speaker Identification through Wireless www.iosrjournals.org 17 | Page Fig. 11: command Fan matched results. For 10 different speakers our system is tested, where we found following satisfactory results of speaker identification and appliances worked similarly, results are shown in Table 1. Sr. No. SPEAKER LIGHT (%) FAN (%) 1 Actual Speaker 80 85 2 Speaker 1 10 10 3 Speaker 2 0 0 4 Speaker 3 0 20 5 Speaker 4 10 0 6 Speaker 5 0 10 7 Speaker 6 20 10 8 Speaker 7 0 10 9 Speaker 8 10 0 10 Speaker 9 0 0 Table 1: Results of Speech recognized Automation system using speaker identification. For actual speaker and other speakers, statistics is calculated for speech command matching. VI. Conclusions And Future Work A speech recognized automation system using speaker identification through wireless communication is built and implemented. The prototype developed can control electrical devices in a home or office wirelessly. The system implements Automatic Speech Recognition using speech processor and Speaker Identification through MATLAB coding using MFCC algorithm. The system implements the wireless network using ZigBee RF modules for their efficiency and low power consumption. After analyzing all the information from our project, we found that our algorithm improves the speaker recognition task. Because the MFCC uses the Mel scale, the approximation and maximum likelihood nature of speech to the human voice behavior is good, it represent in a better way the voice. We improved the precision in the recognition task. We found that as long as the number of speakers increases, the number of errors increases; this is because the distance to each centroid of each speaker can be similar to another and maximum likelihood of speech is checked by algorithm. Future work will entail:  Adding confirmation commands to the voice recognition system from appliance.  Integrating appliance control functions to improve the system versatility such as providing control commands other than ON/OFF commands such as slow down, dim, etc. Acknowledgements I, Prabhakar V. Mhadse acknowledge with thanks the guidance received from Prof. A. C. Wani and prof. S. R. Suralkar, Head of the Department in preparation of this Project. I would also give my sincere thanks to other staff of college for their warm support. The matter taken from the various sources in preparing this report has been duly cited in reference.
  • 8. Speech Recognized Automation System Using Speaker Identification through Wireless www.iosrjournals.org 18 | Page References [1] Abhishek Thakur, Neeru Singla, “Design of Matlab-Based Automatic Speaker Recognition and Control System”, International Journal Of Advanced Engineering Sciences And Technologies, Vol no. 8, issue no. 1. [2] Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm, Ch.Srinivasa Kumar et al. / International Journal on Computer Science and Engineering (IJCSE). [3] Lindasalwa Muda, Mumtaj Begam and I. Elamvazuthi, “Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques”, Journal Of Computing, Volume 2, Issue 3, March 2010, Issn 2151-9617. [4] Rojas-Rodríguez Rafael, Aceves-Pérez Rita1, Cortés-Aburto Obed, García-Meneses Carlos1, Vela-Valdés Luis, “Development and integration of automated systems aimed to the comfort of seniors and people with disabilities” International Conference on Intelligent Environments, pp339-342,2012. [5] Rabiner L. R. and Schafer R. W., Digital Processing of Speech Signals,New Jersey, US: Prentice Hall Inc, 1978. [6] Augusto J. C. and Nugent C. D., “Smart homes can be smarter”, in Lecture Notes in Artificial Intelligence. Germany vol. 4008, pp 1-15, May 2006. [7] “XBee-2.5-Manual,” ZigBee RF communication protocol. (2008). [8] Algimantas R., Ratkevicius K. and Rudzionis V, “Voice interactive systems”, in The Engineering Handbook of Smart Technology for Aging, Disability and Independence USA, pp 281-296, 2008. [9] Vovos A. Kladis B.and Fakotakis N.D., "Speech operated smart-home control system for users with special needs", in Proc. INTERSPEECH, pp.193-196, 2005. [10] Sixmith A., “Modeling the well being of older people” in The Engineering Handbook of Smart Technology for Aging, Disability and Independence USA, pp 569-584, 2008. [11] Muñoz C., Arellan D, Perales F. J. and Fontaner G. “Perceptual and intelligent domotics system for disabled perople” in proceedings of Sixth IASTED International Conference, USA pp 70-75 September 2011. [12] Jorge MARTINEZ*, Hector PEREZ, Enrique ESCAMILLA; Masahisa Mabo SUZUKI; ‟‟Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) Techniques” IEEE 2012. [13] Shannon B.J., Paliwal K.K., “A comparative study of filter bank spacing for speech recognition”,Proc. of Microelectronic engi- neering research conference, Brisbane, Australia, Nov. 2003. [14] T. Hill and P. Lewicki, Statistics Methods and Applications. Tulsa, OK: StatSoft Inc., 2006. [15] S. Furui, Digital Speech Processing, Synthesis, and Recognition. New York: Marcel Dekker, 2001. [16] Fezari, M.; Khati, A.-E, “New speech processor and ultrasonic sensors based embedded system to improve the control of a motorized wheelchair,” Design and Test Workshop, 2008. IDT 2008. [17] A. Spanias, T. Painter and V. Atti Audio Signal Processing and Coding. John Wiley & Sons, 2007. [18] Mohd. Rihan, M. Salim Beg, Narayan Gehlot; “Developments in Home Automation and Networking”, Proc. National Conference on “Computing for Nation Development”, Bharati Vidyapeeth Institute of Computer and Management, New Delhi, 23-24 Feb. , 2007, pp. 61-64.