SlideShare a Scribd company logo
Middle-East Journal of Scientific Research
24 (Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016
ISSN 1990-9233; © IDOSI Publications, 2016
DOI: 10.5829/idosi.mejsr.2016.24.IIECS.23140
Corresponding Author: Nitin Desai, School of Electronics Engineering, VIT University, Chennai campus, Chennai, India.
58
Voice Controlled Intelligent Remote for Consumer Electronics
Nitin Desai, Shiva Shankar and John Sahaya Rani Alex
School of Electronics Engineering, VIT University, Chennai campus, Chennai, India
Abstract: Census has revealed that over 21 million people in India have been suffering from one or other kind
of disability. This is about 2.1 percentage of total population. Among the total disabled in the country, 27.9
percentage persons are having functional movement disorder. Hence intelligent remote is proposed and
developed to help physically challenged people and patients in the hospitals with limited motor skills in hand.
The proposed device take the voice command of physically challenged people and generate equivalent infra
red signal for a consumer devices. Low power embedded raspberry pi convert the voice command to text and
8051 based system generates infra red signal as per text command. Thus, the proposed system is implemented
and tested in real time.
Key words: IR generation Transmitter Raspberry pi MCS8051
INTRODUCTION Languages, on which so far automatic speech
As per Census, more than 21 million people in India fraction of total around 7300 existing languages. Russian,
are suffering from different kinds of disability [1]. Physical Portuguese, Chinese, Vietnamese, Japan, Spanish,
disability becomes a major obstacle in the lives of Filipino, Arabic, English, Bengali, Tamil, Malayalam,
physically challenged people and they are not able to Sinhala, Hindi are prominent among them. English is the
perform their day - to-day activities without help of other language for which maximum work for recognition is done
people [2]. Most of motor impaired users use computers [5]. A voice-controlled human-computer interface system
as their communication device [3]. Thus voice based has been designed that enables severely handicapped
system can be beneficial for such physically challenged individuals to operate a computer this will be the new
people. generation of interface for disabled [6]. The hands free
The idea of interaction between human and machine speech recognition method based on HMM composition
led to research in Speech recognition. Automatic speech which realizes and improves user interface by countering
recognition system involves conversion of speech into additive noise [7]. Approach to unsupervised speaker
text. Speech recognition offers greater freedom to employ adaption task for HMM based speech synthesis models
the physically handicapped people in several applications which avoids need for supplementary acoustic models [8].
like manufacturing processes; medicine and telephone IR remote control is most popular equipment for
network [4]. home and office appliances such as air conditioner,
Speech recognition technology is used for television. Recent researchers concentrate on the use of
converting speech signals into a sequence of words or IR remote control of devices for controlling different
other linguistic units by the use of an algorithm [4]. Out of applications. Literature shows work on detection and
87 different speaker, the error word rate of 19% with 20 study of various IR control signals [9]. IR control signal
seconds was found using raspberry pi [5]. Automatic is used to control home appliances and industrial machine
speech recognition has been viewed as successive parameters [10, 11]. There is a white paper on how IR
transformations of acoustic micro structure of speech remote can be used to provide cost efficient solution for
signal into its implicit phonetic macro- structure. In other controlling many kinds of electronics devices [12].
words, a speech recognition system is a speech-to-text Interferences between IR remote control systems for
conversion whereas the output of the system displays electrical home appliances and high frequency electronic
text corresponding to the recognized speech. ballast lightning is described by this paper [13].
recognition systems have been developed, are just a
Voice
Command
Raspberr
y PI
Micro-
controller
IR
IR Signal
Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016
59
Fig. 1: Block Diagram of Intelligent remote system Generation of IR Signal: This consists of 8051
The proposed system convert voice command into takes digitized voice command as input from voice
digital using Raspberry pi and transmitting equivalent IR command recognition system. Database is created for
code using Raspberry pi and ESA MCB 51 microcontroller different commands which consist of IR code in binary
kit. This proposed intelligent IR generation system will be form. In response to input voice command, microcontroller
used to control many devices in home for physically will fetch IR code in the binary form and generates IR
challenged people and for patients in the hospitals. code with the help of RC6 transmitter.
Rest of paper is organized as follows, the section II
explains about speech recognition system and IR code RC-6 Protocol: RC-6 signals are modulated on a 36KHz
generator. The section III shows complete software Infra Red Carrier. The duty cycle of this carrier has to be
description of intelligent remote system. Implementation between 25 percent to 50 percent. Data is modulated using
details and results are mentioned in section IV. Manchester coding.
Conclusion is given in section V.
Hadware System Design: Block diagram in Fig 1, Its purpose is to adjust the gain of the IR receiving unit.
represents complete intelligent remote system which This leader symbol is followed by a Start Bit SB which
consist of raspberry pi and 8051 microcontroller. always has the value of "1". Its purpose is to calibrate the
Raspberry pi takes input from physically challenged receiver's timing. The field bits are used to calculate mode.
people and convert that signal into equivalent text format. Here mode zero is used. Thus all field bits are set to zero.
This text is saved in raspberry pi. Python script is used to Finally the header is terminated by the trailer bit TR. Bit
recognize input command in text form and as per time of trailer symbol is twice the normal bits. Trailer bit
command, raspberry pi generates control signal for 8051 acts as a traditional toggle bit, which will be inverted
microcontroller which is used to generate equivalent IR whenever a key is released. This allows the receiver to
code with the help of IR transmitter connected to one of distinguish between a new key and a repeated key.
the GPIO pin of 8051.
Speech Recognition System: Speech is the most common address byte. It indicates that total of 256 different
communication platform for exchanging information devices can be controlled using mode 0 of RC-6. The MSB
among humans. Speech recognition systems are the is transmitted first.
efficient alternatives for such devices where typing
becomes difficult. The proposed speech recognition Information Field: The information field holds 8 bits
system converts speech signals into a sequence of words which are used as command byte. This means that each
by algorithm implemented on a computer. Given voice device can have up to 256 different commands. The MSB
command is analyzed and compared with the predefined is transmitted first.
command in the grammar file and based on match, text is
displayed. More number of commands can be added in Signal Free Time: The Signal Free time is a period in
grammar file to make library of word for a universal which no data may be transmitted (by any device). It is
applications. important for the receiver to detect the signal free
The key feature of Raspberry pi-2 model b, is 1GB
ram. Other features of pi 2 are 4 USB ports,900Hz quad
–core ARM Cortex-A7 CPU,40 GPIO pins, full HDMI port,
Ethernet port, Combined 3.5mm audio jack, Micro SD card
slot, Video Core IV 3D graphics core. Because it has an
ARMv7 processor, it can run the full range of ARM
GNU/Linux distributions, including Snappy Ubuntu Core,
as well as Microsoft Windows 10. Speech recognition
application computation intensive and requires lot of
memory, that is the reason Raspberry Pi-2 model is
chosen.
microcontroller and RC6 IR transmitter. Microcontroller
Header Field: First the Leader Symbol [LS] is transmitted.
Control Field: Control field holds 8 bits which are used as
Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016
60
2a
2b
2c
2d
Fig. 2a: Leader bit, 2b: Normal bit, 2c: Toggle bit, 2d: RC6
Frame Format
time at the end of a message to avoid incorrect
reception. The signal free time is set to 6t, which is
2.666ms. Fig. 4: Internal logic for IR code generation
Below mentioned diagram shows different bits used
in RC6 frame format as shown in Fig 2d. Fig 2a shows Markov models.HMM uses Vitterbi algorithm to find the
width of header bit. Similarly Fig 2c represents width of most probable path. Phoneme is the linguistic unit used to
toggle bit and Fig 2a shows width of normal bit in RC6 construct words. For example, the word “bat” is
frame format. Sample RC6 frame format is shown in Fig 2d. composed of three phones /b/ /ae/ /t/.There are about
Software Description of the System configuration file allows us to choose the feature
Speech Recognition System: There are lot of open source extraction method of speech and the data for training.
software’s available for implementing speech recognition The system connection is shown in Figure 3.
system. This paper use Sphinx software developed by
CMU. Generation of IR Signal: IR code is generated on 8051
Pocket sphinx is based on Hidden Markov model microcontroller with the help of assembly language in Keil
(HMM) classifier. HMM is the mathematical tool for real uVision 4. This flow chart describes how IR code is
time modelling. It creates stochastic models from known generated for consumer electronics device.
utterances and compares the probability that the This proposed system generates IR code for DVD
unknown utterance generated by each model.HMM has player to make it on or off. Same system can be extended
several states each state corresponds to one observable by creating the data base for different IR code to make the
event restriction is that when number of observation remote as universal remote as shown in Figure 4.
increases size of module becomes larger. For each state
probability distribution is defined that specifies every Implementation and Result: Sphinx base 0.8 and Pocket
observation symbol generated in particular state values of sphinx 0.8 are installed in raspberry pi which are used to
states are hidden hence this model is called Hidden analyzing the input voice commands. The system is
Fig. 3: Speech recognition system hardware
implementation
40 such phonemes are required for English. The
Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016
61
Fig. 5: Voice Command Recognized
Fig. 6: Simulation of IR code in Proteus
Fig. 7: IR Code Waveform on CRO
Fig. 8: IR Code generation hardware implementation
configured to choose Mel Frequency Cepstral Coefficient
(MFCC) as feature extraction method. Grammar file is
created with predefined commands which are to be
recognized.
HMM basedphonememodel forEnglish is being trained
using WSJ data: During testing, Viterbi decoding is used
to compute the most probable match of the word being
uttered and the results are displayed in Figure 5. Based on
predefined commands in the grammar file if the match is
found on voice command, respective text is displaced.
Thus voice to text is conversion is done. That text is
saved in a text file. Using the transcribed text and trigger
for IR generation is given through GPIO pins.
IR code generation logic is implemented in 8051
microcontroller for DVD player as shown in Figure 6. IR
code is not visible to human eyes. However we can
observe the blinking of IR transmitter led with the help of
camera. It is also possible to get complete IR code on CRO
as shown in Figure 6. This logic is implemented and
verified on Proteus before using it in hardware.
CONCLUSION
Proposed work is initiated keeping in mind the needs
and comforts of physically challenged people and
patients in the hospitals with limited motor skills in
hand. Most of products seen in the market are focused
for the larger strata of the society. Technology is no
longer different. The traditional remote controls with
push down buttons are nearly impossible to use for
people suffering with diseases inhibiting motor functions.
A voice controlled smart remote however, solves that
issue.
This proposed system is used to control Philips DVD
player and can be extended to one step ahead to make it
as universal for making the home automation which would
help physically challenged people
REFERENCES
1. http://censusindia.gov.in/Census_And_You/disabl
ed_pop ulation.aspx
2. Mirza, R., XXXX. An indoor navigation approach to
aid the physically disabled people.
3. Pradipta Biswas and Debasis Samanta, 2008. IEEE
transactions on neural systems and rehabilitation
engineering, VOL. 16, NO. 2, APRIL 2008. A
Communication Aid for Persons With Disabilities.
Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016
62
4. Suma Swamy and K.V. Ramakrishanan, 2013. An 9. Rasheed, A.A. and M. Khidrjarjes, 2012. Detection
efficient Speech recognition System”, Computer and Study of Various IR Handheld Remote Control
science &engineering: An international Journal, Signals And Using them for Home Applications,
3(4). [2012 International Conference on Education and e-
5. D’Haro, L.F., 2012. Member, IEEE, R. de Córdoba, Learning Innovations]
Member, IEEE, J.I. Rojo, J. Díez, D. Avendaño, J.M. 10. Nhivekar, G.S. and R.R. Mudholker, 2011.
Bermudo “Low-Cost Speaker and Language Microcontroller based IR control Signal detector for
Recognition Literature review on Automatic speech home application. Advanced in applied Science
recognition by Wiqas Ghai International journal of Research, 2(4): 410-416.
computers Application Vol 4 march 2012 11. Yuan Xiao, Pan Yuliang and Ling Zaiying, 2011. The
6. Mu-Chun Su and Ming-Tsang Chung, 2001. Voice- application of infrared remote controlled code locks
controlled for disabled, year 2001,volume 12. in the management of industrial machine parameters.
7. Takiguchi, T., 2001. HMM -separation based speech International Conference on Electrical and Control
recognition for a distant moving speaker, year 2001, Engineering, pp: 418- 421.
volume 9 12. Martin Gotschlich, 2010. Remote Controls – Radio
8. Gibson, M., 2011. Unsupervised Intralingual and Frequency or Infrared, Infineon Technologies AG,
cross lingual speaker adaption for hmm based speech 2010.
synthesis using two pass decision tree construction, 13. Preventing IR interference between infrared waves
year 2011 volume 19. emitted by high frequency florescent lighting system
and infrared remote control by Kataoka,year 1997
volume 33.

More Related Content

What's hot

Operation and mainetainence of switch ppt
Operation and mainetainence of switch pptOperation and mainetainence of switch ppt
Operation and mainetainence of switch ppt
Isha Negi
 
Network layer and circuit switching
Network layer and circuit switchingNetwork layer and circuit switching
Network layer and circuit switching
asimnawaz54
 
Switching systems lecture6
Switching  systems lecture6Switching  systems lecture6
Switching systems lecture6
Jumaan Ally Mohamed
 
Digital network lecturer7
Digital network  lecturer7Digital network  lecturer7
Digital network lecturer7
Jumaan Ally Mohamed
 
Digital network lecturer5
Digital network  lecturer5Digital network  lecturer5
Digital network lecturer5
Jumaan Ally Mohamed
 
The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...
The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...
The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...
IJCI JOURNAL
 
Rfia.word
Rfia.wordRfia.word
Rfia.word
Bhushan Bhushan
 
DMRC Telecom EPABX System
DMRC Telecom EPABX SystemDMRC Telecom EPABX System
Switching systems lecture5
Switching  systems lecture5Switching  systems lecture5
Switching systems lecture5
Jumaan Ally Mohamed
 
VoIP and multimedia networking
VoIP and multimedia networkingVoIP and multimedia networking
VoIP and multimedia networking
sangusajjan
 
Switching systems lecture1
Switching  systems lecture1Switching  systems lecture1
Switching systems lecture1
Jumaan Ally Mohamed
 
Switching systems lecture8 mpls
Switching  systems lecture8 mplsSwitching  systems lecture8 mpls
Switching systems lecture8 mpls
Jumaan Ally Mohamed
 
1x ev do hard handoff
1x ev do hard handoff1x ev do hard handoff
1x ev do hard handoff
Phuoc Phuoc
 
Report
ReportReport
VHDL Implementation of FPGA Based OFDM Modem for Wireless Applications
VHDL Implementation of FPGA Based OFDM Modem for Wireless ApplicationsVHDL Implementation of FPGA Based OFDM Modem for Wireless Applications
VHDL Implementation of FPGA Based OFDM Modem for Wireless Applications
Editor IJCATR
 
Mobile computing Unit III MANET Notes
Mobile computing Unit III MANET NotesMobile computing Unit III MANET Notes
Bluetooth
BluetoothBluetooth
Bluetooth
kannabhargav
 
Signaling System 7 (SS7)
Signaling System 7 (SS7)Signaling System 7 (SS7)
Signaling System 7 (SS7)
shire ali
 
Digital network lecturer4
Digital network  lecturer4Digital network  lecturer4
Digital network lecturer4
Jumaan Ally Mohamed
 
Shahnshah Sarker 072802556
Shahnshah Sarker  072802556Shahnshah Sarker  072802556
Shahnshah Sarker 072802556
mashiur
 

What's hot (20)

Operation and mainetainence of switch ppt
Operation and mainetainence of switch pptOperation and mainetainence of switch ppt
Operation and mainetainence of switch ppt
 
Network layer and circuit switching
Network layer and circuit switchingNetwork layer and circuit switching
Network layer and circuit switching
 
Switching systems lecture6
Switching  systems lecture6Switching  systems lecture6
Switching systems lecture6
 
Digital network lecturer7
Digital network  lecturer7Digital network  lecturer7
Digital network lecturer7
 
Digital network lecturer5
Digital network  lecturer5Digital network  lecturer5
Digital network lecturer5
 
The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...
The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...
The Performance Evaluation of IEEE 802.16 Physical Layer in the Basis of Bit ...
 
Rfia.word
Rfia.wordRfia.word
Rfia.word
 
DMRC Telecom EPABX System
DMRC Telecom EPABX SystemDMRC Telecom EPABX System
DMRC Telecom EPABX System
 
Switching systems lecture5
Switching  systems lecture5Switching  systems lecture5
Switching systems lecture5
 
VoIP and multimedia networking
VoIP and multimedia networkingVoIP and multimedia networking
VoIP and multimedia networking
 
Switching systems lecture1
Switching  systems lecture1Switching  systems lecture1
Switching systems lecture1
 
Switching systems lecture8 mpls
Switching  systems lecture8 mplsSwitching  systems lecture8 mpls
Switching systems lecture8 mpls
 
1x ev do hard handoff
1x ev do hard handoff1x ev do hard handoff
1x ev do hard handoff
 
Report
ReportReport
Report
 
VHDL Implementation of FPGA Based OFDM Modem for Wireless Applications
VHDL Implementation of FPGA Based OFDM Modem for Wireless ApplicationsVHDL Implementation of FPGA Based OFDM Modem for Wireless Applications
VHDL Implementation of FPGA Based OFDM Modem for Wireless Applications
 
Mobile computing Unit III MANET Notes
Mobile computing Unit III MANET NotesMobile computing Unit III MANET Notes
Mobile computing Unit III MANET Notes
 
Bluetooth
BluetoothBluetooth
Bluetooth
 
Signaling System 7 (SS7)
Signaling System 7 (SS7)Signaling System 7 (SS7)
Signaling System 7 (SS7)
 
Digital network lecturer4
Digital network  lecturer4Digital network  lecturer4
Digital network lecturer4
 
Shahnshah Sarker 072802556
Shahnshah Sarker  072802556Shahnshah Sarker  072802556
Shahnshah Sarker 072802556
 

Similar to Voice Controlled Intelligent Remote for Consumer Electronics

Automatic Control of Instruments Using Efficient Speech Recognition Algorithm
Automatic Control of Instruments Using Efficient Speech Recognition AlgorithmAutomatic Control of Instruments Using Efficient Speech Recognition Algorithm
Automatic Control of Instruments Using Efficient Speech Recognition Algorithm
IJEEE
 
IRJET - Hand Gesture Recognition and Voice Conversion System using IoT
IRJET -  	  Hand Gesture Recognition and Voice Conversion System using IoTIRJET -  	  Hand Gesture Recognition and Voice Conversion System using IoT
IRJET - Hand Gesture Recognition and Voice Conversion System using IoT
IRJET Journal
 
Smart Garage Implementation and Design Using Whatsapp Communication Media
Smart Garage Implementation and Design Using Whatsapp Communication MediaSmart Garage Implementation and Design Using Whatsapp Communication Media
Smart Garage Implementation and Design Using Whatsapp Communication Media
TELKOMNIKA JOURNAL
 
Final Report of Project A Low
Final Report of Project A LowFinal Report of Project A Low
Final Report of Project A Low
Jan Salomon
 
Biomedical Parameter Transfer Using Wireless Communication
Biomedical Parameter Transfer Using Wireless CommunicationBiomedical Parameter Transfer Using Wireless Communication
Biomedical Parameter Transfer Using Wireless Communication
ijsrd.com
 
EEE226a.ppt
EEE226a.pptEEE226a.ppt
EEE226a.ppt
SaifulAhmad27
 
Synthesized Speech using a small Microcontroller
Synthesized Speech using a small MicrocontrollerSynthesized Speech using a small Microcontroller
Synthesized Speech using a small Microcontroller
iosrjce
 
N010637794
N010637794N010637794
N010637794
IOSR Journals
 
3245731 tele-controlled-steper-motor-thesis
3245731 tele-controlled-steper-motor-thesis3245731 tele-controlled-steper-motor-thesis
3245731 tele-controlled-steper-motor-thesis
Thanh Nguyen
 
GSM based E-Notice Board
GSM based E-Notice BoardGSM based E-Notice Board
GSM based E-Notice Board
Mohammad Aazam Ali
 
IRJET- Smart Home System using Voice Recognition
IRJET-  	  Smart Home System using Voice RecognitionIRJET-  	  Smart Home System using Voice Recognition
IRJET- Smart Home System using Voice Recognition
IRJET Journal
 
LORA BASED DATA ACQUISITION SYSTEM
LORA BASED DATA ACQUISITION SYSTEMLORA BASED DATA ACQUISITION SYSTEM
LORA BASED DATA ACQUISITION SYSTEM
IRJET Journal
 
A42060105
A42060105A42060105
A42060105
IJERA Editor
 
Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...
IOSR Journals
 
Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...
IOSR Journals
 
Paper robot
Paper robotPaper robot
Paper robot
Akhil Ajith
 
Module 2_part 2.pptx
Module 2_part 2.pptxModule 2_part 2.pptx
Module 2_part 2.pptx
NIKHILAS12
 
Implementation of Algorithms For Multi-Channel Digital Monitoring Receiver
Implementation of Algorithms For Multi-Channel Digital Monitoring ReceiverImplementation of Algorithms For Multi-Channel Digital Monitoring Receiver
Implementation of Algorithms For Multi-Channel Digital Monitoring Receiver
IOSR Journals
 
Voice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On ZigbeeVoice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On Zigbee
IOSR Journals
 
Voice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On ZigbeeVoice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On Zigbee
IOSR Journals
 

Similar to Voice Controlled Intelligent Remote for Consumer Electronics (20)

Automatic Control of Instruments Using Efficient Speech Recognition Algorithm
Automatic Control of Instruments Using Efficient Speech Recognition AlgorithmAutomatic Control of Instruments Using Efficient Speech Recognition Algorithm
Automatic Control of Instruments Using Efficient Speech Recognition Algorithm
 
IRJET - Hand Gesture Recognition and Voice Conversion System using IoT
IRJET -  	  Hand Gesture Recognition and Voice Conversion System using IoTIRJET -  	  Hand Gesture Recognition and Voice Conversion System using IoT
IRJET - Hand Gesture Recognition and Voice Conversion System using IoT
 
Smart Garage Implementation and Design Using Whatsapp Communication Media
Smart Garage Implementation and Design Using Whatsapp Communication MediaSmart Garage Implementation and Design Using Whatsapp Communication Media
Smart Garage Implementation and Design Using Whatsapp Communication Media
 
Final Report of Project A Low
Final Report of Project A LowFinal Report of Project A Low
Final Report of Project A Low
 
Biomedical Parameter Transfer Using Wireless Communication
Biomedical Parameter Transfer Using Wireless CommunicationBiomedical Parameter Transfer Using Wireless Communication
Biomedical Parameter Transfer Using Wireless Communication
 
EEE226a.ppt
EEE226a.pptEEE226a.ppt
EEE226a.ppt
 
Synthesized Speech using a small Microcontroller
Synthesized Speech using a small MicrocontrollerSynthesized Speech using a small Microcontroller
Synthesized Speech using a small Microcontroller
 
N010637794
N010637794N010637794
N010637794
 
3245731 tele-controlled-steper-motor-thesis
3245731 tele-controlled-steper-motor-thesis3245731 tele-controlled-steper-motor-thesis
3245731 tele-controlled-steper-motor-thesis
 
GSM based E-Notice Board
GSM based E-Notice BoardGSM based E-Notice Board
GSM based E-Notice Board
 
IRJET- Smart Home System using Voice Recognition
IRJET-  	  Smart Home System using Voice RecognitionIRJET-  	  Smart Home System using Voice Recognition
IRJET- Smart Home System using Voice Recognition
 
LORA BASED DATA ACQUISITION SYSTEM
LORA BASED DATA ACQUISITION SYSTEMLORA BASED DATA ACQUISITION SYSTEM
LORA BASED DATA ACQUISITION SYSTEM
 
A42060105
A42060105A42060105
A42060105
 
Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...
 
Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...Speech Recognized Automation System Using Speaker Identification through Wire...
Speech Recognized Automation System Using Speaker Identification through Wire...
 
Paper robot
Paper robotPaper robot
Paper robot
 
Module 2_part 2.pptx
Module 2_part 2.pptxModule 2_part 2.pptx
Module 2_part 2.pptx
 
Implementation of Algorithms For Multi-Channel Digital Monitoring Receiver
Implementation of Algorithms For Multi-Channel Digital Monitoring ReceiverImplementation of Algorithms For Multi-Channel Digital Monitoring Receiver
Implementation of Algorithms For Multi-Channel Digital Monitoring Receiver
 
Voice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On ZigbeeVoice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On Zigbee
 
Voice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On ZigbeeVoice Recognition Wireless Home Automation System Based On Zigbee
Voice Recognition Wireless Home Automation System Based On Zigbee
 

Voice Controlled Intelligent Remote for Consumer Electronics

  • 1. Middle-East Journal of Scientific Research 24 (Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016 ISSN 1990-9233; © IDOSI Publications, 2016 DOI: 10.5829/idosi.mejsr.2016.24.IIECS.23140 Corresponding Author: Nitin Desai, School of Electronics Engineering, VIT University, Chennai campus, Chennai, India. 58 Voice Controlled Intelligent Remote for Consumer Electronics Nitin Desai, Shiva Shankar and John Sahaya Rani Alex School of Electronics Engineering, VIT University, Chennai campus, Chennai, India Abstract: Census has revealed that over 21 million people in India have been suffering from one or other kind of disability. This is about 2.1 percentage of total population. Among the total disabled in the country, 27.9 percentage persons are having functional movement disorder. Hence intelligent remote is proposed and developed to help physically challenged people and patients in the hospitals with limited motor skills in hand. The proposed device take the voice command of physically challenged people and generate equivalent infra red signal for a consumer devices. Low power embedded raspberry pi convert the voice command to text and 8051 based system generates infra red signal as per text command. Thus, the proposed system is implemented and tested in real time. Key words: IR generation Transmitter Raspberry pi MCS8051 INTRODUCTION Languages, on which so far automatic speech As per Census, more than 21 million people in India fraction of total around 7300 existing languages. Russian, are suffering from different kinds of disability [1]. Physical Portuguese, Chinese, Vietnamese, Japan, Spanish, disability becomes a major obstacle in the lives of Filipino, Arabic, English, Bengali, Tamil, Malayalam, physically challenged people and they are not able to Sinhala, Hindi are prominent among them. English is the perform their day - to-day activities without help of other language for which maximum work for recognition is done people [2]. Most of motor impaired users use computers [5]. A voice-controlled human-computer interface system as their communication device [3]. Thus voice based has been designed that enables severely handicapped system can be beneficial for such physically challenged individuals to operate a computer this will be the new people. generation of interface for disabled [6]. The hands free The idea of interaction between human and machine speech recognition method based on HMM composition led to research in Speech recognition. Automatic speech which realizes and improves user interface by countering recognition system involves conversion of speech into additive noise [7]. Approach to unsupervised speaker text. Speech recognition offers greater freedom to employ adaption task for HMM based speech synthesis models the physically handicapped people in several applications which avoids need for supplementary acoustic models [8]. like manufacturing processes; medicine and telephone IR remote control is most popular equipment for network [4]. home and office appliances such as air conditioner, Speech recognition technology is used for television. Recent researchers concentrate on the use of converting speech signals into a sequence of words or IR remote control of devices for controlling different other linguistic units by the use of an algorithm [4]. Out of applications. Literature shows work on detection and 87 different speaker, the error word rate of 19% with 20 study of various IR control signals [9]. IR control signal seconds was found using raspberry pi [5]. Automatic is used to control home appliances and industrial machine speech recognition has been viewed as successive parameters [10, 11]. There is a white paper on how IR transformations of acoustic micro structure of speech remote can be used to provide cost efficient solution for signal into its implicit phonetic macro- structure. In other controlling many kinds of electronics devices [12]. words, a speech recognition system is a speech-to-text Interferences between IR remote control systems for conversion whereas the output of the system displays electrical home appliances and high frequency electronic text corresponding to the recognized speech. ballast lightning is described by this paper [13]. recognition systems have been developed, are just a
  • 2. Voice Command Raspberr y PI Micro- controller IR IR Signal Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016 59 Fig. 1: Block Diagram of Intelligent remote system Generation of IR Signal: This consists of 8051 The proposed system convert voice command into takes digitized voice command as input from voice digital using Raspberry pi and transmitting equivalent IR command recognition system. Database is created for code using Raspberry pi and ESA MCB 51 microcontroller different commands which consist of IR code in binary kit. This proposed intelligent IR generation system will be form. In response to input voice command, microcontroller used to control many devices in home for physically will fetch IR code in the binary form and generates IR challenged people and for patients in the hospitals. code with the help of RC6 transmitter. Rest of paper is organized as follows, the section II explains about speech recognition system and IR code RC-6 Protocol: RC-6 signals are modulated on a 36KHz generator. The section III shows complete software Infra Red Carrier. The duty cycle of this carrier has to be description of intelligent remote system. Implementation between 25 percent to 50 percent. Data is modulated using details and results are mentioned in section IV. Manchester coding. Conclusion is given in section V. Hadware System Design: Block diagram in Fig 1, Its purpose is to adjust the gain of the IR receiving unit. represents complete intelligent remote system which This leader symbol is followed by a Start Bit SB which consist of raspberry pi and 8051 microcontroller. always has the value of "1". Its purpose is to calibrate the Raspberry pi takes input from physically challenged receiver's timing. The field bits are used to calculate mode. people and convert that signal into equivalent text format. Here mode zero is used. Thus all field bits are set to zero. This text is saved in raspberry pi. Python script is used to Finally the header is terminated by the trailer bit TR. Bit recognize input command in text form and as per time of trailer symbol is twice the normal bits. Trailer bit command, raspberry pi generates control signal for 8051 acts as a traditional toggle bit, which will be inverted microcontroller which is used to generate equivalent IR whenever a key is released. This allows the receiver to code with the help of IR transmitter connected to one of distinguish between a new key and a repeated key. the GPIO pin of 8051. Speech Recognition System: Speech is the most common address byte. It indicates that total of 256 different communication platform for exchanging information devices can be controlled using mode 0 of RC-6. The MSB among humans. Speech recognition systems are the is transmitted first. efficient alternatives for such devices where typing becomes difficult. The proposed speech recognition Information Field: The information field holds 8 bits system converts speech signals into a sequence of words which are used as command byte. This means that each by algorithm implemented on a computer. Given voice device can have up to 256 different commands. The MSB command is analyzed and compared with the predefined is transmitted first. command in the grammar file and based on match, text is displayed. More number of commands can be added in Signal Free Time: The Signal Free time is a period in grammar file to make library of word for a universal which no data may be transmitted (by any device). It is applications. important for the receiver to detect the signal free The key feature of Raspberry pi-2 model b, is 1GB ram. Other features of pi 2 are 4 USB ports,900Hz quad –core ARM Cortex-A7 CPU,40 GPIO pins, full HDMI port, Ethernet port, Combined 3.5mm audio jack, Micro SD card slot, Video Core IV 3D graphics core. Because it has an ARMv7 processor, it can run the full range of ARM GNU/Linux distributions, including Snappy Ubuntu Core, as well as Microsoft Windows 10. Speech recognition application computation intensive and requires lot of memory, that is the reason Raspberry Pi-2 model is chosen. microcontroller and RC6 IR transmitter. Microcontroller Header Field: First the Leader Symbol [LS] is transmitted. Control Field: Control field holds 8 bits which are used as
  • 3. Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016 60 2a 2b 2c 2d Fig. 2a: Leader bit, 2b: Normal bit, 2c: Toggle bit, 2d: RC6 Frame Format time at the end of a message to avoid incorrect reception. The signal free time is set to 6t, which is 2.666ms. Fig. 4: Internal logic for IR code generation Below mentioned diagram shows different bits used in RC6 frame format as shown in Fig 2d. Fig 2a shows Markov models.HMM uses Vitterbi algorithm to find the width of header bit. Similarly Fig 2c represents width of most probable path. Phoneme is the linguistic unit used to toggle bit and Fig 2a shows width of normal bit in RC6 construct words. For example, the word “bat” is frame format. Sample RC6 frame format is shown in Fig 2d. composed of three phones /b/ /ae/ /t/.There are about Software Description of the System configuration file allows us to choose the feature Speech Recognition System: There are lot of open source extraction method of speech and the data for training. software’s available for implementing speech recognition The system connection is shown in Figure 3. system. This paper use Sphinx software developed by CMU. Generation of IR Signal: IR code is generated on 8051 Pocket sphinx is based on Hidden Markov model microcontroller with the help of assembly language in Keil (HMM) classifier. HMM is the mathematical tool for real uVision 4. This flow chart describes how IR code is time modelling. It creates stochastic models from known generated for consumer electronics device. utterances and compares the probability that the This proposed system generates IR code for DVD unknown utterance generated by each model.HMM has player to make it on or off. Same system can be extended several states each state corresponds to one observable by creating the data base for different IR code to make the event restriction is that when number of observation remote as universal remote as shown in Figure 4. increases size of module becomes larger. For each state probability distribution is defined that specifies every Implementation and Result: Sphinx base 0.8 and Pocket observation symbol generated in particular state values of sphinx 0.8 are installed in raspberry pi which are used to states are hidden hence this model is called Hidden analyzing the input voice commands. The system is Fig. 3: Speech recognition system hardware implementation 40 such phonemes are required for English. The
  • 4. Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016 61 Fig. 5: Voice Command Recognized Fig. 6: Simulation of IR code in Proteus Fig. 7: IR Code Waveform on CRO Fig. 8: IR Code generation hardware implementation configured to choose Mel Frequency Cepstral Coefficient (MFCC) as feature extraction method. Grammar file is created with predefined commands which are to be recognized. HMM basedphonememodel forEnglish is being trained using WSJ data: During testing, Viterbi decoding is used to compute the most probable match of the word being uttered and the results are displayed in Figure 5. Based on predefined commands in the grammar file if the match is found on voice command, respective text is displaced. Thus voice to text is conversion is done. That text is saved in a text file. Using the transcribed text and trigger for IR generation is given through GPIO pins. IR code generation logic is implemented in 8051 microcontroller for DVD player as shown in Figure 6. IR code is not visible to human eyes. However we can observe the blinking of IR transmitter led with the help of camera. It is also possible to get complete IR code on CRO as shown in Figure 6. This logic is implemented and verified on Proteus before using it in hardware. CONCLUSION Proposed work is initiated keeping in mind the needs and comforts of physically challenged people and patients in the hospitals with limited motor skills in hand. Most of products seen in the market are focused for the larger strata of the society. Technology is no longer different. The traditional remote controls with push down buttons are nearly impossible to use for people suffering with diseases inhibiting motor functions. A voice controlled smart remote however, solves that issue. This proposed system is used to control Philips DVD player and can be extended to one step ahead to make it as universal for making the home automation which would help physically challenged people REFERENCES 1. http://censusindia.gov.in/Census_And_You/disabl ed_pop ulation.aspx 2. Mirza, R., XXXX. An indoor navigation approach to aid the physically disabled people. 3. Pradipta Biswas and Debasis Samanta, 2008. IEEE transactions on neural systems and rehabilitation engineering, VOL. 16, NO. 2, APRIL 2008. A Communication Aid for Persons With Disabilities.
  • 5. Middle-East J. Sci. Res., 24(Special Issue on Innovations in Information, Embedded and Communication Systems): 58-62, 2016 62 4. Suma Swamy and K.V. Ramakrishanan, 2013. An 9. Rasheed, A.A. and M. Khidrjarjes, 2012. Detection efficient Speech recognition System”, Computer and Study of Various IR Handheld Remote Control science &engineering: An international Journal, Signals And Using them for Home Applications, 3(4). [2012 International Conference on Education and e- 5. D’Haro, L.F., 2012. Member, IEEE, R. de Córdoba, Learning Innovations] Member, IEEE, J.I. Rojo, J. Díez, D. Avendaño, J.M. 10. Nhivekar, G.S. and R.R. Mudholker, 2011. Bermudo “Low-Cost Speaker and Language Microcontroller based IR control Signal detector for Recognition Literature review on Automatic speech home application. Advanced in applied Science recognition by Wiqas Ghai International journal of Research, 2(4): 410-416. computers Application Vol 4 march 2012 11. Yuan Xiao, Pan Yuliang and Ling Zaiying, 2011. The 6. Mu-Chun Su and Ming-Tsang Chung, 2001. Voice- application of infrared remote controlled code locks controlled for disabled, year 2001,volume 12. in the management of industrial machine parameters. 7. Takiguchi, T., 2001. HMM -separation based speech International Conference on Electrical and Control recognition for a distant moving speaker, year 2001, Engineering, pp: 418- 421. volume 9 12. Martin Gotschlich, 2010. Remote Controls – Radio 8. Gibson, M., 2011. Unsupervised Intralingual and Frequency or Infrared, Infineon Technologies AG, cross lingual speaker adaption for hmm based speech 2010. synthesis using two pass decision tree construction, 13. Preventing IR interference between infrared waves year 2011 volume 19. emitted by high frequency florescent lighting system and infrared remote control by Kataoka,year 1997 volume 33.