VOICE CONTROLLED WHEELCHAIR using Amharic.pdf

BAHIR DAR INSTITUTE OF TECHNOLOGY
SCHOOL OF ELECTRICAL ENGINEERING
FINAL YEAR PROJECT PRESENTATION :
VOICE CONTROLLED WHEELCHAIR
Members:
Matebie Gashu (500/02)
Mengistu Baye (1307/03)
Mignot Ayana (1373/03)
Mubarek Kebede (14bb/03)
Supervisor :
Mr. : Enyachewu

PRESENTATION OUTLINE
 introduction mengistu
 Voice recognition Mubarek
 Speaker recognition
 control system Mignot
 Result and conclusion matebie

INTRODUCTION
 Controlling device with speech is used for many
application
 To recognize speech and speaker first the
feature of the speech must be extracted
 the best way to extract speech feature is
MFCC
o After feature extraction we classify the speech
and speaker using artificial neural net work

OBJECTIVE OF THE PROJECT
 General Objective
 Design control system of voice controlled
wheelchair.
 Specific Objectives
 To develop speech and speaker recognition
system
 To develop c code for Arduino which control
motor
 To interface Ultrasonic sensor to Arduino

.
Literature review
Study mathematical
modeling
Training with neural
network
Do speaker recognition
Voice data collection
Feature extraction
Is test
good?
predict
Do speech recognition

 Voice recognition part of our project mainly done
in semester project
 what we did is improving efficiency

 Ways of efficacy improvement
 extracting additional feature
 short time energy
 Zero crossing point
 Adjusting good recording time
 And adjusting suppling frequency

 The speaker recognition part is used for change
the system from speaker independent to speaker
dependent

CAPTURE AND PREPROCESSING
 Get the audio signal.
 Make suitable for feature extraction
Capture
Silence
Removal
Pre-Emphasis
Framing
Windowing

FEATURE EXTRACTION
 Transform the input audio signal into a
sequence of acoustic feature vectors
 MFCC : Mel Filter Cepstral Coefficients
as Feature
 Perceptual approach
 Human Ear processes audio signal in Mel
scale
 Mel scale : linear up to 1KHz and
logarithmic after 1KHz
 MFCC gives distribution of energy in Mel
frequency band
 Calculated for each frame
Fourier Transform
Mel Filter
Log
IFT : DCT
Energy and Deltas

COMPOSITION OF FEATURE VECTOR
 20 MFCC Features
 20 Delta MFCC

LEARNING PHASE
Artifial neural network (ANN)

DESIGN CONSIDERATION OF ANN
Sigmoid activation function
Activation function
Number of input unit
Number of hidden
layer
Number of units per
layer
Number of out put

DESIGN CONSIDERATION OF ANN
 The number of input units are the
the size of the external tigers
 in our case the input units are
40 coeffecentes
Activation function
Number of hidden
layer
Number of units per
layer
Number of out put

DESIGN CONSIDERATION
The number of hidden layer affect
learning acurecy
High number hidden layer lead
over fit
High mathematical cost
Small number of hidden layer may
unable to learn
Activation function
Number of hidden
layer
Number of units per
layer
Number of out put

 Number of units per layer
 High number of unit per layer
 running cost increase
 It may lead over fit
o Small number of units per layer
 Leads under fit
Activation function
Number of hidden
layer
Number of units per
layer
Number of out put

 number of out put unit
the number of the out put unit
in our case 2
default and
User
Activation function
Number of hidden
layer
Number of units per
layer
Number of out put

MODEL
 Based on the above design criteria
We select our model

FEEDFORWARD
 feedforward are used to propagate the input
forward to the hypothesis

COST
 The root mean square error (RMSE) is
a frequently-used measure of the
differences between values predicted by a
model or an estimator and the values
actually observed from the thing being
modelled or estimated

BACK PROPAGATION AND GRADIENT
 Backpropagation is used for propagating the
error from out put to input
 Backpropagation performs a gradient descent
within the solution's vector space towards a
global minimum.
 The gradient from the backpropagation given to
the fimncg bulletin fanction
 Fmincg updating the weights according to the
cost and out put the optimized weights

BLOCK DIAGRAM
MIC
VOICE
RECOGNITION
IC
Arduino
RIGHT
MOTOR
BATTERY
LEFT
MOTOR
H_bridge
Ultrasonic
Alarm and
lighting

PARTS OF THE SYSTEM
 In our real system we will use
HM2007 DSP KIT
 It has external interface circuit
 It is dual processing unit
 Good relatively in its speed
 Since can't found in local market
We use normal pc
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL

 Control unit are the hart of all
control system
In our case we use Arduino uno
 It has 6 analog input and
14 digital i/o
 It has serial communication pin
from external device
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL

 Obstacle sensor detect presence or
absence of obstacle
In our case we use ultrasonic sensor
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL

 H-bridge used to control the
direction of the motor
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL

Current
• One switch is closed in each leg of the "H"
• One switch is open in each leg of the "H"

Current
• One switch is closed in each leg of the "H"
• One switch is open in each leg of the "H”
turn in the Other Direction

 The speed of the dc motor can
be controlled using PWM from
Arduino
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL

Start
serial
data
avail
able
Data
range
Is
data<mi
n data?
Read sensor
data
Data={1,2,3,4,5}
Data={6,7,8}
Change
direction
Chang speed
stop
no
yes

RESULT AND DESICCATION
 We develop speech and speaker recognition system
 In last we train our data set and gate good accuracy
 Voice recognition 97 for training 88% for test
 Speaker 99 % for training 94 for test
 And we design the control system
 Final we stimulate and we do well

PROBLEM FACED
 In this project time we face many problems
 there was no lab which for doing the project so
there was deficiency of computer
 Adviser even if their is there is no follow up
 bad view of the staff member towards as

 we will implement the hard ware
properly
It can improved its efficacy by using
vector machine learning
Additional value can be added like
RF transceiver to control remotely

CONCLUSION
 From this work we conclude that
 MFCC is fine for speech feature extraction
 Neural network is good for isolated word recognition
In general we conclude that our project is feasible for
control wheelchair movement control.

VOICE CONTROLLED WHEELCHAIR using Amharic.pdf

Recommended

Recommended

More Related Content

Similar to VOICE CONTROLLED WHEELCHAIR using Amharic.pdf

Similar to VOICE CONTROLLED WHEELCHAIR using Amharic.pdf (20)

Recently uploaded

Recently uploaded (20)

VOICE CONTROLLED WHEELCHAIR using Amharic.pdf