SlideShare a Scribd company logo
1 of 39
Download to read offline
BAHIR DAR INSTITUTE OF TECHNOLOGY
SCHOOL OF ELECTRICAL ENGINEERING
FINAL YEAR PROJECT PRESENTATION :
VOICE CONTROLLED WHEELCHAIR
Members:
Matebie Gashu (500/02)
Mengistu Baye (1307/03)
Mignot Ayana (1373/03)
Mubarek Kebede (14bb/03)
Supervisor :
Mr. : Enyachewu
PRESENTATION OUTLINE
 introduction mengistu
 Voice recognition Mubarek
 Speaker recognition
 control system Mignot
 Result and conclusion matebie
INTRODUCTION
 Controlling device with speech is used for many
application
 To recognize speech and speaker first the
feature of the speech must be extracted
 the best way to extract speech feature is
MFCC
o After feature extraction we classify the speech
and speaker using artificial neural net work
OBJECTIVE OF THE PROJECT
 General Objective
 Design control system of voice controlled
wheelchair.
 Specific Objectives
 To develop speech and speaker recognition
system
 To develop c code for Arduino which control
motor
 To interface Ultrasonic sensor to Arduino
.
Literature review
Study mathematical
modeling
Training with neural
network
Do speaker recognition
Voice data collection
Feature extraction
Is test
good?
predict
Do speech recognition
Voice recognition
 Voice recognition part of our project mainly done
in semester project
 what we did is improving efficiency
 Ways of efficacy improvement
 extracting additional feature
 short time energy
 Zero crossing point
 Adjusting good recording time
 And adjusting suppling frequency
Speaker recognition
 The speaker recognition part is used for change
the system from speaker independent to speaker
dependent
CAPTURE AND PREPROCESSING
 Get the audio signal.
 Make suitable for feature extraction
Capture
Silence
Removal
Pre-Emphasis
Framing
Windowing
FEATURE EXTRACTION
 Transform the input audio signal into a
sequence of acoustic feature vectors
 MFCC : Mel Filter Cepstral Coefficients
as Feature
 Perceptual approach
 Human Ear processes audio signal in Mel
scale
 Mel scale : linear up to 1KHz and
logarithmic after 1KHz
 MFCC gives distribution of energy in Mel
frequency band
 Calculated for each frame
Fourier Transform
Mel Filter
Log
IFT : DCT
Energy and Deltas
COMPOSITION OF FEATURE VECTOR
 20 MFCC Features
 20 Delta MFCC
LEARNING PHASE
Artifial neural network (ANN)
DESIGN CONSIDERATION OF ANN
Sigmoid activation function
Activation function
Number of input unit
Number of hidden
layer
Number of units per
layer
Number of out put
DESIGN CONSIDERATION OF ANN
 The number of input units are the
the size of the external tigers
 in our case the input units are
40 coeffecentes
Activation function
Number of input unit
Number of hidden
layer
Number of units per
layer
Number of out put
DESIGN CONSIDERATION
The number of hidden layer affect
learning acurecy
High number hidden layer lead
over fit
High mathematical cost
Small number of hidden layer may
unable to learn
Activation function
Number of input unit
Number of hidden
layer
Number of units per
layer
Number of out put
DESIGN CONSIDERATION
 Number of units per layer
 High number of unit per layer
 running cost increase
 It may lead over fit
o Small number of units per layer
 Leads under fit
Activation function
Number of input unit
Number of hidden
layer
Number of units per
layer
Number of out put
DESIGN CONSIDERATION
 number of out put unit
the number of the out put unit
in our case 2
default and
User
Activation function
Number of input unit
Number of hidden
layer
Number of units per
layer
Number of out put
MODEL
 Based on the above design criteria
We select our model
FEEDFORWARD
 feedforward are used to propagate the input
forward to the hypothesis
COST
 The root mean square error (RMSE) is
a frequently-used measure of the
differences between values predicted by a
model or an estimator and the values
actually observed from the thing being
modelled or estimated
BACK PROPAGATION AND GRADIENT
 Backpropagation is used for propagating the
error from out put to input
 Backpropagation performs a gradient descent
within the solution's vector space towards a
global minimum.
 The gradient from the backpropagation given to
the fimncg bulletin fanction
 Fmincg updating the weights according to the
cost and out put the optimized weights
ANN ALGORITHM
Hard ware control
design part
BLOCK DIAGRAM
MIC
VOICE
RECOGNITION
IC
Arduino
RIGHT
MOTOR
BATTERY
LEFT
MOTOR
H_bridge
Ultrasonic
Alarm and
lighting
PARTS OF THE SYSTEM
 In our real system we will use
HM2007 DSP KIT
 It has external interface circuit
 It is dual processing unit
 Good relatively in its speed
 Since can't found in local market
We use normal pc
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL
 Control unit are the hart of all
control system
In our case we use Arduino uno
 It has 6 analog input and
14 digital i/o
 It has serial communication pin
from external device
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL
 Obstacle sensor detect presence or
absence of obstacle
In our case we use ultrasonic sensor
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL
 H-bridge used to control the
direction of the motor
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL
Current
• One switch is closed in each leg of the "H"
• One switch is open in each leg of the "H"
Current
• One switch is closed in each leg of the "H"
• One switch is open in each leg of the "H”
turn in the Other Direction
 The speed of the dc motor can
be controlled using PWM from
Arduino
VOICE RECOGNITION
IC
CONTROL UNIT
OBSTACLE
SENSOR
MOTOR
DIRECTION
CONTROL
MOTOR SPEED
CONTROL
Start
serial
data
avail
able
Data
range
Is
data<mi
n data?
Read sensor
data
Data={1,2,3,4,5}
Data={6,7,8}
Change
direction
Chang speed
stop
no
yes
RESULT AND DESICCATION
 We develop speech and speaker recognition system
 In last we train our data set and gate good accuracy
 Voice recognition 97 for training 88% for test
 Speaker 99 % for training 94 for test
 And we design the control system
 Final we stimulate and we do well
PROBLEM FACED
 In this project time we face many problems
 there was no lab which for doing the project so
there was deficiency of computer
 Adviser even if their is there is no follow up
 bad view of the staff member towards as
 we will implement the hard ware
properly
It can improved its efficacy by using
vector machine learning
Additional value can be added like
RF transceiver to control remotely
CONCLUSION
 From this work we conclude that
 MFCC is fine for speech feature extraction
 Neural network is good for isolated word recognition
In general we conclude that our project is feasible for
control wheelchair movement control.
Thanks

More Related Content

Similar to VOICE CONTROLLED WHEELCHAIR using Amharic.pdf

VOICE OPERATED WHEELCHAIR
VOICE OPERATED WHEELCHAIRVOICE OPERATED WHEELCHAIR
VOICE OPERATED WHEELCHAIR
Mandar Jadhav
 
Resume for Embedded Engineer_1
Resume for Embedded Engineer_1Resume for Embedded Engineer_1
Resume for Embedded Engineer_1
gajendra parmar
 
Dual tone multiple frequency
Dual tone multiple frequencyDual tone multiple frequency
Dual tone multiple frequency
Neeraj Khatri
 
IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)
IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)
IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)
SreekarV
 

Similar to VOICE CONTROLLED WHEELCHAIR using Amharic.pdf (20)

Multirate signal processing and decimation interpolation
Multirate signal processing and decimation interpolationMultirate signal processing and decimation interpolation
Multirate signal processing and decimation interpolation
 
Sensor id overview_ppt
Sensor id overview_pptSensor id overview_ppt
Sensor id overview_ppt
 
My Project
My ProjectMy Project
My Project
 
My Project
My ProjectMy Project
My Project
 
A review of Noise Suppression Technology for Real-Time Speech Enhancement
A review of Noise Suppression Technology for Real-Time Speech EnhancementA review of Noise Suppression Technology for Real-Time Speech Enhancement
A review of Noise Suppression Technology for Real-Time Speech Enhancement
 
IRJET- Hearing Loss Detection through Audiogram in Mobile Devices
IRJET-  	  Hearing Loss Detection through Audiogram in Mobile DevicesIRJET-  	  Hearing Loss Detection through Audiogram in Mobile Devices
IRJET- Hearing Loss Detection through Audiogram in Mobile Devices
 
Heart rate monitor system
Heart rate monitor systemHeart rate monitor system
Heart rate monitor system
 
Getting Started with TDS1000B / 2000B Digital Phosphor Oscilloscope Series
Getting Started with TDS1000B / 2000B  Digital Phosphor Oscilloscope SeriesGetting Started with TDS1000B / 2000B  Digital Phosphor Oscilloscope Series
Getting Started with TDS1000B / 2000B Digital Phosphor Oscilloscope Series
 
Introduction to Blackfin BF532 DSP
Introduction to Blackfin BF532 DSPIntroduction to Blackfin BF532 DSP
Introduction to Blackfin BF532 DSP
 
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and SynthesizerIRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
 
IRJET-Home Automation System Based on Voice Recognition
IRJET-Home Automation System Based on Voice RecognitionIRJET-Home Automation System Based on Voice Recognition
IRJET-Home Automation System Based on Voice Recognition
 
Intelligence System for Disfluency
Intelligence System for DisfluencyIntelligence System for Disfluency
Intelligence System for Disfluency
 
Text independent speaker recognition system
Text independent speaker recognition systemText independent speaker recognition system
Text independent speaker recognition system
 
VOICE OPERATED WHEELCHAIR
VOICE OPERATED WHEELCHAIRVOICE OPERATED WHEELCHAIR
VOICE OPERATED WHEELCHAIR
 
Resume for Embedded Engineer_1
Resume for Embedded Engineer_1Resume for Embedded Engineer_1
Resume for Embedded Engineer_1
 
IOT BASED VIBRATION VEST FOR DEAF PEOPLE
IOT BASED VIBRATION VEST FOR DEAF PEOPLEIOT BASED VIBRATION VEST FOR DEAF PEOPLE
IOT BASED VIBRATION VEST FOR DEAF PEOPLE
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Dual tone multiple frequency
Dual tone multiple frequencyDual tone multiple frequency
Dual tone multiple frequency
 
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABA GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
 
IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)
IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)
IJARCCE4J s govinda Public Transport Assistant for Visually Impaired (PuTAVI)
 

Recently uploaded

TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Recently uploaded (20)

Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

VOICE CONTROLLED WHEELCHAIR using Amharic.pdf

  • 1. BAHIR DAR INSTITUTE OF TECHNOLOGY SCHOOL OF ELECTRICAL ENGINEERING FINAL YEAR PROJECT PRESENTATION : VOICE CONTROLLED WHEELCHAIR Members: Matebie Gashu (500/02) Mengistu Baye (1307/03) Mignot Ayana (1373/03) Mubarek Kebede (14bb/03) Supervisor : Mr. : Enyachewu
  • 2. PRESENTATION OUTLINE  introduction mengistu  Voice recognition Mubarek  Speaker recognition  control system Mignot  Result and conclusion matebie
  • 3. INTRODUCTION  Controlling device with speech is used for many application  To recognize speech and speaker first the feature of the speech must be extracted  the best way to extract speech feature is MFCC o After feature extraction we classify the speech and speaker using artificial neural net work
  • 4. OBJECTIVE OF THE PROJECT  General Objective  Design control system of voice controlled wheelchair.  Specific Objectives  To develop speech and speaker recognition system  To develop c code for Arduino which control motor  To interface Ultrasonic sensor to Arduino
  • 5. . Literature review Study mathematical modeling Training with neural network Do speaker recognition Voice data collection Feature extraction Is test good? predict Do speech recognition
  • 7.  Voice recognition part of our project mainly done in semester project  what we did is improving efficiency
  • 8.  Ways of efficacy improvement  extracting additional feature  short time energy  Zero crossing point  Adjusting good recording time  And adjusting suppling frequency
  • 10.  The speaker recognition part is used for change the system from speaker independent to speaker dependent
  • 11. CAPTURE AND PREPROCESSING  Get the audio signal.  Make suitable for feature extraction Capture Silence Removal Pre-Emphasis Framing Windowing
  • 12. FEATURE EXTRACTION  Transform the input audio signal into a sequence of acoustic feature vectors  MFCC : Mel Filter Cepstral Coefficients as Feature  Perceptual approach  Human Ear processes audio signal in Mel scale  Mel scale : linear up to 1KHz and logarithmic after 1KHz  MFCC gives distribution of energy in Mel frequency band  Calculated for each frame Fourier Transform Mel Filter Log IFT : DCT Energy and Deltas
  • 13. COMPOSITION OF FEATURE VECTOR  20 MFCC Features  20 Delta MFCC
  • 15. DESIGN CONSIDERATION OF ANN Sigmoid activation function Activation function Number of input unit Number of hidden layer Number of units per layer Number of out put
  • 16. DESIGN CONSIDERATION OF ANN  The number of input units are the the size of the external tigers  in our case the input units are 40 coeffecentes Activation function Number of input unit Number of hidden layer Number of units per layer Number of out put
  • 17. DESIGN CONSIDERATION The number of hidden layer affect learning acurecy High number hidden layer lead over fit High mathematical cost Small number of hidden layer may unable to learn Activation function Number of input unit Number of hidden layer Number of units per layer Number of out put
  • 18. DESIGN CONSIDERATION  Number of units per layer  High number of unit per layer  running cost increase  It may lead over fit o Small number of units per layer  Leads under fit Activation function Number of input unit Number of hidden layer Number of units per layer Number of out put
  • 19. DESIGN CONSIDERATION  number of out put unit the number of the out put unit in our case 2 default and User Activation function Number of input unit Number of hidden layer Number of units per layer Number of out put
  • 20. MODEL  Based on the above design criteria We select our model
  • 21. FEEDFORWARD  feedforward are used to propagate the input forward to the hypothesis
  • 22. COST  The root mean square error (RMSE) is a frequently-used measure of the differences between values predicted by a model or an estimator and the values actually observed from the thing being modelled or estimated
  • 23. BACK PROPAGATION AND GRADIENT  Backpropagation is used for propagating the error from out put to input  Backpropagation performs a gradient descent within the solution's vector space towards a global minimum.  The gradient from the backpropagation given to the fimncg bulletin fanction  Fmincg updating the weights according to the cost and out put the optimized weights
  • 27. PARTS OF THE SYSTEM  In our real system we will use HM2007 DSP KIT  It has external interface circuit  It is dual processing unit  Good relatively in its speed  Since can't found in local market We use normal pc VOICE RECOGNITION IC CONTROL UNIT OBSTACLE SENSOR MOTOR DIRECTION CONTROL MOTOR SPEED CONTROL
  • 28.  Control unit are the hart of all control system In our case we use Arduino uno  It has 6 analog input and 14 digital i/o  It has serial communication pin from external device VOICE RECOGNITION IC CONTROL UNIT OBSTACLE SENSOR MOTOR DIRECTION CONTROL MOTOR SPEED CONTROL
  • 29.  Obstacle sensor detect presence or absence of obstacle In our case we use ultrasonic sensor VOICE RECOGNITION IC CONTROL UNIT OBSTACLE SENSOR MOTOR DIRECTION CONTROL MOTOR SPEED CONTROL
  • 30.  H-bridge used to control the direction of the motor VOICE RECOGNITION IC CONTROL UNIT OBSTACLE SENSOR MOTOR DIRECTION CONTROL MOTOR SPEED CONTROL
  • 31. Current • One switch is closed in each leg of the "H" • One switch is open in each leg of the "H"
  • 32. Current • One switch is closed in each leg of the "H" • One switch is open in each leg of the "H” turn in the Other Direction
  • 33.  The speed of the dc motor can be controlled using PWM from Arduino VOICE RECOGNITION IC CONTROL UNIT OBSTACLE SENSOR MOTOR DIRECTION CONTROL MOTOR SPEED CONTROL
  • 35. RESULT AND DESICCATION  We develop speech and speaker recognition system  In last we train our data set and gate good accuracy  Voice recognition 97 for training 88% for test  Speaker 99 % for training 94 for test  And we design the control system  Final we stimulate and we do well
  • 36. PROBLEM FACED  In this project time we face many problems  there was no lab which for doing the project so there was deficiency of computer  Adviser even if their is there is no follow up  bad view of the staff member towards as
  • 37.  we will implement the hard ware properly It can improved its efficacy by using vector machine learning Additional value can be added like RF transceiver to control remotely
  • 38. CONCLUSION  From this work we conclude that  MFCC is fine for speech feature extraction  Neural network is good for isolated word recognition In general we conclude that our project is feasible for control wheelchair movement control.