1. Project Supervisor: Dr. Humera Tariq
Project Co - Supervisor: Sir Farid Alvi
Voice Controlled Robotic Car
By
Ayesha Shafique
University Of Karachi
Department Of Computer Science
2. 5/14/2018 BSCS-Final Year Project 2
“In a few years artificial intelligence virtual assistants will be as common
as the smart phone.” ~Steve jobs
3. 3
Problem Statement
Why It is important to Solve the
Chosen Problem?
Proper Block Diagram
GUI
intermediate and output images
(Important Results)
Preprocessing Details
Algorithm Description
Learning and Achievements
Improvements and Future Work
Demo and Queries
Contents
4. Problem Statement
The objective of the project is to allows
users to control the robotic vehicle
remotely by voice commands.
5/14/2018 4
Forward
BSCS-Final Year Project
5. Importance
• People who historically have difficulties with
driving, such as disabled people and older
citizens, as well as the very young, would be
able to experience the freedom of car travel.
5/14/2018 BSCS-Final Year Project 5
6. DIAGRAMMATIC FLOW OF WORK
In this Section we described with the help of Diagrams, the principles we
studied and apply to input speech to address the mentioned problem.
5/14/2018 6BSCS-Final Year Project
10. GUI
In this section we show our GUI so that viewers can understand the
interaction in between input and output.
5/14/2018 10BSCS-Final Year Project
12. RESULTS
Before getting into the theoretical/Mathematical details we will show
viewers some important results to grasp the techniques and concepts.
5/14/2018 12BSCS-Final Year Project
14. 5/14/2018 14
Lx 293D Lx 293D
Door
motor
Turn
motor
Driver
motor
BREADBOARD CIRCUIT DESIGN IMPLEMENTATION
Power
Supply
Common
Ground
BSCS-Final Year Project
17. PREPROCESSING
In this Section we described the preprocessing applied on input speech
along with necessary code.
5/14/2018 17BSCS-Final Year Project
18. 5/14/2018 BSCS-514 Computer Graphics 18
CONVERSION TO WAV
File received from smart phone is of
‘.3gpp’,so to process we’ve to convert
it in ‘.wav’.
3gpp
21. ALGORITHM DESCRIPTION
Here we describes the key steps of algorithm that we used for Feature
Extraction and speech recognition
5/14/2018 21BSCS-Final Year Project
23. 5/14/2018 23
It convert a speech signal into a sequence of
acoustic feature vector to identify the components
of linguistic content and discarding all the other stuff
which carries information like background noise,
emotion etc.
MEL FREQUENCY CEPSTRAL COEFFICIENT (MFCC)
BSCS-Final Year Project
24. 5/14/2018 24
HMM creates stochastic models from known utterances
and compares the probability that the unknown
utterance was generated by each model.
HIDDEN MARKOV MODEL (HMM)
HMM can be characterized by
following:
• Transition probability matrix
• Emission probability matrix
• Initial probability matrix
BSCS-Final Year Project
25. 5/14/2018 BSCS-514 Computer Graphics 25
Evaluation:
• Problem - Compute Probability of observation
sequence given a model
• Solution - Forward Algorithm and Viterbi Algorithm
Decoding:
• Problem - Find state sequence which maximizes
probability of observation sequence
• Solution - Viterbi Algorithm
Training:
• Problem - Adjust model parameters to maximize
probability of observed sequences
• Solution - Forward-Backward Algorithm
HMM PROBLEMS AND SOLUTIONS
26. LEARNING AND ACHIEVEMENTS
In this section we describe the things that makes us feel that we have
learned, we did something.
5/14/2018 26BSCS-Final Year Project
27. 5/14/2018 27
LEARNING AND ACHIEVEMENTS
• Never ever bound your skill set to any
specific language
Select 1 or more language(s)
Map idea
BSCS-Final Year Project
31. IMPROVEMENTS AND FUTURE
WORK
In this section we mention the tasks that are still unresolved. The
exceptional cases where still improvement is needed. We also mention
the applications that can build on top of our current work.
5/14/2018 31BSCS-Final Year Project
32. 5/14/2018 32
• Need improvements in accuracy of speech
recognizer to make it works in noisy
environment.
• Use of sensors and radar to detect objects and
Cars’ surroundings as
Google’s self-driving car
project, relies heavily on LiDAR
and other sensors to
detect the presence of objects
in it’s vicinity.
IMPROVEMENTS
BSCS-Final Year Project
33. 5/14/2018 33
FUTURE WORK
Same idea can be mapped for controlling a
wheelchair by means of human voice.
It enables a disabled person to move
around independently,
using a speech recognition
application which is
interfaced with motors.
Just we have to change
the hardware‘s framework.
BSCS-Final Year Project
In this slide, I will tell that I haven’t thought my idea in a language box. Just think about your idea, design your idea, free from language dependency. Adopt any language to implement your idea, if something is difficult in language A and easier in language B, do that work in B.
This is my point from my very little experience and learning.