IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 02 Issue: 11 | Nov-2013, Available @ http://www.ijret.org 560
SPEAKER - INDEPENDENT VISUAL LIP ACTIVITY DETECTION FOR
HUMAN - COMPUTER INTERACTION
P. Sujatha¹, M. Radhakrishnan²
¹ Department of Computer Science and Engineering, Sudharsan Engineering College, Pudukkottai, Tamilnadu, India, suja_param@yahoo.com
² Director / IT, Sudharsan Engineering College, Pudukkottai, Tamilnadu, India, sumyukta2005@yahoo.com
Abstract
Recently there has been increased interest in using visual features for improved speech processing, and lip reading plays a vital
role in visual speech processing. In this paper, a new approach for lip reading is presented. Visual speech recognition is applied in
mobile phone applications, human-computer interaction and also to recognize the spoken words of hearing impaired persons. The
visual speech video is taken as input to a face detection module, which detects the face region. The mouth region is
identified based on the face region of interest (ROI). The mouth images are then passed to the feature extraction process. The features
are extracted using four methods: every 10th coordinate, every 16th coordinate, 16 point + Discrete Cosine Transform (DCT), and Lip
DCT. These features are then applied as inputs for recognizing the visual speech using a Hidden Markov Model (HMM). Of the
different feature extraction methods, the Lip DCT method gives the best recognition accuracy in the experiments. Ten
participants uttered 35 different isolated words. For each word, 20 samples were collected from each participant for training and
testing.
Index Terms: Feature Extraction, HMM, Mouth ROI, DCT, Visual Speech Recognition
--------------------------------------------------------------------***----------------------------------------------------------------------
1. INTRODUCTION
Visual speech recognition refers to recognizing the spoken
words based on visual lip movements. Visual speech
recognition is an area with great potential to solve
challenging problems in speech processing. Difficulties in
the audio based speech recognition system can be
significantly reduced by additional information provided by
the extra visual features. It is well known that visual speech
information from lip movement is very useful for human
speech perception. The main difficulty in incorporating
visual information into an acoustic speech recognition
method is to find a robust and accurate method for
extracting essential visual speech features.
Figure 1 illustrates the architecture of our proposed
visual speech recognition system. The recorded visual
speech video is given as input to the system. The algorithm
starts by detecting the face using the popular face detection
technique of Viola and Jones [4, 5]. After the face is detected,
the Mouth ROI is localized using a simple algorithm. The next
step is to extract the visual features of the lip region. Then,
these feature vectors are applied separately as inputs to the
HMM classifier for recognizing the spoken word.
The aim of the paper is to extract the visual lip movements
(lip features) and predict the word that is actually
pronounced. This paper is organized as follows. Section 2
describes the literature survey on extraction of visual speech
features. Section 3 describes the face localization process.
Section 4 describes the mouth ROI detection algorithm.
Section 5 explains the lip feature extraction techniques.
Section 6 explains the HMM classifier. Section 7 discusses
the database and the experimental results, and Section 8
presents the conclusion.
Fig -1: Overview of the proposed Visual Speech
Recognition system
2. LITERATURE SURVEY
An automatic speech recognizer was developed for a
speaker-dependent, continuous-speech alphanumeric
recognition application based on the European Portuguese
language [1]. The hyper column model (HCM) was used to
extract visual speech features from the input image, and the
extracted features were modeled by Gaussian distributions
through an HMM [2]. Audio-visual digit recognition using
N-best decision fusion was proposed in [3]. Viola and Jones
presented a face detector based on a machine learning
approach to visual object detection [4, 5]. The lip reading
system designed by Petajan [6] was based on geometric
features such as the mouth’s height, width, area and perimeter.
Another technique, designed by Werda [7], is the Automatic Lip
Feature Extraction prototype (ALiFE), which includes lip
localization, lip tracking, visual feature extraction and
speech unit recognition for French vowels uttered by
multiple speakers. Wang [8] introduced a region-based lip
contour extraction algorithm that uses a 16-point lip model to
describe the lip contour. A training algorithm for HMMs based
on a modified simulated annealing (SA) technique was
proposed for visual speech recognition to improve the
convergence speed and the solution quality [9]. An approach
to estimating the parameters of continuous density HMMs for
visual speech recognition was presented in [10]. In [11],
Haar features are used to train an AdaBoost classifier, and
skin and lip color separation algorithms are combined to form a
self-adaptive separation model that can dynamically
adjust constant parameters. A lip reading technique for
speech recognition using motion estimation analysis was
proposed by Matthew Ramage [13]: a user authentication
system based on password lip reading was presented, in which
motion estimation was done for lip movement image
sequences representing speech.
3. FACE LOCALIZATION
The Viola and Jones face detector is capable of processing images
rapidly while achieving high detection rates. The work is
distinguished by three key contributions. The first
contribution was the integral image, which allows the features
used by the detector to be computed very quickly. For each
pixel in the original image, there is exactly one pixel in the
integral image, whose value is the sum of the original image
values above and to the left of it. The performance can be attributed
to the use of an attentional cascade of detectors with a low
number of features, based on a natural extension of Haar
wavelets. Each detector in the cascade fits objects to
simple rectangular masks. To reduce the number of
computations while moving through the cascade, they
introduced a new image representation called the integral
image.
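
The integral image idea can be illustrated with a short Python sketch. This is not the detector's actual code: the function names `integral_image` and `rect_sum` are hypothetical, and the table is padded with one extra row and column of zeros (an implementation convenience assumed here) so that any rectangle sum needs only four lookups.

```python
def integral_image(img):
    """Return a (h+1) x (w+1) table where entry [y][x] holds the sum of
    all pixels of img that lie above row y and to the left of column x."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0  # running sum of the current row
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, top, left, height, width):
    """Sum of the pixels inside a rectangle, using four table lookups."""
    bottom, right = top + height, left + width
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


# Hypothetical 3 x 3 test image
img = [[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]]
ii = integral_image(img)
total = rect_sum(ii, 0, 0, 3, 3)         # whole image
lower_right = rect_sum(ii, 1, 1, 2, 2)   # 5 + 6 + 8 + 9
```

Because every rectangle sum costs the same four lookups regardless of its size, rectangular Haar-like features can be evaluated in constant time once the table is built.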
The second was an AdaBoost learning algorithm, which
selects a small number of critical visual features from a
large set and yields extremely efficient classifiers. The third
contribution was a method for combining increasingly
complex classifiers in a cascade, which allows background
regions of the image to be quickly discarded while spending
more computation on promising object-like regions. In this
paper, the video is captured while a person is pronouncing a
word and stored in AVI file format. Subsequently, the
video frames are grabbed and passed to the Viola and
Jones face detector, which detects the face in each frame and
highlights it inside a rectangular ROI (Region of Interest).
Fig -2: Face Localization process using AdaBoost classifier
4. MOUTH REGION OF INTEREST DETECTION
The mouth region is the visible part of the human speech
production system and holds most of the visual speech
information. It is therefore imperative for any VSR system
to detect or localize this region to capture the related
visual information, i.e., we cannot read lips without seeing
them first. Lip localization is thus an essential process
for any VSR system. Many techniques for lip detection /
localization in digital images, such as snakes, Active Shape
Models (ASM), Active Appearance Models (AAM) and
deformable templates, are model-based lip
detection methods. Image-based lip detection methods
include the use of spatial information, pixel color and
intensity, lines, corners, edges and motion.
Fig -3: Mouth ROI determination in real time Video
In this paper, an image-based lip detection method is used to
extract the mouth region. In a standard face, the mouth is
located in the lower half of the face. Based on this
concept, a ROI is set by reducing the left, width, top and
height values with respect to the face ROI. The mouth
ROI is then localized by values derived from the
calculations in Table 1. The extracted Mouth ROI is
copied into a new frame for further processing. The
diagrammatic representation of Mouth ROI extraction using
the algorithm given in Table 1 is shown in Fig. 3. The
proposed method has the advantage of providing a reliable
Mouth ROI without any geometric model assumption or
complex procedures such as corner and edge
detection. The method was evaluated on 175000 frames of
the in-house database. The experiments show that the
method localizes the mouth ROI efficiently and with high
accuracy (91.15%).
Table -1: The algorithm to extract Mouth ROI from the face
ROI
1. The frames of face ROI are grabbed and given as input
for the mouth localization and extraction.
2. Find out the values associated with Fl, Fw, Ft and Fh of
the face in the XY Plane where,
Fl – Left value of the face ROI
Fw – Width value of the face ROI
Ft – Top value of the face ROI
Fh – Height value of the face ROI
3. The mouth ROI is extracted as per the following
calculations,
Ml = Fl + (Fw – Fl) / 4 (1)
Mw = Fw – (Fw – Fl)/ 4 (2)
Mt = Ft + (2*(Fh – Ft)) / 3 (3)
Mh = Fh – (Fh – Ft)/ 15 (4)
Ml = left of the mouth ROI
Mw = Width of the mouth ROI
Mt = Top of the Mouth ROI
Mh = height of the Mouth ROI
4. Ml, Mw, Mt and Mh are the values used to localize the
mouth ROI.
5. Repeat the steps 2, 3 and 4 for all the frames until the
video ends.
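
The steps of Table 1 can be sketched in Python as follows. This is a minimal illustration rather than the authors' implementation: the function name `mouth_roi` and the sample face rectangle are hypothetical, and Equations (1)-(4) are applied exactly as printed. Because the equations subtract Fl from Fw and Ft from Fh, the sketch assumes that Fw and Fh act as the right-edge and bottom-edge coordinates of the face rectangle.

```python
def mouth_roi(fl, fw, ft, fh):
    """Apply Equations (1)-(4) of Table 1 to one face ROI.

    Assumption: fw and fh are treated as right-edge and bottom-edge
    coordinates, since the printed equations subtract fl and ft from them.
    """
    ml = fl + (fw - fl) / 4        # Eq. (1): left of the mouth ROI
    mw = fw - (fw - fl) / 4        # Eq. (2): width value of the mouth ROI
    mt = ft + (2 * (fh - ft)) / 3  # Eq. (3): top of the mouth ROI
    mh = fh - (fh - ft) / 15       # Eq. (4): height value of the mouth ROI
    return ml, mw, mt, mh


# Hypothetical face ROIs for two frames (step 5 repeats this per frame)
face_rois = [(0, 120, 0, 150), (40, 160, 30, 180)]
mouth_rois = [mouth_roi(*roi) for roi in face_rois]
```

For the first hypothetical face rectangle, the mouth ROI keeps the middle half of the face horizontally and starts two thirds of the way down, which matches the observation that the mouth lies in the lower half of the face.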
Compared to other similar algorithms, the solution proposed
here has the advantage of providing a reliable lip contour
without any geometric model assumption or complex
procedures such as edge detection. This
approach will be helpful for research
that involves extraction of the outer lip contour, such as lip
reading.
5. FEATURE EXTRACTION TECHNIQUES
VSR systems require the analysis of feature vectors,
which are extracted from the speech-related visual signals in
the sequence of frames of the speaker's face while the
words are uttered. To find a signal or signature for each word,
we need a proper way of extracting the most relevant
features, which play an important role in recognizing that
word.
The frame containing only the mouth (Mouth ROI) is
subjected to image enhancement to improve the quality of the
image for further processing. The enhancement starts by
increasing or decreasing the brightness or contrast of the
image. The enhanced image serves as the input for
thresholding, where the lip region is separated from the
background. In this paper, adaptive thresholding is used for
generating the lip region from the Mouth ROI frame.
Adaptive thresholding takes a color image as input and, in the
simplest implementation, outputs a binary image
representing the segmentation. A threshold is calculated for
each pixel in the image. If the pixel value is below the
threshold, it is set to the background value (white);
otherwise it takes the foreground value (black). The
thresholded image is enlarged to 200 x 200 pixels for better
processing. The resulting frame after thresholding is a mass
of lip contour points, from which the feature points of the outer
contour are extracted for both the upper and lower lips.
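
A per-pixel threshold of this kind can be sketched in Python as shown below. The paper does not state how the local threshold is computed, so this sketch assumes the mean of a small neighbourhood, works on a grayscale image rather than a color one, and uses hypothetical names (`adaptive_threshold`, `window`, `offset`). The white-background / black-foreground polarity follows the description above.

```python
WHITE, BLACK = 255, 0


def adaptive_threshold(gray, window=3, offset=0):
    """Binarize a grayscale image with a locally computed threshold.

    Assumption: the threshold for each pixel is the mean of its
    window x window neighbourhood minus an optional offset.
    """
    h, w = len(gray), len(gray[0])
    r = window // 2
    out = [[WHITE] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Collect the neighbourhood, clipped at the image border
            vals = [gray[j][i]
                    for j in range(max(0, y - r), min(h, y + r + 1))
                    for i in range(max(0, x - r), min(w, x + r + 1))]
            local_t = sum(vals) / len(vals) - offset
            # Below the threshold: background (white); otherwise foreground (black)
            out[y][x] = WHITE if gray[y][x] < local_t else BLACK
    return out


# Hypothetical 3 x 3 patch with one bright pixel in the middle
patch = [[10, 10, 10],
         [10, 200, 10],
         [10, 10, 10]]
binary = adaptive_threshold(patch)
```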
The point of interest (POI) is detected by projecting the
final contour onto the horizontal and vertical axes. The following
feature extraction methods are proposed; each operates on
the sequence of lip contour points of the
Mouth ROI while the words are uttered.
(i) Every 10th Coordinate Method - From the mass of lip
contour points, every 10th coordinate is selected.
The feature points are selected from top to bottom
and left to right, between the starting and ending positions
of the lip contour x, y coordinates.
(ii) Every 16th Coordinate Method - From the mass of lip
contour points, 16 coordinates are considered as
feature vectors. Relative to the center of the lip, the left, right,
top and bottom points of the contour are selected, along with
the midpoints between them (left to top, top to
right, right to bottom and bottom to left) as x, y
coordinates. In addition, the midpoints
between those selected contour
points are also selected. The normalized distance from
the center point of the lip is computed for the 16
coordinates and used as the feature vector.
(iii) 16 Point + DCT Method - The Discrete
Cosine Transform is applied to the 16 coordinates obtained
from method (ii), and the results are considered as
feature vectors.
(iv) DCT Method for Entire Lip Region (Lip DCT) - The entire lip
region is selected as the feature vector. The
Discrete Cosine Transform of the entire lip region
coordinates is calculated and used as the feature
points.
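
The two coordinate-based selections can be sketched in Python as follows. The helper names `every_nth` and `normalized_distances` are hypothetical, and two details are assumptions because the text does not fix them: the contour is ordered by (y, x) to approximate "top to bottom and left to right", and distances are normalized by the largest distance from the lip center.

```python
import math


def every_nth(points, n=10):
    """Keep every n-th contour point after ordering the points
    top to bottom, then left to right (assumed ordering)."""
    ordered = sorted(points, key=lambda p: (p[1], p[0]))
    return ordered[::n]


def normalized_distances(points, center):
    """Distance of each selected point from the lip center, scaled so
    that the farthest point has distance 1 (assumed normalization)."""
    cx, cy = center
    dists = [math.hypot(x - cx, y - cy) for x, y in points]
    largest = max(dists)
    return [d / largest for d in dists] if largest > 0 else dists


# Hypothetical contour of 25 points on one row
contour = [(x, 0) for x in range(25)]
sampled = every_nth(contour, n=10)

# Hypothetical selected points around a lip center at the origin
features = normalized_distances([(3, 4), (0, 10), (6, 8)], center=(0, 0))
```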
The discrete cosine transform (DCT) method is used to
separate the image into parts of differing importance (with
respect to the image's visual quality). The DCT is similar to
the Discrete Fourier Transform: it transforms a signal or
image from the spatial domain to the frequency domain.
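The general 2D DCT of an N by M image is
defined by the following equation:
$$F(u,v) = \sqrt{\frac{2}{N}}\,\sqrt{\frac{2}{M}}\;\Lambda(u)\,\Lambda(v)\sum_{i=0}^{N-1}\sum_{j=0}^{M-1} f(i,j)\,\cos\left[\frac{(2i+1)\,u\,\pi}{2N}\right]\cos\left[\frac{(2j+1)\,v\,\pi}{2M}\right] \qquad (5)$$
where
$$\Lambda(\xi) = \begin{cases} 1/\sqrt{2}, & \xi = 0 \\ 1, & \text{otherwise} \end{cases}$$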
The basic operation of the DCT is as follows:
• The input image is N by M.
• f(i,j) is the intensity of the pixel in row i and column j;
• F(u,v) is the DCT coefficient in row u and column v of the
DCT matrix.
• For most images, much of the signal energy lies at low
frequencies; these appear in the upper left corner of the
DCT.
• Compression is achieved since the lower right values
represent higher frequencies, and are often small -
small enough to be neglected with little visible
distortion.
• The DCT input is an 8 by 8 array of integers. This
array contains each pixel's gray scale level;
• 8 bit pixels have levels from 0 to 255.
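
The properties listed above can be checked with a direct Python sketch of the standard orthonormal 2D DCT-II, which is assumed here to be the form intended by Eq. (5). The function name `dct2` is hypothetical, and the quadruple loop is written for clarity, not speed.

```python
import math


def dct2(block):
    """Direct 2D DCT-II of an N x M block (orthonormal scaling)."""
    n, m = len(block), len(block[0])

    def lam(k):
        # Scaling factor for the zero-frequency row or column
        return 1 / math.sqrt(2) if k == 0 else 1.0

    out = [[0.0] * m for _ in range(n)]
    for u in range(n):
        for v in range(m):
            s = 0.0
            for i in range(n):
                for j in range(m):
                    s += (block[i][j]
                          * math.cos((2 * i + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * j + 1) * v * math.pi / (2 * m)))
            out[u][v] = math.sqrt(2 / n) * math.sqrt(2 / m) * lam(u) * lam(v) * s
    return out


# Hypothetical flat 8 x 8 block: all energy should land in F(0, 0)
flat = [[1] * 8 for _ in range(8)]
coeffs = dct2(flat)
```

For a flat block, every coefficient except the upper-left one is zero up to rounding error, which is the extreme case of signal energy concentrating at low frequencies.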
6. HIDDEN MARKOV MODEL
A hidden Markov model (HMM) is denoted by the equation:
λ= (Π, A, B) (6)
where Π is the initial state distribution, A is the state
transition matrix and B is the emission probability matrix.
The emission probability matrix specifies, for each state, a
probability distribution over the output alphabet. The output
alphabet need not be the same as the state space.
Denoting the output alphabet by θ = {1, 2, ..., M}, we get a
matrix with N rows (one per state) and M columns,
$$B = \begin{bmatrix} b_1(1) & b_1(2) & \cdots & b_1(M) \\ \vdots & \vdots & \ddots & \vdots \\ b_N(1) & b_N(2) & \cdots & b_N(M) \end{bmatrix} \qquad (7)$$
where b_i(k) is the probability of symbol k being emitted
from state i. The emission probability matrix is another
stochastic matrix, in the sense that each row sums to one
and all elements are greater than or equal to zero. An HMM
poses three basic problems:
(i) Evaluation, or computing P(Observations |
Model): This allows us to find out how well a model
matches a given observation sequence. The main concern
here is computational efficiency: finding an algorithm
with only polynomial running time.
(ii) Decoding, or finding the hidden state
sequence that best corresponds to the observed symbols:
Because there are generally many state sequences that give rise to
the same symbols, there is no "correct" solution to be found
in most cases. Thus, some optimality criterion must be
chosen. The most widely used criterion is to find a path
through the model that maximizes P(Path | Observations,
Model).
(iii) Training or learning: Finding the model
parameter values λ = (Π, A, B) that specify a model most
likely to produce a given sequence of training data. In other
words, the objective is to construct a model that best fits the
training data (or best represents the source that produced the
data). There is no known way to solve analytically for the
best model, but there is an iterative algorithm that often yields
sufficiently good approximations. The training problem for
hidden Markov models is to estimate the transition
probabilities, the initial state distribution and the emission
probability distributions from sample data.
The feature vectors are trained and tested using the HMM
classifier.
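
The evaluation and decoding problems can be illustrated with a small Python sketch for a discrete HMM λ = (Π, A, B). The forward algorithm and the Viterbi algorithm used here are the standard solutions to problems (i) and (ii); the two-state model and the observation sequence are hypothetical toy values, not parameters from this work.

```python
def forward_likelihood(pi, A, B, obs):
    """Problem (i): P(Observations | Model) via the forward algorithm."""
    n = len(pi)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    for o in obs[1:]:
        alpha = [sum(alpha[i] * A[i][j] for i in range(n)) * B[j][o]
                 for j in range(n)]
    return sum(alpha)


def viterbi(pi, A, B, obs):
    """Problem (ii): most probable state path and its probability."""
    n = len(pi)
    delta = [pi[i] * B[i][obs[0]] for i in range(n)]
    backpointers = []
    for o in obs[1:]:
        prev, delta, step = delta, [], []
        for j in range(n):
            best_i = max(range(n), key=lambda i: prev[i] * A[i][j])
            step.append(best_i)
            delta.append(prev[best_i] * A[best_i][j] * B[j][o])
        backpointers.append(step)
    # Trace the best path backwards from the most probable final state
    path = [max(range(n), key=lambda i: delta[i])]
    for step in reversed(backpointers):
        path.append(step[path[-1]])
    path.reverse()
    return path, max(delta)


# Hypothetical toy model: 2 states, 3 output symbols
pi = [0.6, 0.4]
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
obs = [0, 1, 2]

likelihood = forward_likelihood(pi, A, B, obs)
best_path, best_prob = viterbi(pi, A, B, obs)
```

Both routines run in time proportional to the sequence length times the square of the number of states, which is the polynomial running time that problem (i) asks for.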
7. EXPERIMENTAL RESULTS
The in-house videos were recorded inside a normal room
using a web camera. The participants were 4 females and 6
males, distributed over different age groups. The videos were
recorded at 25 frames per second, stored in AVI format
and resized to 320 x 240 pixels, because the AVI format is
easier to deal with and training and analysing the
videos is faster with smaller frame sizes. In each
recorded video, each person utters 35 different isolated words 20
times each: the numbers from 1 to 19 (19 words); twenty,
thirty and so on up to hundred (9 words); thousand and lakh (2 words); and
the cash counter words rupees, paise, sir, madam and please (5
words). These 35 words are normally used at cash counters
and also at STD booths and post offices. A hidden Markov
model was trained for every word from the visual
parameters, so the HMM system consists of 35 HMM models
to recognize 35 words. First, the models are initialized and
subsequently re-estimated with the embedded training
version of the Baum-Welch algorithm. Then, the training data
were aligned to the models through the Viterbi algorithm to
obtain the state duration densities. To recognize a new word,
the extracted feature vectors are fed as input to the HMM
system. The model with the maximum probability among the
35 HMM word models is taken as the output word model, and
the corresponding word is displayed in the form of text.
4900 samples (7 participants pronouncing 20 samples of each of
the 35 words) were collected for training and 2100 samples
(3 participants pronouncing 20 samples of each of the 35 words)
were used for testing. The performance of the proposed method using
HMM with respect to the different feature vectors is given in
Fig. 4. The spoken word recognition rate is very low for
the every 16th coordinate method, and the accuracy of
visual speech recognition for the Lip DCT method is 98.8%,
which is higher than that of all the other feature extraction
techniques [13].
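
The recognition step, picking the word whose model gives the maximum probability, can be sketched in Python as below. This is an illustration under stated assumptions, not the system used in the experiments: the features are assumed to be already quantized into discrete symbols, the names `log_likelihood` and `recognize` are hypothetical, and the two single-state word models are toy values. The likelihood is accumulated in the log domain with per-step scaling, a common way to avoid numerical underflow on long sequences.

```python
import math


def log_likelihood(model, obs):
    """log P(Observations | Model) using the scaled forward algorithm."""
    pi, A, B = model
    n = len(pi)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    logp = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [sum(alpha[i] * A[i][j] for i in range(n)) * B[j][obs[t]]
                     for j in range(n)]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence impossible under this model
        logp += math.log(scale)
        alpha = [a / scale for a in alpha]
    return logp


def recognize(models, obs):
    """Return the word whose HMM scores the observation sequence highest."""
    return max(models, key=lambda word: log_likelihood(models[word], obs))


# Hypothetical single-state word models over a two-symbol alphabet
models = {
    "one": ([1.0], [[1.0]], [[0.9, 0.1]]),
    "two": ([1.0], [[1.0]], [[0.2, 0.8]]),
}
first = recognize(models, [0, 0, 1])
second = recognize(models, [1, 1, 0])

# Sample counts stated in the text
train_samples = 7 * 20 * 35
test_samples = 3 * 20 * 35
```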
Fig -4: Performance of different Feature Extraction methods
8. CONCLUSION
In this paper, a new method for extracting the mouth
region from the face is presented. The recorded visual speech
video is given as input to the face localization module for
detecting the face ROI. Based on the rectangular ROI of the
face, another ROI is set to locate the mouth region. The
mouth ROI is separated from the frame and copied to
another frame which contains only the mouth region. The frame
containing only the mouth is subjected to image enhancement to
improve the quality of the image for further processing. The
enhanced image serves as the input for thresholding, where the
lip region is separated from the background. The resulting
frame after thresholding is a mass of lip contour points, from which
the feature points of the outer contour are extracted.
Different feature vectors are determined from the mouth ROI.
The extracted feature vectors are applied separately to the
HMM models and their performance is compared. The
output of the method is the corresponding text for the visual
speech. The recognition rate for the visual speech is low for
the every 10th coordinate method. The Lip DCT method is used
to recognize the isolated words and it achieves 98.8%
accuracy.
REFERENCES
[1] Vitor Pera, Filipe Sa Afonso, Ricardo Ferreira, “Audio
Visual Speech Recognition in a Portuguese Language Based
Application”, IEEE ICIT, Maribor, Slovenia, pp. 688-692,
2003.
[2] Alaa Sagheer, Naoyuki Tsuruta, Rin-Ichiro Taniguchi
and Sakashi Maeda, “Visual Speech Features Representation
for Automatic Lip Reading”, IEEE ICASSP, pp. 781-784,
2005.
[3] Georg F. Meyer, Jeffrey B. Mulligan, Sophie M. Wuerger,
“Continuous audio-visual digit recognition using N-best
decision fusion”, Elsevier, Information
Fusion 5, pp. 91-101, 2003.
[4] P. Viola and M. Jones, “Robust Real-time Object
Detection”, IEEE International Journal of Computer Vision,
vol. 57, no. 2, pp. 137-154, May 2004.
[5] P. Viola and M. Jones, “Rapid Object Detection using a
Boosted Cascade of Simple Features”, Conf. Computer
Vision and Pattern Recognition, vol. 1, pp. 511-518,
2001.
[6] Mitsuhiro Kawamura, Naoshi Kakita, Tmoyuki Osaki,
Kazunori Sugahara, Ryosuke Konishi, “On the Hardware
Realization of Lip Reading System”, SICE Annual
Conference in Fukui, pp. 2452-2457, 2003.
[7] Takeshi Saitoh and Ryosuke Konishi, “Word
Recognition based on Two Dimensional Lip Motion
Trajectory”, International Symposium on Intelligent Signal
Processing and Communication Systems, Japan, IEEE, pp. 287-290,
2006.
[8] S. L. Wang, W. H. Lau, S. H. Leung, “Automatic Lip
Contour Extraction from Lip Images”, Elsevier,
Pattern Recognition 37, pp. 2375-2384, 2004.
[9] Jong-Seok Lee and Cheol Hoon Park, “Training Hidden
Markov Models by Hybrid Simulated Annealing for Visual
Speech Recognition”, IEEE International Conference on
Systems, Man and Cybernetics, pp. 198-202, 2006.
[10] Yoshihiko Nakaku, Keiichi Tokuda, Tadashi Kitamura
and Takao Kobayashi, “Normalized Training for HMM-Based
Visual Speech Recognition”, IEEE, pp. 234-237, 2000.
[11] Huang Yong-hui, Pan Bao-chang, Liang Jian, Fan
Xiao-yan, “A new lip-automatic detection and location
algorithm in lip-reading system”, IEEE International Conference
on Systems, Man and Cybernetics (SMC), pp.
2402-2405, 2010.
[12] P. Sujatha and M. R. Krishnan, “Lip feature extraction for
visual speech recognition using Hidden Markov Model”,
2012 International Conference on Computing, Communication
and Applications (ICCCA), pp. 1-5, 22-24
Feb. 2012.
[13] Matthew Ramage and Euan Lindsay, “Wrapping
snakes for improved lip segmentation”, IEEE International
Conference on Acoustics, Speech and Signal Processing, pp.
1205-1208, 2009.
[14] Rafael C. Gonzalez and Richard E. Woods, “Digital
Image Processing”, Addison Wesley, Second edition.
BIOGRAPHIES
P. Sujatha is a faculty member of the
Department of Computer Science and
Engineering, Sudharsan Engineering College,
Tamilnadu, India. She has 12 years of teaching
experience. Her current research interests
include image processing, computer vision
and data mining.
Dr. M. Radhakrishnan is currently a Professor
in Civil Engineering and Director/IT,
Sudharsan Engineering College, Tamilnadu,
India. He has more than 35 years of teaching
experience. His fields of interest include
Computer Aided Structural Analysis,
Computer Networks, Image Processing and
Effort Estimation.

More Related Content

What's hot

IRJET- A Review on Face Recognition using Local Binary Pattern Algorithm
IRJET- A Review on Face Recognition using Local Binary Pattern AlgorithmIRJET- A Review on Face Recognition using Local Binary Pattern Algorithm
IRJET- A Review on Face Recognition using Local Binary Pattern AlgorithmIRJET Journal
 
IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...
IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...
IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...IRJET Journal
 
Latent fingerprint and vein matching using ridge feature identification
Latent fingerprint and vein matching using ridge feature identificationLatent fingerprint and vein matching using ridge feature identification
Latent fingerprint and vein matching using ridge feature identificationeSAT Publishing House
 
IRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET Journal
 
EFFECT OF FACE TAMPERING ON FACE RECOGNITION
EFFECT OF FACE TAMPERING ON FACE RECOGNITIONEFFECT OF FACE TAMPERING ON FACE RECOGNITION
EFFECT OF FACE TAMPERING ON FACE RECOGNITIONsipij
 
Facial Expression Recognition Using Local Binary Pattern and Support Vector M...
Facial Expression Recognition Using Local Binary Pattern and Support Vector M...Facial Expression Recognition Using Local Binary Pattern and Support Vector M...
Facial Expression Recognition Using Local Binary Pattern and Support Vector M...AM Publications
 
Facial Expression Recognition Using SVM Classifier
Facial Expression Recognition Using SVM ClassifierFacial Expression Recognition Using SVM Classifier
Facial Expression Recognition Using SVM Classifierijeei-iaes
 
Facial expression using 3 d animation
Facial expression using 3 d animationFacial expression using 3 d animation
Facial expression using 3 d animationIAEME Publication
 
Facial expression using 3 d animation
Facial expression using 3 d animationFacial expression using 3 d animation
Facial expression using 3 d animationiaemedu
 
Recognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse CodingRecognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse CodingIJERA Editor
 
A Review on Face Detection under Occlusion by Facial Accessories
A Review on Face Detection under Occlusion by Facial AccessoriesA Review on Face Detection under Occlusion by Facial Accessories
A Review on Face Detection under Occlusion by Facial AccessoriesIRJET Journal
 
Different Viewpoints of Recognizing Fleeting Facial Expressions with DWT
Different Viewpoints of Recognizing Fleeting Facial Expressions with DWTDifferent Viewpoints of Recognizing Fleeting Facial Expressions with DWT
Different Viewpoints of Recognizing Fleeting Facial Expressions with DWTIRJET Journal
 
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATIONA SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATIONijnlc
 
Implementation of face and eye detection on DM6437 board using simulink model
Implementation of face and eye detection on DM6437 board using simulink modelImplementation of face and eye detection on DM6437 board using simulink model
Implementation of face and eye detection on DM6437 board using simulink modeljournalBEEI
 

What's hot (17)

IRJET- A Review on Face Recognition using Local Binary Pattern Algorithm
IRJET- A Review on Face Recognition using Local Binary Pattern AlgorithmIRJET- A Review on Face Recognition using Local Binary Pattern Algorithm
IRJET- A Review on Face Recognition using Local Binary Pattern Algorithm
 
IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...
IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...
IRJET- Multiple Feature Fusion for Facial Expression Recognition in Video: Su...
 
Latent fingerprint and vein matching using ridge feature identification
Latent fingerprint and vein matching using ridge feature identificationLatent fingerprint and vein matching using ridge feature identification
Latent fingerprint and vein matching using ridge feature identification
 
IRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind PeopleIRJET - A Review on Text Recognition for Visually Blind People
IRJET - A Review on Text Recognition for Visually Blind People
 
EFFECT OF FACE TAMPERING ON FACE RECOGNITION
EFFECT OF FACE TAMPERING ON FACE RECOGNITIONEFFECT OF FACE TAMPERING ON FACE RECOGNITION
EFFECT OF FACE TAMPERING ON FACE RECOGNITION
 
Facial Expression Recognition Using Local Binary Pattern and Support Vector M...
Facial Expression Recognition Using Local Binary Pattern and Support Vector M...Facial Expression Recognition Using Local Binary Pattern and Support Vector M...
Facial Expression Recognition Using Local Binary Pattern and Support Vector M...
 
Facial Expression Recognition Using SVM Classifier
Facial Expression Recognition Using SVM ClassifierFacial Expression Recognition Using SVM Classifier
Facial Expression Recognition Using SVM Classifier
 
Facial expression using 3 d animation
Facial expression using 3 d animationFacial expression using 3 d animation
Facial expression using 3 d animation
 
Facial expression using 3 d animation
Facial expression using 3 d animationFacial expression using 3 d animation
Facial expression using 3 d animation
 
Recognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse CodingRecognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse Coding
 
A Review on Face Detection under Occlusion by Facial Accessories
A Review on Face Detection under Occlusion by Facial AccessoriesA Review on Face Detection under Occlusion by Facial Accessories
A Review on Face Detection under Occlusion by Facial Accessories
 
Real time facial expression analysis using pca
Real time facial expression analysis using pcaReal time facial expression analysis using pca
Real time facial expression analysis using pca
 
A04430105
A04430105A04430105
A04430105
 
Different Viewpoints of Recognizing Fleeting Facial Expressions with DWT
Different Viewpoints of Recognizing Fleeting Facial Expressions with DWTDifferent Viewpoints of Recognizing Fleeting Facial Expressions with DWT
Different Viewpoints of Recognizing Fleeting Facial Expressions with DWT
 
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATIONA SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
A SIGNATURE BASED DRAVIDIAN SIGN LANGUAGE RECOGNITION BY SPARSE REPRESENTATION
 
Implementation of face and eye detection on DM6437 board using simulink model
Implementation of face and eye detection on DM6437 board using simulink modelImplementation of face and eye detection on DM6437 board using simulink model
Implementation of face and eye detection on DM6437 board using simulink model
 
N010226872
N010226872N010226872
N010226872
 

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Recently uploaded (20)

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

Speaker independent visual lip activity detection for

IJRET: International Journal of Research in Engineering and Technology, eISSN: 2319-1163 | pISSN: 2321-7308
Volume: 02 Issue: 11 | Nov-2013, Available @ http://www.ijret.org

SPEAKER-INDEPENDENT VISUAL LIP ACTIVITY DETECTION FOR HUMAN-COMPUTER INTERACTION

P. Sujatha (1), M. Radhakrishnan (2)
(1) Department of Computer Science and Engineering, Sudharsan Engineering College, Pudukkottai, Tamilnadu, India, suja_param@yahoo.com
(2) Director / IT, Sudharsan Engineering College, Pudukkottai, Tamilnadu, India, sumyukta2005@yahoo.com

Abstract
Recently there has been increased interest in using visual features to improve speech processing. Lip reading plays a vital role in visual speech processing. In this paper, a new approach to lip reading is presented. Visual speech recognition is applied in mobile phone applications, in human-computer interaction, and to recognize the words spoken by hearing-impaired persons. The visual speech video is given as input to a face detection module, which detects the face region. The mouth region is then identified from the face region of interest (ROI), and the mouth images are passed to the feature extraction process. Features are extracted using four methods: every 10th coordinate, every 16th coordinate, 16 point + Discrete Cosine Transform (DCT), and Lip DCT. These features are then applied as inputs to a Hidden Markov Model (HMM) to recognize the visual speech. Of the feature extraction methods, the Lip DCT method gives the best recognition accuracy in the experiments. Ten participants uttered 35 different isolated words, and 20 samples of each word were collected for training and testing.

Index Terms: Feature Extraction, HMM, Mouth ROI, DCT, Visual Speech Recognition

1. INTRODUCTION
Visual speech recognition refers to recognizing spoken words from visual lip movements. It is an area with great potential to solve challenging problems in speech processing. The difficulties of an audio-based speech recognition system can be significantly reduced by the additional information that visual features provide. It is well known that visual speech information conveyed through lip movement is very useful for human speech perception. The main difficulty in incorporating visual information into an acoustic speech recognition method is finding a robust and accurate way to extract the essential visual speech features.

Figure 1 illustrates the proposed system architecture of the visual speech recognition process. The recorded visual speech video is given as input to the system. The algorithm starts by detecting the face using the popular face detection technique of Viola and Jones [4, 5]. Once the face is detected, the mouth ROI is localized using a simple algorithm. The next step is to extract the visual features of the lip region. These feature vectors are then applied separately as inputs to the HMM classifier to recognize the spoken word. The aim of the paper is to extract the visual lip movements (lip features) and to predict the word that is actually pronounced.

This paper is organized as follows. Section 2 surveys the literature on the extraction of visual speech features. Section 3 describes the face localization process. Section 4 describes the mouth ROI detection algorithm. Section 5 explains the lip feature extraction techniques. Section 6 explains the HMM classifier. Section 7 discusses the database and the experimental results, and Section 8 presents the conclusion.

Fig -1: Overview of the proposed Visual Speech Recognition system

2. LITERATURE SURVEY
An automatic speech recognizer was developed for a speaker-dependent, continuous-speech alphanumeric recognition application based on the European Portuguese language [1]. The hyper column model (HCM) was used to extract visual speech features from the input image, and the extracted features were modeled by Gaussian distributions through an HMM [2]. Audio-visual digit recognition using N-best decision fusion was proposed in [3]. Viola and Jones presented a face detector based on a machine learning approach to visual object detection [4, 5]. The lip reading system designed by Petajan [6] was based on geometric features such as the mouth's height, width, area and perimeter. Another technique, the Automatic Lip Feature Extraction prototype (ALiFE) designed by Werda [7], includes lip localization, lip tracking, visual feature extraction and speech unit recognition for French vowels uttered by multiple speakers. Wang [8] introduced a region-based lip contour extraction algorithm that uses a 16-point lip model to describe the lip contour. A training algorithm for HMMs based on a modified simulated annealing (SA) technique was proposed for visual speech recognition to improve the convergence speed and the solution quality [9]. An approach to estimating the parameters of continuous density HMMs for visual speech recognition was presented in [10]. In [11], Haar features are used to train an AdaBoost classifier, combined with a skin and lip color separation algorithm, to form a self-adaptive separation model that can dynamically adjust constant parameters. A lip reading technique for speech recognition using motion estimation analysis was proposed by Matthew Ramage [13].
A user authentication system based on password lip reading was also presented, in which motion estimation was performed on lip movement image sequences representing speech.

3. FACE LOCALIZATION
The Viola and Jones face detector is capable of processing images rapidly while achieving high detection rates. The work is distinguished by three key contributions. The first is the integral image, a new image representation that allows the features used by the detector to be computed very quickly. For each pixel in the original image there is exactly one pixel in the integral image, whose value is the sum of the original image values above and to the left of it. The detector's performance can be attributed to an attentional cascade of detectors, each using a small number of features based on a natural extension of Haar wavelets. Each detector in the cascade fits objects to simple rectangular masks, and the integral image reduces the number of computations required while moving through the cascade. The second contribution is an AdaBoost learning algorithm, which selects a small number of critical visual features from a large set and yields extremely efficient classifiers. The third is a method for combining increasingly complex classifiers in a cascade, which allows background regions of the image to be discarded quickly while more computation is spent on promising object-like regions.

In this paper, the video is captured while a person is pronouncing a word and is stored in AVI file format. The video frames are then grabbed and passed to the Viola and Jones face detector, which detects the face in each frame and highlights it inside a rectangular ROI (region of interest).
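The integral image described in this section can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `integral_image` and `rect_sum` are hypothetical, and the sketch assumes the inclusive convention, in which each entry also counts the pixel at its own position. It shows why rectangular (Haar-like) features are cheap: once the table is built, the sum over any rectangle needs at most four lookups.

```python
def integral_image(img):
    """Return the summed-area table of a 2D list of numbers.

    ii[r][c] is the sum of img over rows 0..r and columns 0..c (inclusive).
    """
    rows, cols = len(img), len(img[0])
    ii = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        row_sum = 0  # running sum of the current row up to column c
        for c in range(cols):
            row_sum += img[r][c]
            ii[r][c] = row_sum + (ii[r - 1][c] if r > 0 else 0)
    return ii


def rect_sum(ii, top, left, bottom, right):
    """Sum of the original pixels in rows top..bottom and columns left..right."""
    total = ii[bottom][right]
    if top > 0:
        total -= ii[top - 1][right]       # remove the strip above the rectangle
    if left > 0:
        total -= ii[bottom][left - 1]     # remove the strip to its left
    if top > 0 and left > 0:
        total += ii[top - 1][left - 1]    # add back the corner removed twice
    return total
```

A two-rectangle Haar-like feature is then just the difference of two `rect_sum` calls over adjacent rectangles.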

Fig -2: Face localization process using AdaBoost classifier

4. MOUTH REGION OF INTEREST DETECTION
The mouth region is the visible part of the human speech production system and holds most of the visual speech information. It is therefore imperative for any visual speech recognition (VSR) system to detect or localize this region in order to capture the related visual information; we cannot read lips without seeing them first. Lip localization is thus an essential process for any VSR system. Model-based techniques for lip detection and localization in digital images include snakes, active shape models (ASM), active appearance models (AAM) and deformable templates. Image-based lip detection methods use spatial information, pixel color and intensity, lines, corners, edges and motion.

Fig -3: Mouth ROI determination in real time video

In this paper, an image-based lip detection method is used to extract the mouth region. In a standard face the mouth is located in the lower half of the face. Based on this observation, an ROI is set by reducing the left, width, top and height values of the face ROI. The mouth ROI is then localized using values derived from the calculations in Table 1, and the extracted mouth ROI is copied into a new frame for further processing. Fig 3 shows the mouth ROI extraction performed by the algorithm of Table 1. The proposed method has the advantage of providing a reliable mouth ROI without any geometric model assumption and without complex procedures such as corner and edge detection. The method was evaluated on 175000 frames of the in-house database. The experiments show that the method localizes the mouth ROI efficiently, with high accuracy (91.15 %).

Table -1: The algorithm to extract Mouth ROI from the face ROI
1. The frames of the face ROI are grabbed and given as input for mouth localization and extraction.
2. Find the values Fl, Fw, Ft and Fh of the face in the XY plane, where
   Fl = left value of the face ROI
   Fw = width value of the face ROI
   Ft = top value of the face ROI
   Fh = height value of the face ROI
3. The mouth ROI is extracted using the following calculations:
   Ml = Fl + (Fw - Fl) / 4        (1)
   Mw = Fw - (Fw - Fl) / 4        (2)
   Mt = Ft + (2 * (Fh - Ft)) / 3  (3)
   Mh = Fh - (Fh - Ft) / 15       (4)
   where
   Ml = left of the mouth ROI
   Mw = width of the mouth ROI
   Mt = top of the mouth ROI
   Mh = height of the mouth ROI
4. Ml, Mw, Mt and Mh are the values used to localize the mouth ROI.
5. Repeat steps 2, 3 and 4 for all the frames until the video ends.

Compared to other similar algorithms, the solution proposed here has the advantage of providing a reliable lip contour without any geometric model assumption and without complex procedures such as edge detection. This approach will be helpful for research that involves extracting the outer contour of the lip, such as lip reading.

5. FEATURE EXTRACTION TECHNIQUES
VSR systems require the analysis of feature vectors extracted from the speech-related visual signals in the sequence of frames of the speaker's face while the words are uttered. To find a signal or signature for each word, we need a proper way of extracting the most relevant features, which play an important role in recognizing that word.

The frame that contains only the mouth (the mouth ROI) is subjected to image enhancement to improve the quality of the image for further processing. The enhancement starts by increasing or decreasing the brightness or contrast of the image. The enhanced image serves as the input for thresholding, where the lip region is separated from the background. In this paper, adaptive thresholding is used to generate the lip region from the mouth ROI frame. Adaptive thresholding takes a color image as input and, in the simplest implementation, outputs a binary image representing the segmentation. A threshold is calculated for each pixel in the image. If the pixel value is below the threshold it is set to the background value (white); otherwise it takes the foreground value (black). The thresholded image is enlarged to 200 x 200 pixels for better processing. The frame that results from thresholding is a mass of lip contour points, from which the feature points of the outer contour are extracted for both the upper and lower lips. The point of interest (POI) is detected by projecting the final contour onto the horizontal and vertical axes.
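Returning to Table 1, the mouth ROI calculation of equations (1) to (4) can be sketched in Python as follows. This is a minimal illustration rather than the authors' code. Because the equations use the differences (Fw - Fl) and (Fh - Ft), the sketch assumes that Fw and Fh are the right and bottom edge coordinates of the face rectangle. That reading, the `Box` type and the function name `mouth_roi` are assumptions introduced here.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned rectangle given by its edge coordinates (hypothetical type)."""
    left: float    # Fl / Ml
    right: float   # Fw / Mw, read here as the right edge coordinate
    top: float     # Ft / Mt
    bottom: float  # Fh / Mh, read here as the bottom edge coordinate


def mouth_roi(face: Box) -> Box:
    """Apply equations (1)-(4) of Table 1 to a face ROI."""
    span_x = face.right - face.left    # (Fw - Fl)
    span_y = face.bottom - face.top    # (Fh - Ft)
    return Box(
        left=face.left + span_x / 4,        # equation (1)
        right=face.right - span_x / 4,      # equation (2)
        top=face.top + 2 * span_y / 3,      # equation (3)
        bottom=face.bottom - span_y / 15,   # equation (4)
    )
```

For a face rectangle with Fl = 100, Fw = 300, Ft = 50 and Fh = 350, the equations give Ml = 150, Mw = 250, Mt = 250 and Mh = 330. Under this reading the mouth ROI is the middle half of the face horizontally and starts two thirds of the way down the face.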

The following feature extraction methods are applied to the sequence of lip contour points of the mouth ROI while the words are uttered.

(i) Every 10th Coordinate Method - From the mass of lip contour points, every 10th coordinate is selected. The feature points are taken in top-to-bottom, left-to-right order between the starting and ending positions of the lip contour (x, y) coordinates.

(ii) Every 16th Coordinate Method - From the mass of lip contour points, 16 coordinates are used as feature vectors. The left, right, top and bottom points of the contour, located relative to the center of the lip, are selected, together with the (x, y) coordinates midway between them (left to top, top to right, right to bottom and bottom to left). The coordinates midway between each adjacent pair of these contour points are also selected, giving 16 coordinates in total. The normalized distance from the center point of the lip is computed for the 16 coordinates and used as the feature vector.

(iii) 16 Point + DCT Method - The Discrete Cosine Transform is applied to the 16 coordinates obtained from method (ii), and the results are used as feature vectors.

(iv) DCT Method for Entire Lip Region (Lip DCT) - The entire lip region is selected as the feature vector. The Discrete Cosine Transform of the coordinates of the entire lip region is calculated and used as the feature points.

The discrete cosine transform (DCT) separates the image into parts of differing importance with respect to the image's visual quality. The DCT is similar to the Discrete Fourier Transform: it transforms a signal or image from the spatial domain to the frequency domain. The general equation for the 2D DCT of an N by M image is:

\[
F(u,v) = \sqrt{\frac{2}{N}}\,\sqrt{\frac{2}{M}}\;\Lambda(u)\,\Lambda(v)
\sum_{i=0}^{N-1}\sum_{j=0}^{M-1}
\cos\!\left[\frac{\pi u}{2N}(2i+1)\right]
\cos\!\left[\frac{\pi v}{2M}(2j+1)\right] f(i,j)
\tag{5}
\]

where

\[
\Lambda(\xi) =
\begin{cases}
\dfrac{1}{\sqrt{2}} & \text{for } \xi = 0 \\
1 & \text{otherwise}
\end{cases}
\]

The basic operation of the DCT is as follows:
• The input image is N by M.
• f(i,j) is the intensity of the pixel in row i and column j.
• F(u,v) is the DCT coefficient in row u and column v of the DCT matrix.
• For most images, much of the signal energy lies at low frequencies; these appear in the upper left corner of the DCT.
• Compression is achieved since the lower right values represent higher frequencies and are often small enough to be neglected with little visible distortion.
• The DCT input is an 8 by 8 array of integers. This array contains each pixel's gray scale level.
• 8 bit pixels have levels from 0 to 255.

6. HIDDEN MARKOV MODEL
A hidden Markov model (HMM) is denoted by:

\[
\lambda = (\Pi, A, B) \tag{6}
\]

where Π is the initial state distribution, A is the state transition matrix and B is the emission probability matrix. The emission probability matrix specifies, for each state, a probability distribution over the output alphabet. The output alphabet need not be the same as the state space.
Denoting the output alphabet by θ = {1, 2, ..., M}, B is a matrix with N rows and M columns:

\[
B =
\begin{pmatrix}
b_1(1) & b_1(2) & \cdots & b_1(M) \\
b_2(1) & b_2(2) & \cdots & b_2(M) \\
\vdots & \vdots & \ddots & \vdots \\
b_N(1) & b_N(2) & \cdots & b_N(M)
\end{pmatrix}
\tag{7}
\]

where b_i(k) is the probability of symbol k being emitted from state i. The emission probability matrix is another stochastic matrix, in the sense that each row sums to one and all elements are greater than or equal to zero.

An HMM poses three problems:

(i) Evaluation, or computing P(Observations | Model): This allows us to find out how well a model matches a given observation sequence. The main concern here is computational efficiency, that is, finding an algorithm with only polynomial running time.

(ii) Decoding, or finding the hidden state sequence that best corresponds to the observed symbols: Because there are generally many state sequences that give rise to the same symbols, there is no "correct" solution in most cases, so some optimality criterion must be chosen. The most widely used criterion is to find the path through the model that maximizes P(Path | Observations, Model).

(iii) Training or learning: Finding the model parameter values λ = (Π, A, B) that specify the model most likely to produce a given sequence of training data. In other words, the objective is to construct a model that best fits the training data (or best represents the source that produced the data). There is no known way to solve analytically for the best model, but there is an iterative algorithm that often yields sufficiently good approximations. The training problem for hidden Markov models is to estimate the transition probabilities, the initial state distribution and the emission probability distributions from sample data.

The feature vectors are trained and tested using the HMM classifier.

7. EXPERIMENTAL RESULTS
The in-house videos were recorded in a normal room using a web camera. The participants were 4 females and 6 males, distributed over different age groups. The videos were recorded at 25 frames per second.
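The evaluation problem of Section 6, computing P(Observations | Model), can be sketched in Python with the forward algorithm, which runs in time polynomial in the number of states and the sequence length. This is a minimal illustration under stated assumptions, not the authors' implementation. It assumes a discrete HMM whose observations are already quantized to symbol indices 0..M-1, and the names `forward_likelihood` and `recognize` are hypothetical. The second function illustrates isolated-word recognition by picking the word model with the highest likelihood.

```python
def forward_likelihood(pi, A, B, obs):
    """P(obs | model) for a discrete HMM, computed with the forward algorithm.

    pi[i]   : probability of starting in state i
    A[i][j] : probability of moving from state i to state j
    B[i][k] : probability of emitting symbol k from state i
    obs     : non-empty list of symbol indices
    """
    n = len(pi)
    # alpha[i] = P(observations so far, current state = i)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    for symbol in obs[1:]:
        alpha = [
            sum(alpha[i] * A[i][j] for i in range(n)) * B[j][symbol]
            for j in range(n)
        ]
    return sum(alpha)


def recognize(models, obs):
    """Return the label of the model that assigns obs the highest likelihood.

    models maps a word label to its (pi, A, B) triple.
    """
    return max(models, key=lambda word: forward_likelihood(*models[word], obs))
```

For long observation sequences the raw probabilities underflow, so a practical version would scale `alpha` at each step or work with log probabilities. The sketch omits this for brevity.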
The videos are stored in AVI format and resized to 320 x 240 pixels, because the AVI format is easier to deal with and training and analysing videos is faster with smaller frame sizes. In the recorded videos, each person utters 35 different isolated words 20 times each. The words are the numbers from 1 to 19 (19 words); twenty, thirty and so on up to hundred (9 words); thousand and lakh (2 words); and the cash counter words rupees, paise, sir, madam and please (5 words). These 35 words are normally used at cash counters, STD booths and post offices.

A hidden Markov model was trained for every word from the visual parameters, so the HMM system consists of 35 HMM models to recognize 35 words. First, the models are initialized and subsequently re-estimated with the embedded training version of the Baum-Welch algorithm. Then the training data are aligned to the models through the Viterbi algorithm to obtain the state duration densities. To recognize a new word, the extracted feature vectors are fed as input to the HMM system. The model with the maximum probability among the 35 HMM word models is taken as the output word model, and the corresponding word is displayed as text.

A total of 4900 samples (7 participants pronounced 20 samples of each of the 35 words) were collected for training, and 2100 samples (3 participants pronounced 20 samples of each of the 35 words) were used for testing. The performance of the proposed method using HMM with the different feature vectors is given in Fig 4. The spoken word recognition rate is very low for the every 16th coordinate method, while the accuracy of visual speech recognition for the Lip DCT method is 98.8%, which is higher than that of all the other feature extraction techniques [12].

Fig -4: Performance of different feature extraction methods

8. CONCLUSION
In this paper, a new method for extracting the mouth region from the face is presented. The recorded visual speech video is given as input to the face localization module to detect the face ROI. Based on the rectangular ROI of the face, another ROI is set to locate the mouth region.
The mouth ROI is separated from the frame and copied to another frame that contains only the mouth region. The frame that contains only the mouth is subjected to image enhancement to improve the quality of the image for further processing. The enhanced image serves as the input for thresholding, where the lip region is separated from the background. The frame that results from thresholding is a mass of lip contour points, from which the feature points of the outer contour are extracted. The different feature vectors are determined from the mouth ROI. The extracted feature vectors are applied separately to the HMM models and their performance is compared. The output of the method is the text corresponding to the visual speech. The recognition rate for the visual speech is low for the every 10th coordinate method. The Lip DCT method is used to recognize the isolated words and achieves an accuracy of 98.8%.

REFERENCES
[1] Vitor Pera, Filipe Sa Afonso, Ricardo Ferreira, "Audio Visual Speech Recognition in a Portuguese Language Based Application", IEEE ICIT, Maribor, Slovenia, pp. 688-692, 2003.
[2] Alaa Sagheer, Naoyuki Tsuruta, Rin-Ichiro Taniguchi and Sakashi Maeda, "Visual Speech Features Representation for Automatic Lip Reading", IEEE ICASSP, pp. 781-784, 2005.
[3] Georg F. Meyer, Jeffrey B. Mulligan, Sophie M. Wuerger, "Continuous audio-visual digit recognition using N-best decision fusion", Information Fusion 5, Elsevier Ltd, pp. 91-101, 2003.
[4] P. Viola and M. Jones, "Robust Real-time Object Detection", IEEE International Journal of Computer Vision, vol. 57, no. 2, pp. 137-154, May 2004.
[5] P. Viola and M. Jones, "Rapid Object Detection using a Boosted Cascade of Simple Features", Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 511-518, 2001.
[6] Mitsuhiro Kawamura, Naoshi Kakita, Tomoyuki Osaki, Kazunori Sugahara, Ryosuke Konishi, “On the Hardware Realization of Lip Reading System”, SICE Annual Conference in Fukui, pp. 2452-2457, 2003.
[7] Takeshi Saitoh and Ryosuke Konishi, “Word Recognition based on Two Dimensional Lip Motion Trajectory”, International Symposium on Intelligent Signal Processing and Communication Systems, Japan, IEEE, pp. 287-290, 2006.
[8] S.L. Wang, W.H. Lau, S.H. Leung, “Automatic Lip Contour extraction From Lip Images”, Elsevier Ltd, Pattern Recognition 37, pp. 2375-2384, 2004.
[9] Jong-Seok Lee and Cheol Hoon Park, “Training Hidden Markov Models by Hybrid Simulated annealing for Visual
  • 6. Speech recognition”, IEEE International Conference on Systems, Man and Cybernetics, pp. 198-202, 2006.
[10] Yoshihiko Nakaku, Keiichi Tokuda, Tadashi Kitamura and Takao Kobayashi, “Normalized Training for HMM-Based Visual Speech recognition”, IEEE, pp. 234-237, 2000.
[11] Huang Yong-hui, PAN Bao-chang, LIANG Jian, FAN Xiao-yan, “A new lip-automatic detection and location algorithm in lip-reading system”, Systems Man and Cybernetics (SMC), IEEE International Conference, pp. 2402-2405, 2010.
[12] Sujatha, P.; Krishnan, M.R., “Lip feature extraction for visual speech recognition using Hidden Markov Model”, Computing, Communication and Applications (ICCCA), 2012 International Conference on, pp. 1-5, 22-24 Feb. 2012.
[13] Matthew Ramage and Euan Lindsay, “Wrapping snakes for improved lip segmentation”, IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1205-1208, 2009.
[14] Rafael C. Gonzalez and Richard E. Woods, “Digital Image Processing”, Addison Wesley, Second edition.

BIOGRAPHIES

[1] P. Sujatha is a faculty member of the Department of Computer Science and Engineering, Sudharsan Engineering College, Tamilnadu, India. She has 12 years of teaching experience. Her current research interests include image processing, computer vision and data mining. Dr. M. Radhakrishnan is currently a Professor in Civil Engineering and Director/IT, Sudharsan Engineering College, Tamilnadu, India. He has more than 35 years of teaching experience. His fields of interest include Computer Aided Structural Analysis, Computer Networks, Image Processing and Effort Estimation.