SlideShare a Scribd company logo
Zeroth Review - 2021
M.Vignesh
221003105
IV-CSE A
221003105@sastra.ac.in
S.Mahadevan
221003057
IV-CSE A
221003057@sastra.ac.in
Guided By
Ms. Bhavani R
APII/CSE/SRC/SASTRA
PROJECT
OVERVIEW
• Visual Speech Recognition
• Extract lip features
• Neural Network model to train the lip sequence
• Transcribing and evaluate Lip movements into text
3
INTRODUCTION
4
• There are many Recognition System that recognize words from audio features.
• Lip reading System is one of the developing technology
• Aims to recognize words only by visual feature without audio
• Classify and recognize words by visemes movements
PROBLEM
STATEMENT
5
• Noisy Environment
• Speech Speed
• Accent
• Pronunciation
• Facial Features
OBJECTIVE
6
• Extract textual or speech data from facial features
• Train a Neural network system to process visemes sequence
• Develop a Speaker Independed system
• Recognize and classify ten different words
S/W –H/W REQUIREMENTS
7
SOFTWARE REQUIREMENTS:
• Anaconda
• System:64 bit OS, x64 processor
HARDWARE REQUIREMENTS:
• 4 GB RAM
• Better GPU(For performance)
EXISTING VS PROPOSED SYSTEM
8
Existing System:
• Uses BBCLRS2 Dataset
• Recognize and classify only ASCII characters and decode words
• Complex processing and gives better accuracy after 2000 epochs
Proposed System:
• Uses MIRACL-VC1 Dataset
• Recognize and classify ten different words
• Simple and gives better accuracy after 200 epochs
9
S.No Paper Title, author Journal details with Date
of publication
Methodology applied Merits and demerits
1 Lip-Reading Driven Deep
Learning Approach for
Speech Enhancement, Ahsan
Adeel, Mandar Gogate, Amir
Hussain, and William M. Whitmer
2019- IEEE Transaction This paper uses LSTM driven
Audio Visual
mapping approach.
• Increased accuracy
• Autonomous Speech
enhancement
• Poor performance for
Realtime speech
2 An audio-visual corpus for
multimodal automatic
speech recognition, Andrzej
Czyzewski, Bozena Kostek,
Piotr Bratoszewski,Jozef Kotus,
Marcin Szykulski
2017-Springer This paper uses Active
Appearance Model(AAM) and
Hidden Markov Models(HMM)
• Recognize in street noise
• Babble noise dramatically
worsens the accuracy of
speech recognition
3 Extraction of Visual Features for
Lipreading, Iain Matthews,
Timothy F. Cootes, J. Andrew
Bangham, Stephen Cox,
Richard Harvey
2017-IEEE Transaction This paper uses Active shape
model (ASM) and point
distribution model (PDM)
• Accuracy improved when
a noisy audio
signal is augmented with visual
information
• Poor performance in
Babble noise
4 Audio-visual speech
recognition using deep
learning, Kuniaki Noda, Yuki
Yamaguchi, Kazuhiro Nakadai,
Hiroshi G. Okuno,Tetsuya Ogata
2014-Springer This paper uses Hidden Markov
Model(HMM)
• Increased Performance
• Reverberation, illumination,
and facial orientation,
occur
5 Speaker-Independent Speech
Recognition using
Visual Features, Pooventhiran
G., Sandeep A.
2020-IEEE This paper uses 3D-CNN model • Improved Accuracy
• Complex
LITERATURE SURVEY
10
PROPOSED ARCHITECTURE
REFERENCES
11
[1] A. Thanda and S. M. Venkatesan, “Audio visual speech recognition using deep recurrent
neural networks,” in IAPR workshop on multimodal pattern recognition of social signals in human-
computer interaction. Springer, 2016
[2] E. Petajan, B. Bischoff, D. Bodoff, and N. M. Brooke, “An improved automatic lipreading
system to enhance speech recognition,” in Proceedings of the SIGCHI conference on Human factors
in computing systems, 1988
[3] A. Torfi, S. M. Iranmanesh, N. Nasrabadi, and J. Dawson, “3d convolutional
neural networks for cross audio-visual matching recognition,” IEEE Access, vol. 5, pp. 22 081–22 091,
2017.
[4] N. Alothmany, R. Boston, C. Li, S. Shaiman, and J. Durrant, “Classification
of visemes using visual cues,” in Proceedings ELMAR-2010.
IEEE, 2010.
[5] I. Almajai, S. Cox, R. Harvey, and Y. Lan, “Improved speaker independent lip reading using
speaker adaptive training and deep neural networks,” in 2016 IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP).
12
THANK YOU

More Related Content

What's hot

Hand Gesture Recognition system for deaf and dump people
Hand Gesture Recognition system for deaf and dump peopleHand Gesture Recognition system for deaf and dump people
Hand Gesture Recognition system for deaf and dump peopleManu Devappa
 
SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK Kamonasish Hore
 
Face recognition technology
Face recognition technologyFace recognition technology
Face recognition technologyPushkar Dutt
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 
Reverse Channel
Reverse ChannelReverse Channel
Reverse ChannelAnantha10
 
MPEG video compression standard
MPEG video compression standardMPEG video compression standard
MPEG video compression standardanuragjagetiya
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniquessonukumar142
 
Ppt presentation
Ppt presentationPpt presentation
Ppt presentationvishal4799
 
Face recognition technology
Face recognition technologyFace recognition technology
Face recognition technologyranjit banshpal
 
Face recognization
Face recognizationFace recognization
Face recognizationleenak770
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceIlhaan Marwat
 
Video compression
Video compressionVideo compression
Video compressionnnmaurya
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxNsaroj kumar
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technologyKalluri Madhuri
 
Gender and Age Detection using OpenCV.pptx
Gender and Age Detection using OpenCV.pptxGender and Age Detection using OpenCV.pptx
Gender and Age Detection using OpenCV.pptxSakshiVishwakarma12
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 

What's hot (20)

Hand Gesture Recognition system for deaf and dump people
Hand Gesture Recognition system for deaf and dump peopleHand Gesture Recognition system for deaf and dump people
Hand Gesture Recognition system for deaf and dump people
 
SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK
 
Face recognition technology
Face recognition technologyFace recognition technology
Face recognition technology
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Reverse Channel
Reverse ChannelReverse Channel
Reverse Channel
 
MPEG video compression standard
MPEG video compression standardMPEG video compression standard
MPEG video compression standard
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
 
Audio compression
Audio compressionAudio compression
Audio compression
 
Ppt presentation
Ppt presentationPpt presentation
Ppt presentation
 
Face recognition technology
Face recognition technologyFace recognition technology
Face recognition technology
 
Face recognization
Face recognizationFace recognization
Face recognization
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Video compression
Video compressionVideo compression
Video compression
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptx
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technology
 
Gender and Age Detection using OpenCV.pptx
Gender and Age Detection using OpenCV.pptxGender and Age Detection using OpenCV.pptx
Gender and Age Detection using OpenCV.pptx
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Mpeg 7 slides
Mpeg 7 slides Mpeg 7 slides
Mpeg 7 slides
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Gesture Recognition
Gesture RecognitionGesture Recognition
Gesture Recognition
 

Similar to Lip Reading.pptx

LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...IRJET Journal
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxVishnuRajuV
 
Deep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker RecognitionDeep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker RecognitionSai Kiran Kadam
 
Emotion recognition using facial expressions and speech
Emotion recognition using facial expressions and speechEmotion recognition using facial expressions and speech
Emotion recognition using facial expressions and speechLakshmi Sarvani Videla
 
"Approaches for Vision-based Driver Monitoring," a Presentation from PathPart...
"Approaches for Vision-based Driver Monitoring," a Presentation from PathPart..."Approaches for Vision-based Driver Monitoring," a Presentation from PathPart...
"Approaches for Vision-based Driver Monitoring," a Presentation from PathPart...Edge AI and Vision Alliance
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologyAamir-sheriff
 
lip reading using deep learning presentation
lip reading using deep learning presentationlip reading using deep learning presentation
lip reading using deep learning presentationgokuldongala
 
A Survey on Speaker Recognition System
A Survey on Speaker Recognition SystemA Survey on Speaker Recognition System
A Survey on Speaker Recognition SystemVani011
 
Character Recognition using Data Mining Technique (Artificial Neural Network)
Character Recognition using Data Mining Technique (Artificial Neural Network)Character Recognition using Data Mining Technique (Artificial Neural Network)
Character Recognition using Data Mining Technique (Artificial Neural Network)Sudipto Krishna Dutta
 
Deep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event DetectionDeep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event DetectionSai Kiran Kadam
 
Biometric Recognition using Deep Learning
Biometric Recognition using Deep LearningBiometric Recognition using Deep Learning
Biometric Recognition using Deep LearningSahithiKotha2
 
Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCNathanBowen8
 
Umesh nfc login application for mobile devices29th nov
Umesh nfc login application for mobile devices29th novUmesh nfc login application for mobile devices29th nov
Umesh nfc login application for mobile devices29th novUmeshjamce
 
Text Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.pptText Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.pptGrace136708
 
Recognizing Facial Expression Through Frequency Neural Network.pptx
Recognizing Facial Expression Through Frequency Neural Network.pptxRecognizing Facial Expression Through Frequency Neural Network.pptx
Recognizing Facial Expression Through Frequency Neural Network.pptxsrajece
 

Similar to Lip Reading.pptx (20)

LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
 
Deep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker RecognitionDeep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker Recognition
 
Emotion recognition using facial expressions and speech
Emotion recognition using facial expressions and speechEmotion recognition using facial expressions and speech
Emotion recognition using facial expressions and speech
 
"Approaches for Vision-based Driver Monitoring," a Presentation from PathPart...
"Approaches for Vision-based Driver Monitoring," a Presentation from PathPart..."Approaches for Vision-based Driver Monitoring," a Presentation from PathPart...
"Approaches for Vision-based Driver Monitoring," a Presentation from PathPart...
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
lip reading using deep learning presentation
lip reading using deep learning presentationlip reading using deep learning presentation
lip reading using deep learning presentation
 
A Survey on Speaker Recognition System
A Survey on Speaker Recognition SystemA Survey on Speaker Recognition System
A Survey on Speaker Recognition System
 
Pooja_Patil
Pooja_PatilPooja_Patil
Pooja_Patil
 
Character Recognition using Data Mining Technique (Artificial Neural Network)
Character Recognition using Data Mining Technique (Artificial Neural Network)Character Recognition using Data Mining Technique (Artificial Neural Network)
Character Recognition using Data Mining Technique (Artificial Neural Network)
 
Deep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event DetectionDeep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event Detection
 
Biometric Recognition using Deep Learning
Biometric Recognition using Deep LearningBiometric Recognition using Deep Learning
Biometric Recognition using Deep Learning
 
ACM Dev 2012
ACM Dev 2012ACM Dev 2012
ACM Dev 2012
 
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022
 
final-day1-july2.pptx
final-day1-july2.pptxfinal-day1-july2.pptx
final-day1-july2.pptx
 
Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSC
 
Umesh nfc login application for mobile devices29th nov
Umesh nfc login application for mobile devices29th novUmesh nfc login application for mobile devices29th nov
Umesh nfc login application for mobile devices29th nov
 
Text Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.pptText Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.ppt
 
Recognizing Facial Expression Through Frequency Neural Network.pptx
Recognizing Facial Expression Through Frequency Neural Network.pptxRecognizing Facial Expression Through Frequency Neural Network.pptx
Recognizing Facial Expression Through Frequency Neural Network.pptx
 
Cv huaiping
Cv huaipingCv huaiping
Cv huaiping
 

Recently uploaded

Construction method of steel structure space frame .pptx
Construction method of steel structure space frame .pptxConstruction method of steel structure space frame .pptx
Construction method of steel structure space frame .pptxwendy cai
 
A case study of cinema management system project report..pdf
A case study of cinema management system project report..pdfA case study of cinema management system project report..pdf
A case study of cinema management system project report..pdfKamal Acharya
 
2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edgePaco Orozco
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Aryaabh.arya
 
Arduino based vehicle speed tracker project
Arduino based vehicle speed tracker projectArduino based vehicle speed tracker project
Arduino based vehicle speed tracker projectRased Khan
 
İTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering WorkshopİTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering WorkshopEmre Günaydın
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdfKamal Acharya
 
RS Khurmi Machine Design Clutch and Brake Exercise Numerical Solutions
RS Khurmi Machine Design Clutch and Brake Exercise Numerical SolutionsRS Khurmi Machine Design Clutch and Brake Exercise Numerical Solutions
RS Khurmi Machine Design Clutch and Brake Exercise Numerical SolutionsAtif Razi
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234AafreenAbuthahir2
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Electivekarthi keyan
 
Online resume builder management system project report.pdf
Online resume builder management system project report.pdfOnline resume builder management system project report.pdf
Online resume builder management system project report.pdfKamal Acharya
 
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxCloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxMd. Shahidul Islam Prodhan
 
Explosives Industry manufacturing process.pdf
Explosives Industry manufacturing process.pdfExplosives Industry manufacturing process.pdf
Explosives Industry manufacturing process.pdf884710SadaqatAli
 
Courier management system project report.pdf
Courier management system project report.pdfCourier management system project report.pdf
Courier management system project report.pdfKamal Acharya
 
Introduction to Casting Processes in Manufacturing
Introduction to Casting Processes in ManufacturingIntroduction to Casting Processes in Manufacturing
Introduction to Casting Processes in Manufacturingssuser0811ec
 
fluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answerfluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answerapareshmondalnita
 
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical EngineeringIntroduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical EngineeringC Sai Kiran
 
Danfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdfDanfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdfNurvisNavarroSanchez
 

Recently uploaded (20)

Construction method of steel structure space frame .pptx
Construction method of steel structure space frame .pptxConstruction method of steel structure space frame .pptx
Construction method of steel structure space frame .pptx
 
A case study of cinema management system project report..pdf
A case study of cinema management system project report..pdfA case study of cinema management system project report..pdf
A case study of cinema management system project report..pdf
 
2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Arya
 
Arduino based vehicle speed tracker project
Arduino based vehicle speed tracker projectArduino based vehicle speed tracker project
Arduino based vehicle speed tracker project
 
İTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering WorkshopİTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering Workshop
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
 
RS Khurmi Machine Design Clutch and Brake Exercise Numerical Solutions
RS Khurmi Machine Design Clutch and Brake Exercise Numerical SolutionsRS Khurmi Machine Design Clutch and Brake Exercise Numerical Solutions
RS Khurmi Machine Design Clutch and Brake Exercise Numerical Solutions
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
Online resume builder management system project report.pdf
Online resume builder management system project report.pdfOnline resume builder management system project report.pdf
Online resume builder management system project report.pdf
 
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxCloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
 
Explosives Industry manufacturing process.pdf
Explosives Industry manufacturing process.pdfExplosives Industry manufacturing process.pdf
Explosives Industry manufacturing process.pdf
 
Courier management system project report.pdf
Courier management system project report.pdfCourier management system project report.pdf
Courier management system project report.pdf
 
Introduction to Casting Processes in Manufacturing
Introduction to Casting Processes in ManufacturingIntroduction to Casting Processes in Manufacturing
Introduction to Casting Processes in Manufacturing
 
fluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answerfluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answer
 
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical EngineeringIntroduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
 
Danfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdfDanfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdf
 

Lip Reading.pptx

  • 1. Zeroth Review - 2021 M.Vignesh 221003105 IV-CSE A 221003105@sastra.ac.in S.Mahadevan 221003057 IV-CSE A 221003057@sastra.ac.in Guided By Ms. Bhavani R APII/CSE/SRC/SASTRA
  • 2. PROJECT OVERVIEW • Visual Speech Recognition • Extract lip features • Neural Network model to train the lip sequence • Transcribing and evaluate Lip movements into text 3
  • 3. INTRODUCTION 4 • There are many Recognition System that recognize words from audio features. • Lip reading System is one of the developing technology • Aims to recognize words only by visual feature without audio • Classify and recognize words by visemes movements
  • 4. PROBLEM STATEMENT 5 • Noisy Environment • Speech Speed • Accent • Pronunciation • Facial Features
  • 5. OBJECTIVE 6 • Extract textual or speech data from facial features • Train a Neural network system to process visemes sequence • Develop a Speaker Independed system • Recognize and classify ten different words
  • 6. S/W –H/W REQUIREMENTS 7 SOFTWARE REQUIREMENTS: • Anaconda • System:64 bit OS, x64 processor HARDWARE REQUIREMENTS: • 4 GB RAM • Better GPU(For performance)
  • 7. EXISTING VS PROPOSED SYSTEM 8 Existing System: • Uses BBCLRS2 Dataset • Recognize and classify only ASCII characters and decode words • Complex processing and gives better accuracy after 2000 epochs Proposed System: • Uses MIRACL-VC1 Dataset • Recognize and classify ten different words • Simple and gives better accuracy after 200 epochs
  • 8. 9 S.No Paper Title, author Journal details with Date of publication Methodology applied Merits and demerits 1 Lip-Reading Driven Deep Learning Approach for Speech Enhancement, Ahsan Adeel, Mandar Gogate, Amir Hussain, and William M. Whitmer 2019- IEEE Transaction This paper uses LSTM driven Audio Visual mapping approach. • Increased accuracy • Autonomous Speech enhancement • Poor performance for Realtime speech 2 An audio-visual corpus for multimodal automatic speech recognition, Andrzej Czyzewski, Bozena Kostek, Piotr Bratoszewski,Jozef Kotus, Marcin Szykulski 2017-Springer This paper uses Active Appearance Model(AAM) and Hidden Markov Models(HMM) • Recognize in street noise • Babble noise dramatically worsens the accuracy of speech recognition 3 Extraction of Visual Features for Lipreading, Iain Matthews, Timothy F. Cootes, J. Andrew Bangham, Stephen Cox, Richard Harvey 2017-IEEE Transaction This paper uses Active shape model (ASM) and point distribution model (PDM) • Accuracy improved when a noisy audio signal is augmented with visual information • Poor performance in Babble noise 4 Audio-visual speech recognition using deep learning, Kuniaki Noda, Yuki Yamaguchi, Kazuhiro Nakadai, Hiroshi G. Okuno,Tetsuya Ogata 2014-Springer This paper uses Hidden Markov Model(HMM) • Increased Performance • Reverberation, illumination, and facial orientation, occur 5 Speaker-Independent Speech Recognition using Visual Features, Pooventhiran G., Sandeep A. 2020-IEEE This paper uses 3D-CNN model • Improved Accuracy • Complex LITERATURE SURVEY
  • 10. REFERENCES 11 [1] A. Thanda and S. M. Venkatesan, “Audio visual speech recognition using deep recurrent neural networks,” in IAPR workshop on multimodal pattern recognition of social signals in human- computer interaction. Springer, 2016 [2] E. Petajan, B. Bischoff, D. Bodoff, and N. M. Brooke, “An improved automatic lipreading system to enhance speech recognition,” in Proceedings of the SIGCHI conference on Human factors in computing systems, 1988 [3] A. Torfi, S. M. Iranmanesh, N. Nasrabadi, and J. Dawson, “3d convolutional neural networks for cross audio-visual matching recognition,” IEEE Access, vol. 5, pp. 22 081–22 091, 2017. [4] N. Alothmany, R. Boston, C. Li, S. Shaiman, and J. Durrant, “Classification of visemes using visual cues,” in Proceedings ELMAR-2010. IEEE, 2010. [5] I. Almajai, S. Cox, R. Harvey, and Y. Lan, “Improved speaker independent lip reading using speaker adaptive training and deep neural networks,” in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).