➢ Lip reading, also known as speechreading, is a
technique for understanding speech by visually
interpreting the movement of the lips when
normal sound is not available; speech is
identified from both the shape and the movement
of the lips. This thesis investigates various issues
faced by a visual lip-reading system and proposes
a novel "visual words" based approach to visual
lip reading.
➢ Lip reading is used to understand or interpret speech without hearing it, a
technique especially mastered by people with hearing difficulties. The
ability to lip read enables a person with a hearing impairment to
communicate with others. Recent advances in computer vision, pattern
recognition, and image processing have led to growing interest in this
challenging task.
➢ Is it possible to build a lip-reading system comparable to, or even better
than, a human lip reader?
➢ The human mouth is one of the most deformable parts of the human body,
taking on different appearances such as open, closed, and widely open.
This makes it hard to extract the shape and edges of the lips accurately,
yet the proposed system depends on these features to track lip movement
using different techniques.
➢ Which method is best for lip feature extraction?
➢ When designing the lip-reading system, we need to address several
limitations: there are no clear rules for determining spoken Arabic words,
no fixed dictionary that maps a sequence of video frames to the
corresponding word, and no Arabic visual speech dataset.
➢ Recognizing speech is a very basic task for human beings, which highlights
a significant gap between what current technology offers and what users
require. A fundamental motivation is to help bridge this gap, allowing
future users to use speech technologies without the current limitations
and constraints.
➢ The proposed system should be able to accurately extract the visual
features of the moving lips, which it later relies on for lip tracking and
speech recognition.
➢ Recording training data has been an integral part of the work: training an
Arabic visual speech recognizer requires large quantities of speaker video data.
➢ Vwords can be applied efficiently to speaker identification from a
person's utterance, relying on his or her (to some extent unique) way of
speaking.
➢ This work contributes to lip-reading research by proposing Arabic visual
word recognition methods, which add techniques for localizing the lips,
extracting visual features, and tracking the lips to recognize their
motion.
➢ For both types of lip traits (physiological and behavioral), a comprehensive
study was performed to discover the underlying mechanism of the
discriminatory power of lip biometrics. It presents a detailed analysis of
the role of the various physiological and behavioral features of the lips in
analyzing how a speaker pronounces an Arabic word and how closely that
pronunciation converges across speakers.
➢ A polynomial motion feature is proposed for lip reading.
➢ The newly recorded Arabic database for lip reading can be used in
other biometrics and image processing research studies.
➢ The central contribution of this study to the research community
(particularly the visual speech recognition, VSR, community) is the
development of an accurate and efficient Arabic VSR system using the
proposed Vwords approach.
1. Polynomial Tracking
2. Geometrical Feature
3. VGG16_vsr
4. Deep_vsr results
5. Conclusion
➢ System structure for this work. In the first phase, pre-processing takes
place: the face area is localized, because all the information needed to
track visual speech is located in the mouth region. The region of interest
(ROI), represented by the mouth, is then extracted.
➢ The second phase extracts the features related to visual speech from the
region of interest. In the proposed model, the features extracted from the
movement and tracking of the lip contour are classified as physiological
features, which depend on the shape of the lips to recognize the spoken
word.
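The ROI step of the first phase can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the lip landmark coordinates are already available from some face and landmark detector, and the names `mouth_roi`, `crop`, and the `margin` padding factor are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Box = Tuple[int, int, int, int]


def mouth_roi(lip_points: List[Point], frame_w: int, frame_h: int,
              margin: float = 0.2) -> Box:
    """Return (left, top, right, bottom) of a padded box around the lip points.

    `margin` is a hypothetical padding factor, expressed as a fraction of
    the lip box width and height; it is not a value from the slides.
    """
    if not lip_points:
        raise ValueError("at least one lip point is required")
    xs = [x for x, _ in lip_points]
    ys = [y for _, y in lip_points]
    pad_x = (max(xs) - min(xs)) * margin
    pad_y = (max(ys) - min(ys)) * margin
    # Clip the padded box to the frame so the crop never leaves the image.
    left = max(0, int(min(xs) - pad_x))
    top = max(0, int(min(ys) - pad_y))
    right = min(frame_w, int(max(xs) + pad_x) + 1)
    bottom = min(frame_h, int(max(ys) + pad_y) + 1)
    return left, top, right, bottom


def crop(frame: List[List[int]], box: Box) -> List[List[int]]:
    """Crop a frame stored as a list of pixel rows to the given box."""
    left, top, right, bottom = box
    return [row[left:right] for row in frame[top:bottom]]
```

Padding the box keeps the whole mouth in view when the lips open wide between frames, which matters because the later feature extraction only sees the cropped region.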
➢ The lip motion for an uttered word is then stored
as the coefficients of a polynomial function,
which describes the movement of the lips
through the polynomial equation and represents
that movement by drawing curves.
➢ The curve can be applied to any lip model
because it is adaptive and is not restricted
by the size of the lip model.
➢ Polynomial equation:
$$y = A + Bx + Cx^{2} + Dx^{3} + Ex^{4} + \dots$$
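To make the idea of storing lip motion as polynomial coefficients concrete, here is a minimal least-squares sketch in plain Python. It assumes that x is the frame index and y is the vertical position of one lip control point, which the slides do not state explicitly; the function names `fit_polynomial` and `evaluate` are hypothetical.

```python
from typing import List, Sequence


def fit_polynomial(xs: Sequence[float], ys: Sequence[float],
                   degree: int) -> List[float]:
    """Least-squares coefficients [A, B, C, ...] of y = A + B*x + C*x**2 + ..."""
    n = degree + 1
    if len(xs) != len(ys) or len(xs) < n:
        raise ValueError("need matching inputs with at least degree + 1 samples")
    # Normal equations (V^T V) c = V^T y, where V[i][j] = xs[i] ** j.
    a = [[sum(x ** (r + c) for x in xs) for c in range(n)] for r in range(n)]
    b = [sum(y * x ** r for x, y in zip(xs, ys)) for r in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("samples do not determine a unique polynomial")
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
            b[r] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(a[r][c] * coeffs[c] for c in range(r + 1, n))
        coeffs[r] = (b[r] - tail) / a[r][r]
    return coeffs


def evaluate(coeffs: Sequence[float], x: float) -> float:
    """Evaluate the polynomial at x using Horner's rule."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result
```

The returned list plays the role of A, B, C, ... in the equation above, so a whole trajectory is summarized by `degree + 1` numbers regardless of how many frames the word spans. In practice a numerical library routine such as `numpy.polyfit` would likely be preferred for higher orders, where the normal equations become ill-conditioned.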
➢ Lip motion is synthesized from geometric
features using the facial landmark points that
correspond to the lip region, namely points
49 through 68.
➢ Geometric lip features can be extracted by
measuring the distances between the upper and
lower lip and between the lip corners.
➢ MAR_out is calculated to extract the geometric
shape of the lips using the formula:
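The formula itself does not appear in the text, so the sketch below does not reproduce it. It shows one common way to define a mouth aspect ratio, vertical opening divided by horizontal width, under two loud assumptions: that MAR stands for mouth aspect ratio, and that the landmarks follow the widely used 68-point annotation scheme (1-based, outer lip 49 to 60, inner lip 61 to 68). The index tables and the function name are hypothetical.

```python
import math
from typing import Dict, Tuple

Point = Tuple[float, float]

# 1-based landmark indices, ASSUMING the common 68-point annotation scheme
# (outer lip 49-60, inner lip 61-68). The slides do not name the scheme.
OUTER = {"left": 49, "right": 55, "top": 52, "bottom": 58}
INNER = {"left": 61, "right": 65, "top": 63, "bottom": 67}


def mouth_aspect_ratio(landmarks: Dict[int, Point], idx: Dict[str, int]) -> float:
    """Vertical lip opening divided by horizontal mouth width.

    This is an assumed, simplified definition, not the slide's formula.
    """
    width = math.dist(landmarks[idx["left"]], landmarks[idx["right"]])
    if width == 0:
        raise ValueError("mouth corners coincide")
    height = math.dist(landmarks[idx["top"]], landmarks[idx["bottom"]])
    return height / width
```

Calling `mouth_aspect_ratio(landmarks, OUTER)` gives an outer-contour ratio and `mouth_aspect_ratio(landmarks, INNER)` an inner-contour one. Because the value is a ratio, it is insensitive to the speaker's distance from the camera.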
➢ Deep learning techniques offer effective
solutions to the problem of automatic feature
extraction; the proposed model is based
mainly on the VGG16 network.
➢ The model consists of two main parts. The first
part extracts the visual features, that is, the
information that represents the spoken word;
the second part is the classifier, which uses
those extracted features to recognize the
spoken word.
➢ A pre-trained VGG16_vsr model is used for
word-level lip reading in Arabic to increase
word prediction accuracy. The proposed method,
based on image processing and transfer learning
together with fine-tuning and data augmentation,
achieved high efficiency and accuracy.
➢ The proposed "visual words" (Vwords) scheme uses geometric measurements of
the lips to tackle the VSR problem, with the system recognizing the whole
word. In this approach, a word is represented by a signature that consists of
several feature vectors (signals). Each signal is constructed from temporal
measurements of its associated feature, for instance the mouth height
measured over the duration of a spoken word.
➢ Using MAR_out and the inner MAR together with three key points on the lip
contour increases lip-tracking accuracy for recognizing visual words.
➢ Slopes occur along the path of lip movement across frames at different
times, and they vary with the peaks in the speech movement at each time.
From implementing our VSR model on a polynomial equation, we conclude that
a high-order polynomial is needed to simulate the curve traced by the
control points. A high order makes it possible to analyze the visual speech
and to store the visual information in the function parameters, which are
later represented as a curve, and the resulting curve is less curvy and
wobbly.
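The word signature described in the first conclusion above can be sketched as below. This is an illustration under assumptions, not the authors' code: the feature names `height` and `width` are hypothetical, and the linear resampling to a fixed length is an added convenience for comparing words of different durations, which the slides do not mention.

```python
from typing import Dict, List, Sequence


def build_signature(frames: Sequence[Dict[str, float]]) -> Dict[str, List[float]]:
    """Collect each per-frame feature into one temporal signal per feature."""
    if not frames:
        raise ValueError("a word needs at least one frame")
    names = frames[0].keys()
    return {name: [frame[name] for frame in frames] for name in names}


def resample(signal: Sequence[float], length: int) -> List[float]:
    """Linearly interpolate a signal to a fixed number of samples."""
    if length < 2 or len(signal) < 2:
        raise ValueError("need at least two input and two output samples")
    step = (len(signal) - 1) / (length - 1)
    out = []
    for i in range(length):
        pos = i * step
        lo = min(int(pos), len(signal) - 2)  # keep lo + 1 inside the signal
        frac = pos - lo
        out.append(signal[lo] * (1 - frac) + signal[lo + 1] * frac)
    return out
```

For example, three frames with mouth heights 0, 2, and 4 yield the signal `[0.0, 2.0, 4.0]`, and the full signature is the dictionary of all such signals for the word.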
"Thank you all"


Editor's Notes

  1. Lip reading is used to understand or interpret speech without hearing it, a technique especially mastered by people with hearing difficulties. The ability to lip read enables a person with a hearing impairment to communicate with others. Recent advances in computer vision, pattern recognition, and signal processing have led to growing interest in the challenging task of lip reading. The process is referred to as visual speech recognition. Lip reading, also known as speechreading, is a technique for understanding speech by visually interpreting the moving lips when normal sound is not available, where speech is validated by both the shape and the movement of the lips. The research presented here addresses various issues faced by a visual lip reading system and proposes a novel "visual words" based approach to visual lip reading. It aims to determine how individual Arabic words can be guessed from visual cues such as lip movement and extracted shape. A three-step process was developed to achieve this goal: first, the face is located; second, the lip region is targeted; and finally, the movement of the speaking lips is analyzed to determine what is being said.
  2. Is it possible to build a lip reading system comparable to, or even better than, a human lip reader? The human mouth is one of the most deformable parts of the human body, leading to different appearances such as mouth open, closed, and wide open, so there are problems in accurately extracting the shape and edge of the lips, features on which the proposed system depends for tracking lip movement using different techniques. Which method is best for lip feature extraction? When designing a lip reading system, we need to address the following limitations: there are no clear rules for determining spoken Arabic words, there is no fixed dictionary that can translate a sequence of video frames into a corresponding word, and there is no Arabic visual speech dataset.
  3. Recognizing speech is a very basic task for human beings, which highlights a significant gap between the possibilities offered by current technology and user requirements. A fundamental motivation is to contribute to bridging this gap, allowing future users to use speech technologies without the current limitations and constraints. 2. Propose a system able to accurately extract the visual features of lip movement, on which the system later relies for lip tracking and speech recognition. Recording the training data was an integral part of the work: training an Arabic visual speech recognizer requires large amounts of speaker video data. Vwords can be applied efficiently to identify the speaker from the person's utterance, based on their distinct (somewhat unique) way of speaking.
  4. 1. The study presented in this work contributes to lip reading research through the proposed Arabic visual word recognition methods, which add techniques for lip localization, visual feature extraction, and lip movement tracking and recognition. 2. The central contribution of this study to the research community (especially the VSR community) is the development of an accurate and efficient Arabic VSR system using the proposed Vwords approach. 3. The proposed polynomial motion feature for lip reading. 4. For both types of lip features (physiological and behavioral), a comprehensive study was carried out to uncover the mechanism behind the discriminative power of lip biometrics, with a detailed analysis of the role of the different physiological and behavioral lip features in analyzing the way a speaker pronounces an Arabic word and how closely that pronunciation matches across speakers. 5. The newly recorded Arabic database for lip reading can be used in other research studies related to biometrics and image processing.
  5. System architecture of this work. In the first stage, preprocessing is carried out: the face region is located, since all the information needed to track visual speech lies in the mouth region. The region of interest (ROI), represented by the extracted mouth, is then obtained. The second stage extracts the features related to visual speech from the region of interest. In the proposed model, the features extracted from the movement and tracking of the lip contour are classified as physiological features that rely on the shape of the lips to recognize the spoken word.
  6. The lip movement of the spoken word is then stored as the parameters of a polynomial function, which describes the lip movement with a polynomial equation and represents this movement as plotted curves. The curve can be applied to any lip model because it is adaptive and is not tied to the size of the lip model.
  7. Deep learning techniques offer ideal solutions to the problem of automatic feature extraction, and the proposed model is based mainly on the VGG16 network. The model consists of two main parts: the first extracts the visual features, which carry the information that represents the spoken word, and the second is the classifier, which uses those extracted features to recognize the spoken word.
  8. A pre-trained VGG16_vsr is used for word-level lip reading in Arabic to increase the accuracy of word prediction. The proposed method, based on image processing and transfer learning together with fine-tuning and data augmentation, delivered high efficiency and accuracy in system performance.
  9. The slopes occur along the line of lip movement across the frames at different times, and the variation in the slopes may depend on the peak of the speech movement at that time. When implementing our VSR model on a polynomial equation, we therefore concluded that a high-order polynomial function should be used to model the curve traced by the control points: the high order makes it possible to analyze the visual speech and store the visual information in the function parameters, which are later represented as a curve, and the resulting curve is less curvy and wobbly. The proposed "visual words" (Vwords) scheme uses geometric lip measurements to tackle the VSR problem, with the system recognizing the whole word. In this approach, a word is represented by a signature that consists of several feature vectors. Each signal is constructed from temporal measurements of its associated feature; for example, the mouth height is measured over the duration of the spoken word.
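
The idea in notes 6 and 9 of storing a lip movement in the parameters of a polynomial can be sketched as a least-squares fit of one feature signal (for example, mouth height per frame). This is only an illustration under stated assumptions: the slides give neither the polynomial order nor the fitting procedure, so the default `degree`, the normalization of time to [0, 1], and the function names `polyfit`, `polyval`, and `word_curve` are all hypothetical.

```python
def polyfit(ts, ys, degree):
    """Least-squares polynomial fit via the normal equations.

    Returns coefficients c such that y ~ c[0] + c[1]*t + ... + c[degree]*t**degree.
    """
    n = degree + 1
    if len(ts) != len(ys) or len(ts) < n:
        raise ValueError("need at least degree + 1 matching samples")
    # Normal equations A c = b with A[i][j] = sum(t^(i+j)), b[i] = sum(y * t^i).
    A = [[sum(t ** (i + j) for t in ts) for j in range(n)] for i in range(n)]
    b = [sum(y * t ** i for t, y in zip(ts, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("singular system; sample times are not distinct enough")
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, n):
            f = A[r][col] / A[col][col]
            for c in range(col, n):
                A[r][c] -= f * A[col][c]
            b[r] -= f * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(A[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - s) / A[i][i]
    return coeffs


def polyval(coeffs, t):
    """Evaluate the fitted polynomial at t using Horner's rule."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * t + c
    return result


def word_curve(signal, degree=3):
    """Fit one feature signal of a word; `degree` is an assumed value.

    Frame indices are rescaled to [0, 1] so that words spoken over a
    different number of frames yield comparable coefficient vectors.
    """
    n = len(signal)
    if n < 2 or n < degree + 1:
        raise ValueError("signal too short for the requested degree")
    ts = [i / (n - 1) for i in range(n)]
    return polyfit(ts, signal, degree)
```

The coefficient list returned by `word_curve` is the compact representation of the movement, and `polyval` redraws the curve from it at any normalized time. Solving the normal equations directly is adequate for a short sketch, but it becomes ill-conditioned as the order grows; a library routine based on orthogonal decomposition would be the more robust choice for high orders.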