SlideShare a Scribd company logo
1 of 40
Bayesian Decision Theory
Case Studies
CS479/679 Pattern Recognition
Dr. George Bebis
Case Study 1
• A. Madabhushi and J. Aggarwal, A bayesian approach
to human activity recognition, 2nd International
Workshop on Visual Surveillance, pp. 25-30, June
1999.
Human activity recognition
• Recognize human actions using
visual information.
– Useful for monitoring of human
activity in department stores,
airports, high-security buildings
etc.
• Building systems that can
recognize any type of action is a
difficult and challenging
problem.
Goal (this paper)
• Build a system for recognizing the following 10
actions, from a frontal or lateral view:
• sitting down
• standing up
• bending down
• getting up
• hugging
• squatting
• rising from a squatting position
• bending sideways
• falling backward
• walking
Multi-class
classification
problem
10 x 2 = 20 classes
Motivation
• People sit, stand, walk, bend down, and get up
in a more or less similar fashion.
• Human actions can be recognized by tracking
various body parts.
Approach (this paper)
• Use head motion trajectory
– The head of a person moves in
a characteristic fashion during
these actions.
• Recognition is formulated as
Bayesian classification using
the movement of the head
over consecutive frames.
Main Steps
input output
Two models are
computed for each
action, corresponding
to the frontal and
lateral views
(i.e., 20 models total).
Strengths and Weaknesses
• Strengths
– The system can recognize actions where the gait of
the subject in the test sequence differs considerably
from the training sequences.
– It can recognize actions for people of varying physical
appearance (i.e., tall, short, fat, thin etc.).
• Limitations
– Only actions in the frontal or lateral view can be
recognized successfully by this system.
– Some non-realistic assumptions.
Representing Actions (i.e., features)
• Estimate the centroid of the head in each
frame:
• Take absolute differences in successive
frames:
| | |
|
,
features
Head Detection and Tracking
• Accurate head detection and tracking are
challenging problems.
• The centroid of the head was tracked
manually from frame to frame.
Bayesian Formulation
• Given an input sequence, the posterior
probabilities are computed for each action
(frontal or lateral) using the Bayes rule:
(i=1, 2, …, 20)
Valid assumption?
Likelihood
• Feature vectors X and Y are assumed to be
independent, following a multivariate
Gaussian distribution:
Valid assumption?
1
/2 1/2
1 1
( / ) exp[ ( ) ( )]
(2 ) | | 2
t
i iX iX iX
n
iX
p X X X
  


    

1
/2 1/2
1 1
( / ) exp[ ( ) ( )]
(2 ) | | 2
t
i iY iY iY
n
iY
p Y X X
  


    

( , / ) ( / ) ( / )
i i i
p X Y p X p Y
  

Parameter Estimation
(Bayesian Estimation – we’ll cover later)
• The sample means are used to estimate μiX
and μiY
• The sample covariance matrices are used to
estimate ΣiX and ΣiY :
1
1 n
i ij
j
X
n
 

 
1
1 n
iY ij
j
Y
n


 
1
1
( )( )
1
n
T
i ij iX ij iX
j
X X
n
 


   


1
1
( )( )
1
n
T
iY ij iY ij iY
j
Y Y
n
 

   


Action Classification
• Given an input sequence, the posterior
probability is computed for each action.
• The unknown action is classified based on the
most likely action:
Pi=
Discriminating Among
Similar Actions
• In some actions, the head moves in a similar
fashion, making it difficult to distinguish these
actions from one another.
• Heuristics are used to distinguish among
similar actions.
– e.g., when bending down, the head goes much
lower than when sitting down.
Training
• A fixed CCD camera working at 2 frames per
second was used to obtain the training data.
– They collected a total of 38 sequences (i.e., each
person performed all the actions in both the
frontal and lateral views).
– People of diverse physical appearance were used
to model the actions.
– Subjects were asked to perform the actions at a
comfortable pace.
Training (cont’d)
• Assumptions
– It was found that each action can be completed
within 10 frames.
– Only the first 10 frames from each sequence were
used for training (i.e., 5 seconds)
Valid assumption?
Testing
• 39 sequences (different from training) were
used for testing.
• Only the first 10 frames from each sequence were used
for testing.
Of the 8
sequences
classified
incorrectly, 6
were assigned to
the correct action
but to the wrong
view.
Practical Issues/Limitations
• How would one find the first and last frames of
an action in general (segmentation)?
• How would one deal with actions performed at
various speeds or with incomplete sequences
(i.e., missing frames)?
• How would one deal with different viewpoints?
– i.e., other than front or lateral views
Case Study 2
• J. Yang and A. Waibel, A Real-time Face Tracker,
Proceedings of WACV, 1996.
Overview
• Goal: build a system that can detect and track
a person’s face while the person moves freely
in some environment.
– Useful in a number of applications such as video
conference, visual surveillance, face recognition,
etc.
• Approach: model skin color distribution using
statistical models.
Why Using Skin Color?
• Traditional systems for
face detection rely on
template matching or
facial features.
– Not very robust and time
consuming.
• Skin-color leads to faster
and more robust face
detection.
Main Steps (this paper)
(1) Detect human faces using a generic skin-
color model.
(2) Track face of interest by controlling the
camera position and zoom.
(3) Adapt skin-color model parameters based
on face appearance and lighting conditions.
Main System Components
• A probabilistic model to characterize skin-color
distributions of human faces.
• A motion model to estimate human motion and
to predict search window in the next frame.
• A camera model to predict camera motion (i.e.,
camera’s response was much slower than frame
rate).
How is Skin Color Affected?
• Skin color is influenced by several factors:
– Skin color varies from person to person.
– Skin color can be affected by ambient light,
motion etc.
– Different cameras produce significantly different
color values, even for the same person.
What Color Space Should be Used?
• RGB is the most common color space but not
the best one for characterizing skin-color (i.e.,
it represents not only color but also
brightness).
• Represent skin-color using normalized RGB
(chromatic space) which is defined as follows:
(Note: the normalized blue value is redundant since r + g + b = 1)
r
g
 
  
 
x features
Skin-Color Clustering
• Skin colors do not fall randomly in the
chromatic color space but actually form a
cluster.
Note: this was actually a color image but shown as grayscale since printing
color images back in the 90s was a bit costly!
Skin-Color Clustering (cont’d)
• Skin-colors of different people are also
clustered in chromatic color space
– i.e., they differ more in brightness than in color.
(skin-color distribution of 40 people - different races)
Example:
Model Skin-Color Distribution
Using a 2D Gaussian
• Experiments under different lighting conditions and
persons revealed that the skin-color distribution has
a rather regular shape.
• Represent skin-color distribution using a 2D Gaussian
with mean μ and covariance Σ:
Examples:
1
/2 1/2
1 1
( ) exp[ ( ) ( )]
(2 ) | | 2
t
n
g  


    

x x x
r
g
 
  
 
x
Parameter Estimation
(Bayesian Estimation – we’ll cover later)
• Collect skin-color regions from a set of face
images.
• Estimate the mean and covariance using the
sample mean and sample covariance:
Face Detection Using
Skin-Color Model
• Each pixel x in the input image is:
(1) converted into the chromatic color space.
(2) compared with the distribution of the skin-color
model.
• Only the skin color class is modeled in this case
which requires choosing an optimum threshold t.
1
/2 1/2
1 1
( ) exp[ ( ) ( )]
(2 ) | | 2
t
n
g  


    

x x x
Example
Input Classification
Modeling both skin and non-skin
color distributions
• One can also model the non-skin color distribution.
• Becomes a two-class classification: skin vs non-skin
• Compute max posterior probability using Bayes rule.
• How does this approach compare with the previous
approach?
– No need to explicitly select a threshold t.
– Prior probabilities are needed for each class.
– Modeling the non-skin color distribution is challenging.
Dealing with skin-color-like objects
• It is impossible in general to detect only faces
simply from the result of color matching.
– e.g., background may contain skin colors
False positives
Dealing with skin-color-like objects
(cont’d)
• Additional information could be used to
reject false positives (e.g., shape & motion
information)
Skin-color model adaptation
• If a person is moving, the apparent skin colors
change as the person’s position relative to the
camera or light changes.
• Idea: adapt model parameters (μ,Σ) to account
for such changes.
Skin-color model adaptation (cont’d)
• The weights ai, bi, ci determine how much past
estimates will influence current parameters.
• N determines how long the past estimates will
influence the current estimates.
μ𝑟=𝑟
μ𝑔=𝑔
System initialization
• Automatic mode
– A general skin-color model is used to identify skin-
color regions.
– Motion and shape information is used to reject
non-face regions.
– The largest face region is selected (i.e., face
closest to the camera).
– Skin-color model is adapted to the face being
tracked.
System initialization (cont’d)
• Interactive mode
– The user selects a point on the face of interest
using the mouse.
– The tracker searches around the point to find the
face using a general skin-color model.
– Skin-color model is adapted to the face being
tracked.
Detection Results
Search window
Face
Occlusion
Viewpoint changes

More Related Content

Similar to BayesianDecisionTheoryCaseStudiesf .pptx

pattern recognition.ppt
pattern recognition.pptpattern recognition.ppt
pattern recognition.pptSowmiyaBaskar4
 
Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorSoma Boubou
 
Detection and recognition of face using neural network
Detection and recognition of face using neural networkDetection and recognition of face using neural network
Detection and recognition of face using neural networkSmriti Tikoo
 
Gesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition AlgorithmsGesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition AlgorithmsIJRES Journal
 
SIRG-BSU_3_used-important.pdf
SIRG-BSU_3_used-important.pdfSIRG-BSU_3_used-important.pdf
SIRG-BSU_3_used-important.pdfDrAhmedElngar
 
Human pose estimation with deep learning
Human pose estimation with deep learningHuman pose estimation with deep learning
Human pose estimation with deep learningengiyad95
 
Data Mining - Facial Expression Recognition
Data Mining - Facial Expression RecognitionData Mining - Facial Expression Recognition
Data Mining - Facial Expression RecognitionArshiaSali1
 
photo detection in personal photo collection
photo detection in personal photo collectionphoto detection in personal photo collection
photo detection in personal photo collectionsonalijagtap15
 
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)Abdulrahman Kerim
 
Pattern recognition and Machine Learning.
Pattern recognition and Machine Learning.Pattern recognition and Machine Learning.
Pattern recognition and Machine Learning.Rohit Kumar
 
Image segmentation
Image segmentationImage segmentation
Image segmentationKuppusamy P
 
An analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classificationAn analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classificationSubhashis Hazarika
 
Computer vision and robotics
Computer vision and roboticsComputer vision and robotics
Computer vision and roboticsBiniam Asnake
 
Motion and tracking
Motion and trackingMotion and tracking
Motion and trackingpotaters
 

Similar to BayesianDecisionTheoryCaseStudiesf .pptx (20)

pattern recognition.ppt
pattern recognition.pptpattern recognition.ppt
pattern recognition.ppt
 
face detection
face detectionface detection
face detection
 
Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptor
 
10833762.ppt
10833762.ppt10833762.ppt
10833762.ppt
 
Detection and recognition of face using neural network
Detection and recognition of face using neural networkDetection and recognition of face using neural network
Detection and recognition of face using neural network
 
Gesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition AlgorithmsGesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
 
SIRG-BSU_3_used-important.pdf
SIRG-BSU_3_used-important.pdfSIRG-BSU_3_used-important.pdf
SIRG-BSU_3_used-important.pdf
 
Human pose estimation with deep learning
Human pose estimation with deep learningHuman pose estimation with deep learning
Human pose estimation with deep learning
 
Data Mining - Facial Expression Recognition
Data Mining - Facial Expression RecognitionData Mining - Facial Expression Recognition
Data Mining - Facial Expression Recognition
 
photo detection in personal photo collection
photo detection in personal photo collectionphoto detection in personal photo collection
photo detection in personal photo collection
 
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
 
Pattern recognition and Machine Learning.
Pattern recognition and Machine Learning.Pattern recognition and Machine Learning.
Pattern recognition and Machine Learning.
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
 
An analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classificationAn analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classification
 
Computer vision and robotics
Computer vision and roboticsComputer vision and robotics
Computer vision and robotics
 
Motion and tracking
Motion and trackingMotion and tracking
Motion and tracking
 
Final year ppt
Final year pptFinal year ppt
Final year ppt
 
Face recognition
Face recognition Face recognition
Face recognition
 
FaceRecognition.ppt
FaceRecognition.pptFaceRecognition.ppt
FaceRecognition.ppt
 
FaceRecognition.ppt
FaceRecognition.pptFaceRecognition.ppt
FaceRecognition.ppt
 

Recently uploaded

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 

Recently uploaded (20)

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 

BayesianDecisionTheoryCaseStudiesf .pptx

  • 1. Bayesian Decision Theory Case Studies CS479/679 Pattern Recognition Dr. George Bebis
  • 2. Case Study 1 • A. Madabhushi and J. Aggarwal, A bayesian approach to human activity recognition, 2nd International Workshop on Visual Surveillance, pp. 25-30, June 1999.
  • 3. Human activity recognition • Recognize human actions using visual information. – Useful for monitoring of human activity in department stores, airports, high-security buildings etc. • Building systems that can recognize any type of action is a difficult and challenging problem.
  • 4. Goal (this paper) • Build a system for recognizing the following 10 actions, from a frontal or lateral view: • sitting down • standing up • bending down • getting up • hugging • squatting • rising from a squatting position • bending sideways • falling backward • walking Multi-class classification problem 10 x 2 = 20 classes
  • 5. Motivation • People sit, stand, walk, bend down, and get up in a more or less similar fashion. • Human actions can be recognized by tracking various body parts.
  • 6. Approach (this paper) • Use head motion trajectory – The head of a person moves in a characteristic fashion during these actions. • Recognition is formulated as Bayesian classification using the movement of the head over consecutive frames.
  • 7. Main Steps input output Two models are computed for each action, corresponding to the frontal and lateral views (i.e., 20 models total).
  • 8. Strengths and Weaknesses • Strengths – The system can recognize actions where the gait of the subject in the test sequence differs considerably from the training sequences. – It can recognize actions for people of varying physical appearance (i.e., tall, short, fat, thin etc.). • Limitations – Only actions in the frontal or lateral view can be recognized successfully by this system. – Some non-realistic assumptions.
  • 9. Representing Actions (i.e., features) • Estimate the centroid of the head in each frame: • Take absolute differences in successive frames: | | | | , features
  • 10. Head Detection and Tracking • Accurate head detection and tracking are challenging problems. • The centroid of the head was tracked manually from frame to frame.
  • 11. Bayesian Formulation • Given an input sequence, the posterior probabilities are computed for each action (frontal or lateral) using the Bayes rule: (i=1, 2, …, 20) Valid assumption?
  • 12. Likelihood • Feature vectors X and Y are assumed to be independent, following a multivariate Gaussian distribution: Valid assumption? 1 /2 1/2 1 1 ( / ) exp[ ( ) ( )] (2 ) | | 2 t i iX iX iX n iX p X X X            1 /2 1/2 1 1 ( / ) exp[ ( ) ( )] (2 ) | | 2 t i iY iY iY n iY p Y X X            ( , / ) ( / ) ( / ) i i i p X Y p X p Y    
  • 13. Parameter Estimation (Bayesian Estimation – we’ll cover later) • The sample means are used to estimate μiX and μiY • The sample covariance matrices are used to estimate ΣiX and ΣiY : 1 1 n i ij j X n      1 1 n iY ij j Y n     1 1 ( )( ) 1 n T i ij iX ij iX j X X n           1 1 ( )( ) 1 n T iY ij iY ij iY j Y Y n         
  • 14. Action Classification • Given an input sequence, the posterior probability is computed for each action. • The unknown action is classified based on the most likely action: Pi=
  • 15. Discriminating Among Similar Actions • In some actions, the head moves in a similar fashion, making it difficult to distinguish these actions from one another. • Heuristics are used to distinguish among similar actions. – e.g., when bending down, the head goes much lower than when sitting down.
  • 16. Training • A fixed CCD camera working at 2 frames per second was used to obtain the training data. – They collected a total of 38 sequences (i.e., each person performed all the actions in both the frontal and lateral views). – People of diverse physical appearance were used to model the actions. – Subjects were asked to perform the actions at a comfortable pace.
  • 17. Training (cont’d) • Assumptions – It was found that each action can be completed within 10 frames. – Only the first 10 frames from each sequence were used for training (i.e., 5 seconds) Valid assumption?
  • 18. Testing • 39 sequences (different from training) were used for testing. • Only the first 10 frames from each sequence were used for testing. Of the 8 sequences classified incorrectly, 6 were assigned to the correct action but to the wrong view.
  • 19. Practical Issues/Limitations • How would one find the first and last frames of an action in general (segmentation)? • How would one deal with actions performed at various speeds or with incomplete sequences (i.e., missing frames)? • How would one deal with different viewpoints? – i.e., other than front or lateral views
  • 20. Case Study 2 • J. Yang and A. Waibel, A Real-time Face Tracker, Proceedings of WACV, 1996.
  • 21. Overview • Goal: build a system that can detect and track a person’s face while the person moves freely in some environment. – Useful in a number of applications such as video conference, visual surveillance, face recognition, etc. • Approach: model skin color distribution using statistical models.
  • 22. Why Using Skin Color? • Traditional systems for face detection rely on template matching or facial features. – Not very robust and time consuming. • Skin-color leads to faster and more robust face detection.
  • 23. Main Steps (this paper) (1) Detect human faces using a generic skin- color model. (2) Track face of interest by controlling the camera position and zoom. (3) Adapt skin-color model parameters based on face appearance and lighting conditions.
  • 24. Main System Components • A probabilistic model to characterize skin-color distributions of human faces. • A motion model to estimate human motion and to predict search window in the next frame. • A camera model to predict camera motion (i.e., camera’s response was much slower than frame rate).
  • 25. How is Skin Color Affected? • Skin color is influenced by several factors: – Skin color varies from person to person. – Skin color can be affected by ambient light, motion etc. – Different cameras produce significantly different color values, even for the same person.
  • 26. What Color Space Should be Used? • RGB is the most common color space but not the best one for characterizing skin-color (i.e., it represents not only color but also brightness). • Represent skin-color using normalized RGB (chromatic space) which is defined as follows: (Note: the normalized blue value is redundant since r + g + b = 1) r g        x features
  • 27. Skin-Color Clustering • Skin colors do not fall randomly in the chromatic color space but actually form a cluster. Note: this was actually a color image but shown as grayscale since printing color images back in the 90s was a bit costly!
  • 28. Skin-Color Clustering (cont’d) • Skin-colors of different people are also clustered in chromatic color space – i.e., they differ more in brightness than in color. (skin-color distribution of 40 people - different races) Example:
  • 29. Model Skin-Color Distribution Using a 2D Gaussian • Experiments under different lighting conditions and persons revealed that the skin-color distribution has a rather regular shape. • Represent skin-color distribution using a 2D Gaussian with mean μ and covariance Σ: Examples: 1 /2 1/2 1 1 ( ) exp[ ( ) ( )] (2 ) | | 2 t n g           x x x r g        x
  • 30. Parameter Estimation (Bayesian Estimation – we’ll cover later) • Collect skin-color regions from a set of face images. • Estimate the mean and covariance using the sample mean and sample covariance:
  • 31. Face Detection Using Skin-Color Model • Each pixel x in the input image is: (1) converted into the chromatic color space. (2) compared with the distribution of the skin-color model. • Only the skin color class is modeled in this case which requires choosing an optimum threshold t. 1 /2 1/2 1 1 ( ) exp[ ( ) ( )] (2 ) | | 2 t n g           x x x
  • 33. Modeling both skin and non-skin color distributions • One can also model the non-skin color distribution. • Becomes a two-class classification: skin vs non-skin • Compute max posterior probability using Bayes rule. • How does this approach compare with the previous approach? – No need to explicitly select a threshold t. – Prior probabilities are needed for each class. – Modeling the non-skin color distribution is challenging.
  • 34. Dealing with skin-color-like objects • It is impossible in general to detect only faces simply from the result of color matching. – e.g., background may contain skin colors False positives
  • 35. Dealing with skin-color-like objects (cont’d) • Additional information could be used to reject false positives (e.g., shape & motion information)
  • 36. Skin-color model adaptation • If a person is moving, the apparent skin colors change as the person’s position relative to the camera or light changes. • Idea: adapt model parameters (μ,Σ) to account for such changes.
  • 37. Skin-color model adaptation (cont’d) • The weights ai, bi, ci determine how much past estimates will influence current parameters. • N determines how long the past estimates will influence the current estimates. μ𝑟=𝑟 μ𝑔=𝑔
  • 38. System initialization • Automatic mode – A general skin-color model is used to identify skin- color regions. – Motion and shape information is used to reject non-face regions. – The largest face region is selected (i.e., face closest to the camera). – Skin-color model is adapted to the face being tracked.
  • 39. System initialization (cont’d) • Interactive mode – The user selects a point on the face of interest using the mouse. – The tracker searches around the point to find the face using a general skin-color model. – Skin-color model is adapted to the face being tracked.