SlideShare a Scribd company logo
Describing People: A Poselet-Based
Approach to Attribute Classification

               Lubomir Bourdev1,2
                Subhransu Maji1
                 Jitendra Malik1



  1EECS   U.C. Berkeley    2Adobe   Systems Inc.
Goal: Extract attributes from
      images of people
Who has long hair?
Who has short pants?
Male or female?
Prior work
on poselets and on attributes
Prior work on Poselets
•   Introduced by [Bourdev and Malik, ICCV09]
•   Detection with poselets [Bourdev et al, ECCV10]
•   Applications
    •   Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11]
    •   Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11]
    •   Human parsing [Wang et al, CVPR11]
    •   Semantic contours [Hariharan et al, ICCV11]
    •   Subordinate level categorization [Farrell et al, ICCV11]
Prior work on Poselets
•   Introduced by [Bourdev and Malik, ICCV09]
•   Detection with poselets [Bourdev et al, ECCV10]
•   Applications
    •   Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11]
    •   Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11]
    •   Human parsing [Wang et al, CVPR11]
    •   Semantic contours [Hariharan et al, ICCV11]
    •   Subordinate level categorization [Farrell et al, ICCV11]
Prior work on Poselets
•   Introduced by [Bourdev and Malik, ICCV09]
•   Detection with poselets [Bourdev et al, ECCV10]
•   Applications
    •   Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11]
    •   Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11]
    •   Human parsing [Wang et al, CVPR11]
    •   Semantic contours [Hariharan et al, ICCV11]
    •   Subordinate level categorization [Farrell et al, ICCV11]
Prior work on Poselets
•   Introduced by [Bourdev and Malik, ICCV09]
•   Detection with poselets [Bourdev et al, ECCV10]
•   Applications
    •   Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11]
    •   Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11]
    •   Human parsing [Wang et al, CVPR11]
    •   Semantic contours [Hariharan et al, ICCV11]
    •   Subordinate level categorization [Farrell et al, ICCV11]
Prior work on Poselets
•   Introduced by [Bourdev and Malik, ICCV09]
•   Detection with poselets [Bourdev et al, ECCV10]
•   Applications
    •   Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11]
    •   Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11]
    •   Human parsing [Wang et al, CVPR11]
    •   Semantic contours [Hariharan et al, ICCV11]
    •   Subordinate level categorization [Farrell et al, ICCV11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11][Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al,
CVPR10][Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson
el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al,
CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al,
ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Attributes and actions
Discovering attributes from text                 Active learning with attributes
Discovering attributes from images               Attributes of people
Attributes from motion capture                   Gender attribute
Joint learning of classes & attributes
Image retrieval with attributes
[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Prior work on Attributes
Attributes as intermediate parts                 Image retrieval with attributes
Discovering attributes from text                 Attributes and actions
Discovering attributes from images               Active learning with attributes
Attributes from motion capture                   Attributes of people
Joint learning of classes & attributes           Gender attribute

[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02]
[Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08]
[Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al,
BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10]
[Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al,
ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11]
[Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11]
[Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
Poselets
for Attribute Classification
Male or female?
Gender recognition is easier if we
      factor out the pose
Poselets




      [Bourdev & Malik ICCV09]
Poselets




Examples may differ visually but have common semantics
How do we train a poselet?
Finding correspondences at training time




Given part of a human   How do we find a similar
pose                    pose configuration in the
                        training set?
Finding correspondences at training time




                  Left Shoulder

                  Left Hip


We use keypoints to annotate the joints, eyes, nose,
 etc. of people
Finding correspondences at training time




          Residual Error
Training poselet classifiers


Residual   0.15   0.20   0.10    0.85   0.15    0.35
Error:

1.   Given a seed patch
2.   Find the closest patch for every other person
3.   Sort them by residual error
4.   Threshold them
Training poselet classifiers



1.   Given a seed patch
2.   Find the closest patch for every other person
3.   Sort them by residual error
4.   Threshold them
5.   Use them as positive training examples to train
     a linear SVM with HOG features
Attribute Classification Algorithm
           at Test Time
Goal: Extract attributes of this person
Goal: Extract attributes of this person




              Target person bounds
     Input:
              Bounds of other nearby people
Step 1: Detect poselet activations




                 [Bourdev et al, ECCV10]
Step 2: Cluster the activations




                [Bourdev et al, ECCV10]
Step 3: Predict person bounds




               [Bourdev et al, ECCV10]
Step 4: Identify the correct cluster




                 Max-flow in bipartite graph
Start with its poselet activations




Poselet
Activations
Features
 •   Pyramid HOG
 •   LAB histogram
 •   Skin features
     •     Hands-skin
     •     Legs-skin
                         Poselet   Skin   Arms   B .* C
                         patch     mask   mask

Features


Poselet
Activations
Attribute Classification Overview




Poselet-level
Attribute
Classifiers
Features


Poselet
Activations
Attribute Classification Overview


Person-level
Attribute
Classifiers

Poselet-level
Attribute
Classifiers
Features


Poselet
Activations
Attribute Classification Overview
Context-level
Attribute
Classifiers
Person-level
Attribute
Classifiers

Poselet-level
Attribute
Classifiers
Features


Poselet
Activations
Results
Our dataset
•   Source: VOC 2010 trainval for Person + H3D

•   ~8000 annotations (4000 train + 4000 test)

•   9 binary attributes specified by 5 independent annotators via AMT

•   Ground truth label: If 4 of the 5 agree

•   Dataset will be made publicly available
Visual search on our test set
“Wears hat”




“Female”
“Has long hair”




“Wears glasses”
“Wears shorts”




“Has long sleeves”
“Doesn’t have long sleeves”
Our baseline
•   Canny-modulated HOG with SPM kernel [Lazebnik et al CVPR06]

•   To help the baseline trained separate SPM for four viewpoints:




      Full view     Head zoom       Upper body          Legs


•   For each attribute we pick the best SPM as our baseline
Precision/recall on our test set
Label     -   ---
frequency


SPM
          ___
No        ___
context

Full      ___
Model
State-of-the-art Gender Recognition
• We outperform Cognitec (top-notch face
  recognizer)
• We outperform any gender recognizer based on
  frontal faces (are there others?)
  • 61% of our test have frontal faces.
  • Even with perfect classification of frontal faces,
      max AP=80.5% vs. our AP of 82.4%
Confusions
                                        long hair
Men most confused as women




Women most confused as men   baseball hat      hair hidden
annotation
Non-T-shirt most confused to be T-shirt          errors




Short pants most confused to be long pants




         Are these pants short?   wrong person    occlusion
Best poselets per attribute

Gender:


Long Hair:



Wears glasses:
We can describe a picture of a person



                  “A woman with long hair,
                  glasses and long pants”(??)
Conclusion
How poselets help in high-level vision




 The image is a complex      Poselets decouple pose and
function of the viewpoint,       camera view from
  pose, appearance, etc.             appearance
Google “poselets” to get:

•   The set of published poselet papers
•   H3D data set + Matlab tools
•   Java3D annotation tool + video tutorial
•   Matlab code to detect people using poselets
•   Our latest trained poselets
Poselets website
                          Failure mode
http://eecs.berkeley.edu/~lbourdev/poselets hair,
                         “A man with with long
                          “A woman short
                               “Aglasses,with short hair,
                                “Aperson short short hair,
                                    man with sleeves and
                                hair and long sleeves”
•   The set of published poseletno hat pants” sleeves
                                glasses, short sleeves”
                                 papers and long
                                  long
•   H3D data set + Matlab toolsand person with
                                 “A shorts”
    Java3D annotation tool + video tutorial
                                  longcomputer vision
                                   “A pants”
•
•   Matlab code to detect people using poselets
                                   professor who likes
•   Our latest trained poselets
                                 machine learning”

More Related Content

More from zukun

Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featureszukun
 
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...zukun
 
Quoc le tera-scale deep learning
Quoc le   tera-scale deep learningQuoc le   tera-scale deep learning
Quoc le tera-scale deep learningzukun
 

More from zukun (20)

Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant features
 
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
 
Quoc le tera-scale deep learning
Quoc le   tera-scale deep learningQuoc le   tera-scale deep learning
Quoc le tera-scale deep learning
 

Recently uploaded

UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
ViralQR
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 

Recently uploaded (20)

UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 

Describing People: A Poselet-based approach to attribute classification

  • 1. Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev1,2 Subhransu Maji1 Jitendra Malik1 1EECS U.C. Berkeley 2Adobe Systems Inc.
  • 2. Goal: Extract attributes from images of people
  • 3. Who has long hair?
  • 4. Who has short pants?
  • 6. Prior work on poselets and on attributes
  • 7. Prior work on Poselets • Introduced by [Bourdev and Malik, ICCV09] • Detection with poselets [Bourdev et al, ECCV10] • Applications • Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11] • Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11] • Human parsing [Wang et al, CVPR11] • Semantic contours [Hariharan et al, ICCV11] • Subordinate level categorization [Farrell et al, ICCV11]
  • 8. Prior work on Poselets • Introduced by [Bourdev and Malik, ICCV09] • Detection with poselets [Bourdev et al, ECCV10] • Applications • Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11] • Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11] • Human parsing [Wang et al, CVPR11] • Semantic contours [Hariharan et al, ICCV11] • Subordinate level categorization [Farrell et al, ICCV11]
  • 9. Prior work on Poselets • Introduced by [Bourdev and Malik, ICCV09] • Detection with poselets [Bourdev et al, ECCV10] • Applications • Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11] • Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11] • Human parsing [Wang et al, CVPR11] • Semantic contours [Hariharan et al, ICCV11] • Subordinate level categorization [Farrell et al, ICCV11]
  • 10. Prior work on Poselets • Introduced by [Bourdev and Malik, ICCV09] • Detection with poselets [Bourdev et al, ECCV10] • Applications • Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11] • Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11] • Human parsing [Wang et al, CVPR11] • Semantic contours [Hariharan et al, ICCV11] • Subordinate level categorization [Farrell et al, ICCV11]
  • 11. Prior work on Poselets • Introduced by [Bourdev and Malik, ICCV09] • Detection with poselets [Bourdev et al, ECCV10] • Applications • Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11] • Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et al, ICCV11] • Human parsing [Wang et al, CVPR11] • Semantic contours [Hariharan et al, ICCV11] • Subordinate level categorization [Farrell et al, ICCV11]
  • 12. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 13. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11][Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 14. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 15. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 16. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 17. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 18. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10][Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 19. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 20. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 21. Prior work on Attributes Attributes as intermediate parts Attributes and actions Discovering attributes from text Active learning with attributes Discovering attributes from images Attributes of people Attributes from motion capture Gender attribute Joint learning of classes & attributes Image retrieval with attributes [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 22. Prior work on Attributes Attributes as intermediate parts Image retrieval with attributes Discovering attributes from text Attributes and actions Discovering attributes from images Active learning with attributes Attributes from motion capture Attributes of people Joint learning of classes & attributes Gender attribute [Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam& Yang, PAMI02] [Ferrari &Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]
  • 25. Gender recognition is easier if we factor out the pose
  • 26. Poselets [Bourdev & Malik ICCV09]
  • 27. Poselets Examples may differ visually but have common semantics
  • 28. How do we train a poselet?
  • 29. Finding correspondences at training time Given part of a human How do we find a similar pose pose configuration in the training set?
  • 30. Finding correspondences at training time Left Shoulder Left Hip We use keypoints to annotate the joints, eyes, nose, etc. of people
  • 31. Finding correspondences at training time Residual Error
  • 32. Training poselet classifiers Residual 0.15 0.20 0.10 0.85 0.15 0.35 Error: 1. Given a seed patch 2. Find the closest patch for every other person 3. Sort them by residual error 4. Threshold them
  • 33. Training poselet classifiers 1. Given a seed patch 2. Find the closest patch for every other person 3. Sort them by residual error 4. Threshold them 5. Use them as positive training examples to train a linear SVM with HOG features
  • 35. Goal: Extract attributes of this person
  • 36. Goal: Extract attributes of this person Target person bounds Input: Bounds of other nearby people
  • 37. Step 1: Detect poselet activations [Bourdev et al, ECCV10]
  • 38. Step 2: Cluster the activations [Bourdev et al, ECCV10]
  • 39. Step 3: Predict person bounds [Bourdev et al, ECCV10]
  • 40. Step 4: Identify the correct cluster Max-flow in bipartite graph
  • 41. Start with its poselet activations Poselet Activations
  • 42. Features • Pyramid HOG • LAB histogram • Skin features • Hands-skin • Legs-skin Poselet Skin Arms B .* C patch mask mask Features Poselet Activations
  • 47. Our dataset • Source: VOC 2010 trainval for Person + H3D • ~8000 annotations (4000 train + 4000 test) • 9 binary attributes specified by 5 independent annotators via AMT • Ground truth label: If 4 of the 5 agree • Dataset will be made publicly available
  • 48. Visual search on our test set “Wears hat” “Female”
  • 52. Our baseline • Canny-modulated HOG with SPM kernel [Lazebnik et al CVPR06] • To help the baseline trained separate SPM for four viewpoints: Full view Head zoom Upper body Legs • For each attribute we pick the best SPM as our baseline
  • 53. Precision/recall on our test set Label - --- frequency SPM ___ No ___ context Full ___ Model
  • 54. State-of-the-art Gender Recognition • We outperform Cognitec (top-notch face recognizer) • We outperform any gender recognizer based on frontal faces (are there others?) • 61% of our test have frontal faces. • Even with perfect classification of frontal faces, max AP=80.5% vs. our AP of 82.4%
  • 55. Confusions long hair Men most confused as women Women most confused as men baseball hat hair hidden
  • 56. annotation Non-T-shirt most confused to be T-shirt errors Short pants most confused to be long pants Are these pants short? wrong person occlusion
  • 57. Best poselets per attribute Gender: Long Hair: Wears glasses:
  • 58. We can describe a picture of a person “A woman with long hair, glasses and long pants”(??)
  • 60. How poselets help in high-level vision The image is a complex Poselets decouple pose and function of the viewpoint, camera view from pose, appearance, etc. appearance
  • 61. Google “poselets” to get: • The set of published poselet papers • H3D data set + Matlab tools • Java3D annotation tool + video tutorial • Matlab code to detect people using poselets • Our latest trained poselets
  • 62. Poselets website Failure mode http://eecs.berkeley.edu/~lbourdev/poselets hair, “A man with with long “A woman short “Aglasses,with short hair, “Aperson short short hair, man with sleeves and hair and long sleeves” • The set of published poseletno hat pants” sleeves glasses, short sleeves” papers and long long • H3D data set + Matlab toolsand person with “A shorts” Java3D annotation tool + video tutorial longcomputer vision “A pants” • • Matlab code to detect people using poselets professor who likes • Our latest trained poselets machine learning”

Editor's Notes

  1. Blablabla.Blablabla-bla. Bla!