SlideShare a Scribd company logo
1 of 38
Constellation Models and Unsupervised Learning for Object Class Recognition Fergus, Perona and Zisserman,  “ Object Class Recognition by Unsupervised Scale-Invariant Learning”,  CVPR 03. Presented by  Yuval Netzer
Slides Credit: Gary Bradski, Sebastian Thrun, Rob Fergus, Pietro Perona, Andrew Zisserman, Li Fei-Fei, Antonio Torralba
3 main issues to address: Representation  Learning Recognition Difficulties: Size variation Background clutter Occlusion Intra-class variation “ Unsupervised” learning! 10000s of categories.. Task: Recognition of object  categories “ Object class recognition using unsupervised scale-invariant learning”
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Representation: Constellation of Parts
Object categorization:  the statistical viewpoint vs. ,[object Object],posterior ratio likelihood ratio prior ratio
Object categorization:  the statistical viewpoint posterior ratio likelihood ratio prior ratio ,[object Object],[object Object]
Discriminative ,[object Object],Zebra Non-zebra Decision boundary
Generative ,[object Object],Middle  Low High Middle Low
Generative vs. Discriminative
Generative Methods ,[object Object],[object Object],Discriminative Methods    They use the flexibility of the model in relevant regions of input space    They interpolate between training examples, and hence can fail if novel inputs are presented
Foreground model Joint Gaussian shape pdf Pp oisson ( N |  ) p ( x ) = A -1   (uniform) Poission pdf on # detections Uniform shape pdf Prob. of detection table Generative probabilistic model Clutter model Gaussian  relative scale pdf Log(scale) Gaussian appearance pdf Uniform relative scale pdf Log(scale) Gaussian part appearance pdf 0.8 0.75 0.9
Foreground model Joint Gaussian shape pdf Pp oisson ( N |  ) p ( x ) = A -1   (uniform) Poission pdf on # detections Uniform shape pdf Prob. of detection table Generative probabilistic model Clutter model Gaussian  relative scale pdf Log(scale) Gaussian appearance pdf Uniform relative scale pdf Log(scale) Gaussian part appearance pdf 0.8 0.75 0.9
Scale, Saliency and Image Description. Timor Kadir and Michael Brady.
 
p(A)=  parts   G (A part | c part , V part )
Sparse representation ,[object Object],[object Object],[object Object],[object Object]
Combining it together for  Recognition : X  – Shape (features locations) A  – Appearance (feature representations) S  – Scale (vector of feature relative scales) Assume we learned model parameters θ  Extract X, S, A from image Make a  Bayesian  decision: We apply  threshold  to the likelihood ratio R to decide whether the input image belongs to the class (so priors ratio may be set to 1). Approx. using  (delta function) All non object images modeled using a single set of parameters
The correspondence problem ,[object Object],[object Object],[object Object]
Recognition (cont.) The term p(X, S, A | θ) can be factored into: Each of the terms has a closed (computable) form given the model parameters θ h  – Hypothesis (which part is represented by which observed feature)
Foreground model Prob. of detection table Gaussian part appearance pdf Gaussian  relative scale pdf Log(scale) Can represent both geometrically constrained objects and appearance distinctive objects lacking geometric form Gaussian shape pdf 0.8 0.75 0.9
[object Object],Learning using EM ,[object Object],[object Object],[object Object],[object Object]
Learning procedure E-step: Compute assignments for which regions belong to which part M-step: Update model parameters  ,[object Object],[object Object],[object Object],[object Object]
Example scheme, using EM for maximum likelihood learning Given an initial estimate of   : ... Image 1 Image 2 Image  i 1. Assign probabilities to constellations Large P Small P 2. Use probabilities as weights to re-estimate parameters. Example:   Large P x + Small P x pdf new estimate of   +  …  =
Efficient search methods ,[object Object],[object Object]
posterior ratio likelihood ratio prior ratio
Experimental procedure Two series of experiments: Datasets : Motorbikes, Faces, Spotted cats, Airplanes, Cars from behind and side  Between 200 and 800 images in size Training 50% images No identifcation of object within image  ,[object Object],[object Object],Testing 50% images Simple object present/absent test ROC equal error rate computed, using  background set of images
Motorbikes Pre flipped to face to the right
Motorbikes
Motorbikes Equal error rate: 7.5% Tight shape model
Background images
Frontal faces Equal error rate: 4.6%
Airplanes Equal error rate: 9.8%
Scale-invariant Spotted cats Equal error rate: 10.0% Loose shape model
Scale-invariant cars Equal error rate: 9.7%
ROC equal error rates Pre-scaled data (identical settings): Scale-invariant learning and recognition:
Scale-invariant cars
ROC equal error rates Pre-scaled data (identical settings): Scale-invariant learning and recognition:
Robustness of algorithm

More Related Content

What's hot

What's hot (20)

2017: Prototype-based models in unsupervised and supervised machine learning
2017: Prototype-based models in unsupervised and supervised machine learning2017: Prototype-based models in unsupervised and supervised machine learning
2017: Prototype-based models in unsupervised and supervised machine learning
 
Discussion of ABC talk by Francesco Pauli, Padova, March 21, 2013
Discussion of ABC talk by Francesco Pauli, Padova, March 21, 2013Discussion of ABC talk by Francesco Pauli, Padova, March 21, 2013
Discussion of ABC talk by Francesco Pauli, Padova, March 21, 2013
 
(Approximate) Bayesian computation as a new empirical Bayes (something)?
(Approximate) Bayesian computation as a new empirical Bayes (something)?(Approximate) Bayesian computation as a new empirical Bayes (something)?
(Approximate) Bayesian computation as a new empirical Bayes (something)?
 
Discussion of ABC talk by Stefano Cabras, Padova, March 21, 2013
Discussion of ABC talk by Stefano Cabras, Padova, March 21, 2013Discussion of ABC talk by Stefano Cabras, Padova, March 21, 2013
Discussion of ABC talk by Stefano Cabras, Padova, March 21, 2013
 
BIRS 12w5105 meeting
BIRS 12w5105 meetingBIRS 12w5105 meeting
BIRS 12w5105 meeting
 
Colloquium in honor of Hans Ruedi Künsch
Colloquium in honor of Hans Ruedi KünschColloquium in honor of Hans Ruedi Künsch
Colloquium in honor of Hans Ruedi Künsch
 
Dr Chris Drovandi (QUT) - Bayesian Indirect Inference Using a Parametric Auxi...
Dr Chris Drovandi (QUT) - Bayesian Indirect Inference Using a Parametric Auxi...Dr Chris Drovandi (QUT) - Bayesian Indirect Inference Using a Parametric Auxi...
Dr Chris Drovandi (QUT) - Bayesian Indirect Inference Using a Parametric Auxi...
 
Graph Based Pattern Recognition
Graph Based Pattern RecognitionGraph Based Pattern Recognition
Graph Based Pattern Recognition
 
Recognition as Graph Matching
  Recognition as Graph Matching  Recognition as Graph Matching
Recognition as Graph Matching
 
On the vexing dilemma of hypothesis testing and the predicted demise of the B...
On the vexing dilemma of hypothesis testing and the predicted demise of the B...On the vexing dilemma of hypothesis testing and the predicted demise of the B...
On the vexing dilemma of hypothesis testing and the predicted demise of the B...
 
von Mises lecture, Berlin
von Mises lecture, Berlinvon Mises lecture, Berlin
von Mises lecture, Berlin
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...
 
Statistics symposium talk, Harvard University
Statistics symposium talk, Harvard UniversityStatistics symposium talk, Harvard University
Statistics symposium talk, Harvard University
 
Presentation
PresentationPresentation
Presentation
 
Interpretable Sparse Sliced Inverse Regression for digitized functional data
Interpretable Sparse Sliced Inverse Regression for digitized functional dataInterpretable Sparse Sliced Inverse Regression for digitized functional data
Interpretable Sparse Sliced Inverse Regression for digitized functional data
 
About functional SIR
About functional SIRAbout functional SIR
About functional SIR
 
ML - Multiple Linear Regression
ML - Multiple Linear RegressionML - Multiple Linear Regression
ML - Multiple Linear Regression
 
ABC in London, May 5, 2011
ABC in London, May 5, 2011ABC in London, May 5, 2011
ABC in London, May 5, 2011
 
RuleML2015: Input-Output STIT Logic for Normative Systems
RuleML2015: Input-Output STIT Logic for Normative SystemsRuleML2015: Input-Output STIT Logic for Normative Systems
RuleML2015: Input-Output STIT Logic for Normative Systems
 
Machine learning
Machine learningMachine learning
Machine learning
 

Viewers also liked

Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based models
zukun
 
Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...
Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...
Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...
Universitat Politècnica de Catalunya
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
zukun
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
zukun
 
cvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introductioncvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introduction
zukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
zukun
 
Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)
nikhilus85
 
AAAI08 tutorial: visual object recognition
AAAI08 tutorial: visual object recognitionAAAI08 tutorial: visual object recognition
AAAI08 tutorial: visual object recognition
zukun
 
Human activity recognition
Human activity recognition Human activity recognition
Human activity recognition
srikanthgadam
 

Viewers also liked (20)

Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based models
 
Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...
Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...
Shuffle and learn: Unsupervised Learning using Temporal Order Verification (U...
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
 
Pc Seminar Jordi
Pc Seminar JordiPc Seminar Jordi
Pc Seminar Jordi
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
 
cvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introductioncvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introduction
 
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Part1
Part1Part1
Part1
 
Video Analysis with Recurrent Neural Networks (Master Computer Vision Barcelo...
Video Analysis with Recurrent Neural Networks (Master Computer Vision Barcelo...Video Analysis with Recurrent Neural Networks (Master Computer Vision Barcelo...
Video Analysis with Recurrent Neural Networks (Master Computer Vision Barcelo...
 
Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Anil Thomas - Object recognition
Anil Thomas - Object recognitionAnil Thomas - Object recognition
Anil Thomas - Object recognition
 
AAAI08 tutorial: visual object recognition
AAAI08 tutorial: visual object recognitionAAAI08 tutorial: visual object recognition
AAAI08 tutorial: visual object recognition
 
Human activity recognition
Human activity recognitionHuman activity recognition
Human activity recognition
 
Human activity recognition
Human activity recognition Human activity recognition
Human activity recognition
 
Unsupervised Learning with Apache Spark
Unsupervised Learning with Apache SparkUnsupervised Learning with Apache Spark
Unsupervised Learning with Apache Spark
 
Lbp based edge-texture features for object recoginition
Lbp based edge-texture features for object recoginitionLbp based edge-texture features for object recoginition
Lbp based edge-texture features for object recoginition
 
Supervised and unsupervised learning
Supervised and unsupervised learningSupervised and unsupervised learning
Supervised and unsupervised learning
 

Similar to Constellation Models and Unsupervised Learning for Object Class Recognition

Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
chenhm
 
얼굴 검출 기법과 감성 언어 인식기법
얼굴 검출 기법과 감성 언어 인식기법얼굴 검출 기법과 감성 언어 인식기법
얼굴 검출 기법과 감성 언어 인식기법
cyberemotion
 
얼굴검출기법 감성언어인식기법
얼굴검출기법 감성언어인식기법얼굴검출기법 감성언어인식기법
얼굴검출기법 감성언어인식기법
cyberemotions
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation source
butest
 
Moshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenfaceMoshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenface
wolf
 
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
Cvpr2007 object category recognition   p4 - combined segmentation and recogni...Cvpr2007 object category recognition   p4 - combined segmentation and recogni...
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
zukun
 
Surface Data Capture Image Matching
Surface Data Capture Image MatchingSurface Data Capture Image Matching
Surface Data Capture Image Matching
RASHID JAVED
 

Similar to Constellation Models and Unsupervised Learning for Object Class Recognition (20)

Beyond nouns eccv_2008
Beyond nouns eccv_2008Beyond nouns eccv_2008
Beyond nouns eccv_2008
 
Recognition
RecognitionRecognition
Recognition
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
 
ppt
pptppt
ppt
 
Naive Bayes Presentation
Naive Bayes PresentationNaive Bayes Presentation
Naive Bayes Presentation
 
5 single image super resolution using
5 single image super resolution using5 single image super resolution using
5 single image super resolution using
 
Brain Maps like Mine
Brain Maps like MineBrain Maps like Mine
Brain Maps like Mine
 
Machine Learning Algorithms Review(Part 2)
Machine Learning Algorithms Review(Part 2)Machine Learning Algorithms Review(Part 2)
Machine Learning Algorithms Review(Part 2)
 
얼굴 검출 기법과 감성 언어 인식기법
얼굴 검출 기법과 감성 언어 인식기법얼굴 검출 기법과 감성 언어 인식기법
얼굴 검출 기법과 감성 언어 인식기법
 
얼굴검출기법 감성언어인식기법
얼굴검출기법 감성언어인식기법얼굴검출기법 감성언어인식기법
얼굴검출기법 감성언어인식기법
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation source
 
2.7 other classifiers
2.7 other classifiers2.7 other classifiers
2.7 other classifiers
 
Sentence generation
Sentence generationSentence generation
Sentence generation
 
Moshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenfaceMoshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenface
 
Lec10 matching
Lec10 matchingLec10 matching
Lec10 matching
 
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
Cvpr2007 object category recognition   p4 - combined segmentation and recogni...Cvpr2007 object category recognition   p4 - combined segmentation and recogni...
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
 
Surface Data Capture Image Matching
Surface Data Capture Image MatchingSurface Data Capture Image Matching
Surface Data Capture Image Matching
 
Tracking Faces using Active Appearance Models
Tracking Faces using Active Appearance ModelsTracking Faces using Active Appearance Models
Tracking Faces using Active Appearance Models
 
[ppt]
[ppt][ppt]
[ppt]
 
[ppt]
[ppt][ppt]
[ppt]
 

More from wolf

A bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categoriesA bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categories
wolf
 
Object recognition seminar S2006E01
Object recognition seminar S2006E01Object recognition seminar S2006E01
Object recognition seminar S2006E01
wolf
 

More from wolf (12)

Eigenfaces and Fisherfaces
Eigenfaces and FisherfacesEigenfaces and Fisherfaces
Eigenfaces and Fisherfaces
 
Shai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble trackingShai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble tracking
 
A bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categoriesA bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categories
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
 
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
 
Recovering 3D human body configurations using shape contexts
Recovering 3D human body configurations using shape contextsRecovering 3D human body configurations using shape contexts
Recovering 3D human body configurations using shape contexts
 
Rafi Zachut's slides on class specific segmentation
Rafi Zachut's slides on class specific segmentationRafi Zachut's slides on class specific segmentation
Rafi Zachut's slides on class specific segmentation
 
Avihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slidesAvihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slides
 
Ala Stolpnik's Standard Model talk
Ala Stolpnik's Standard Model talkAla Stolpnik's Standard Model talk
Ala Stolpnik's Standard Model talk
 
Michal Erel's SIFT presentation
Michal Erel's SIFT presentationMichal Erel's SIFT presentation
Michal Erel's SIFT presentation
 
Gil Shapira's Active Appearance Model slides
Gil Shapira's Active Appearance Model slidesGil Shapira's Active Appearance Model slides
Gil Shapira's Active Appearance Model slides
 
Object recognition seminar S2006E01
Object recognition seminar S2006E01Object recognition seminar S2006E01
Object recognition seminar S2006E01
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 

Constellation Models and Unsupervised Learning for Object Class Recognition

  • 1. Constellation Models and Unsupervised Learning for Object Class Recognition Fergus, Perona and Zisserman, “ Object Class Recognition by Unsupervised Scale-Invariant Learning”, CVPR 03. Presented by Yuval Netzer
  • 2. Slides Credit: Gary Bradski, Sebastian Thrun, Rob Fergus, Pietro Perona, Andrew Zisserman, Li Fei-Fei, Antonio Torralba
  • 3. 3 main issues to address: Representation Learning Recognition Difficulties: Size variation Background clutter Occlusion Intra-class variation “ Unsupervised” learning! 10000s of categories.. Task: Recognition of object categories “ Object class recognition using unsupervised scale-invariant learning”
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 10.
  • 11. Foreground model Joint Gaussian shape pdf Pp oisson ( N |  ) p ( x ) = A -1 (uniform) Poission pdf on # detections Uniform shape pdf Prob. of detection table Generative probabilistic model Clutter model Gaussian relative scale pdf Log(scale) Gaussian appearance pdf Uniform relative scale pdf Log(scale) Gaussian part appearance pdf 0.8 0.75 0.9
  • 12. Foreground model Joint Gaussian shape pdf Pp oisson ( N |  ) p ( x ) = A -1 (uniform) Poission pdf on # detections Uniform shape pdf Prob. of detection table Generative probabilistic model Clutter model Gaussian relative scale pdf Log(scale) Gaussian appearance pdf Uniform relative scale pdf Log(scale) Gaussian part appearance pdf 0.8 0.75 0.9
  • 13. Scale, Saliency and Image Description. Timor Kadir and Michael Brady.
  • 14.  
  • 15. p(A)=  parts G (A part | c part , V part )
  • 16.
  • 17. Combining it together for Recognition : X – Shape (features locations) A – Appearance (feature representations) S – Scale (vector of feature relative scales) Assume we learned model parameters θ Extract X, S, A from image Make a Bayesian decision: We apply threshold to the likelihood ratio R to decide whether the input image belongs to the class (so priors ratio may be set to 1). Approx. using (delta function) All non object images modeled using a single set of parameters
  • 18.
  • 19. Recognition (cont.) The term p(X, S, A | θ) can be factored into: Each of the terms has a closed (computable) form given the model parameters θ h – Hypothesis (which part is represented by which observed feature)
  • 20. Foreground model Prob. of detection table Gaussian part appearance pdf Gaussian relative scale pdf Log(scale) Can represent both geometrically constrained objects and appearance distinctive objects lacking geometric form Gaussian shape pdf 0.8 0.75 0.9
  • 21.
  • 22.
  • 23. Example scheme, using EM for maximum likelihood learning Given an initial estimate of  : ... Image 1 Image 2 Image i 1. Assign probabilities to constellations Large P Small P 2. Use probabilities as weights to re-estimate parameters. Example:  Large P x + Small P x pdf new estimate of  + … =
  • 24.
  • 25. posterior ratio likelihood ratio prior ratio
  • 26.
  • 27. Motorbikes Pre flipped to face to the right
  • 29. Motorbikes Equal error rate: 7.5% Tight shape model
  • 31. Frontal faces Equal error rate: 4.6%
  • 32. Airplanes Equal error rate: 9.8%
  • 33. Scale-invariant Spotted cats Equal error rate: 10.0% Loose shape model
  • 34. Scale-invariant cars Equal error rate: 9.7%
  • 35. ROC equal error rates Pre-scaled data (identical settings): Scale-invariant learning and recognition:
  • 37. ROC equal error rates Pre-scaled data (identical settings): Scale-invariant learning and recognition: