SlideShare a Scribd company logo
1 of 21
Download to read offline
Modern features: advances,
               applications and software

•  Why now?


•  2005 IJCV paper on “A comparison of affine region detectors”
 – What has happened since?
 – Improvements over the classics?



•  Release of new software suites for feature detection
 – VLFeat
 – Also benchmarks
Modern features: advances,
              applications and software
Programme:

 2.30 – 2.45 Introduction
 2.45 – 3.30 Feature detectors
 3.30 – 4.15 Feature descriptors
 4.15 – 4.30 Break
 4.30 – 5.00 Software
 5.00 – 5.30 Benchmarks
 5.30 – 6.15 Hands-on software session



See webpage for this up to date programme
       https://sites.google.com/site/eccv12features/
Features Detector.
•  Definition: A feature detector (extractor) is an algorithm taking an image
   as input and outputting a set of regions (“local features”).

•  “Local Features” are regions, i.e. in principle arbitrary sets of pixels, not
   necessarily contiguous, which are at least :
   – distinguishable in an image regardless of viewpoint/illumination
   – robust to occlusion must be local
   – Must have a discriminative neighborhood: they are “features”

•    Terminology has not stabilised:
     Local Feature = Interest “Point” = Keypoint =
                  = Feature “Point” = The “Patch”
                  = Distinguished Region = Features
                  = (Transformation) Covariant Region
•    Definition: A descriptor is computed on an image region defined by a
     detector. The descriptor is a representation of the intensity (colour, ….)
     function on the region.
Feature Detectors: Desiderata
•  Invariance (or covariance) to a broad class of geometric and photometric
   transforms
•  Efficiency: close to real-time performance
•  Quantity/Density of features to cover small object/part of scenes
•  Robustness to:
  –  occlusion and clutter (requires locality)
  –  to noise, blur, discretization, compression
•  Distinctiveness: individual features can be matched to a large database of
   objects
•  Stability over time (to support long-temporal-baseline matching)
•  Geometrically accuracy: precise localization
•  Generalization to similar objects
•  Even coverage, complementarity, number of geometric constraints, …
No detector dominates in all aspects, some properties are competing,
e.g. level of invariance x speed

ECCV 2012 Modern features: … Introduction.                                4/30
Feature Descriptor.
•    Definition: A descriptor is computed on an image region defined by a
     detector. The descriptor is a representation of the intensity (colour, ….)
     function on the region.


Desiderata for feature descriptors:
•  Discriminability
•  Robustness to misalignment, illumination, blur, compression, …
•  Efficiency: real-time often required
•  Compactness: small memory footprint. Very significant on mobile large-
   scale applications


Note: The region on which a descriptor is computed is a called a
measurement region. This may be directly the feature detector output or
any other function of it (eg. convex hull, triple area region..)



ECCV 2012 Modern features: … Introduction.                                  5/30
Application Domains
 •  Methods based on “Local Features” are the state-of-the-art for number of
    computer vision problems (mostly those that require local
    correspondences).
 •  Suited to instance matching over change in viewpoint, scale, lighting,
    partial occlusion, region of interest …
 •  Multiple views of the same scene, e.g.
    –  Computing epipolar geometry or a homography
    –  Photo Tourism
    –  Panoramic mosaic
 •  Query by example search in large scale image datasets, e.g.
    –  Google goggles
    –  Where am I? Match to Streetview
    –  Total recall
 •  Copy detection
 •  Re-acquisition in tracking
 •  Object category recognition

ECCV 2012 Modern features: … Introduction.                              6/30
Example 1: Wide baseline matching
•  Establish correspondence between two (or more) images
•  Useful in visual geometry: Camera calibration, 3D
  reconstruction, Structure and motion estimation, …


     Local transf: scale/affine – Detector: affine-Harris Descriptor: SIFT




ECCV 2012 Modern features: … Introduction.                                   7/30
Example 2: Panoramic mosaic




ECCV 2012 Modern features: … Introduction.   8/30
Example 2.Image Stitching: Building a Panorama




  M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003
ECCV 2012 Modern features: … Introduction.   M. Brown, D. Lowe, B. Hearn, J. Beis
                                                                              10/30
Example 3: 3D reconstruction
•  Photo Tourism overview




                                           Scene
                                       reconstruction
                                                                                Photo
 Input photographs                                                             Explorer
                                                  Relative camera
                                                  positions and orientations
                                                  Point cloud
                                                  Sparse correspondence




ECCV 2012 Modern features: … Introduction.                                                 11/30
                                                                                Slide: N. Snavely
Example 3: 3D reconstruction
57,845 downloaded images, 11,868 registered images. This video: 4,619 images.
The Old City of Dubrovnik                    Building Rome in a Day, Agarwal, Snavely, Simon, Seitz,
                                             Szeliski, ICCV 2009
                                             See also [Havlena, Torrii, Knop pand Pajdla, CVPR 2009].




ECCV 2012 Modern features: … Introduction.                                                       12/30
Example 4: Query by example search in large scale
image datasets




        Find these objects                   ...in these images and 1M more

       Search the web with a visual query …
ECCV 2012 Modern features: … Introduction.                               13/30
Example 5: Google goggles




ECCV 2012 Modern features: … Introduction.   Slide credit: I. 14/30
                                                              Laptev
ECCV 2012 Modern features: … Introduction.   15/30
Example 6: Where am I?
•  Place recognition - retrieval in a structured (on a map) database


     Query                  Image database                                  Best match

                                                      Image indexing
                                                        with spatial
                                                        verification

 Query
 Expansion
 (Panoramio,
  Flickr, … )



      Confuser
   Suppression
    Only negative
     training data
  (from geotags)




[Knopp, Sivic, features: … Introduction. http://www.di.ens.fr/willow/research/confusers/
 ECCV 2012 Modern Pajdla, ECCV 2010]                                                  16/30
Example 8: Re-acquisition in tracking

                 Tracking	
  target	
                                  Input	
  image	
  




      Weight	
  vector	
  wi	
  per	
  keypoint	
                      Descriptor	
  dj	
  per	
  keypoint	
  

                            Correspondence	
  score:	
  	
  sij	
  =	
  <wi	
  ,	
  dj>	
  
Hare, Amri, Torr, CVPR 2012

ECCV 2012 Modern features: … Introduction.                                                                       17/30
Example 8: Re-acquisition in tracking
•  Tracking Loop               Detect        Correspondence generation + PROSAC




Hare, Amri, Torr,
CVPR 2012



                               Update        Structured SVM +
                                             stochastic gradient descent
ECCV 2012 Modern features: … Introduction.                                        18/30
Example 8: Re-acquisition in tracking




ECCV 2012 Modern features: … Introduction.   19/30
Example 8: Re-acquisition in tracking




                              Courtesy of Graz University of Technology
ECCV 2012 Modern features: … Introduction.                                20/30
Example 8: Object category recognition

 Sliding window detector
      •  Classifier: SVM with linear kernel
      •  BOW representation for ROI
   Example detections for dog




 Lampert et al CVPR 08: Efficient branch and bound search over all windows
ECCV 2012 Modern features: … Introduction.                              21/30

More Related Content

What's hot

SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術
SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術
SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術SSII
 
Build Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: ConclusionBuild Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: ConclusionDouglas Lanman
 
Build Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: IntroductionBuild Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: IntroductionDouglas Lanman
 
Depth estimation do we need to throw old things away
Depth estimation do we need to throw old things awayDepth estimation do we need to throw old things away
Depth estimation do we need to throw old things awayNAVER Engineering
 
Concept of stereo vision based virtual touch
Concept of stereo vision based virtual touchConcept of stereo vision based virtual touch
Concept of stereo vision based virtual touchVivek Chamorshikar
 
更適應性的AOI-深度強化學習之應用
更適應性的AOI-深度強化學習之應用更適應性的AOI-深度強化學習之應用
更適應性的AOI-深度強化學習之應用CHENHuiMei
 
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View SynthesisNeural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View SynthesisVincent Sitzmann
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesDouglas Lanman
 
Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...
Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...
Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...Simone Ercoli
 
Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...
Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...
Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...Living Online
 
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...Dongmin Choi
 
Tennis video shot classification based on support vector
Tennis video shot classification based on support vectorTennis video shot classification based on support vector
Tennis video shot classification based on support vectores712
 
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...Edge AI and Vision Alliance
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Yan Xu
 
Focused Image Creation Algorithms for digital holography
Focused Image Creation Algorithms for digital holographyFocused Image Creation Algorithms for digital holography
Focused Image Creation Algorithms for digital holographyConor Mc Elhinney
 

What's hot (20)

Lecture1
Lecture1Lecture1
Lecture1
 
SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術
SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術
SSII2021 [OS3-01] 設備や環境の高品質計測点群取得と自動モデル化技術
 
Build Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: ConclusionBuild Your Own 3D Scanner: Conclusion
Build Your Own 3D Scanner: Conclusion
 
Build Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: IntroductionBuild Your Own 3D Scanner: Introduction
Build Your Own 3D Scanner: Introduction
 
Depth estimation do we need to throw old things away
Depth estimation do we need to throw old things awayDepth estimation do we need to throw old things away
Depth estimation do we need to throw old things away
 
Light field
Light field Light field
Light field
 
Concept of stereo vision based virtual touch
Concept of stereo vision based virtual touchConcept of stereo vision based virtual touch
Concept of stereo vision based virtual touch
 
Compressive DIsplays: SID Keynote by Ramesh Raskar
Compressive DIsplays: SID Keynote by Ramesh RaskarCompressive DIsplays: SID Keynote by Ramesh Raskar
Compressive DIsplays: SID Keynote by Ramesh Raskar
 
更適應性的AOI-深度強化學習之應用
更適應性的AOI-深度強化學習之應用更適應性的AOI-深度強化學習之應用
更適應性的AOI-深度強化學習之應用
 
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View SynthesisNeural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
 
Svr Raskar
Svr RaskarSvr Raskar
Svr Raskar
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
 
Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...
Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...
Vision and Multimedia Reading Group: DeCAF: a Deep Convolutional Activation F...
 
Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...
Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...
Troubleshooting, Designing & Installing Digital & Analog Closed Circuit TV Sy...
 
Survey 1 (project overview)
Survey 1 (project overview)Survey 1 (project overview)
Survey 1 (project overview)
 
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
 
Tennis video shot classification based on support vector
Tennis video shot classification based on support vectorTennis video shot classification based on support vector
Tennis video shot classification based on support vector
 
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
“Person Re-Identification and Tracking at the Edge: Challenges and Techniques...
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
 
Focused Image Creation Algorithms for digital holography
Focused Image Creation Algorithms for digital holographyFocused Image Creation Algorithms for digital holography
Focused Image Creation Algorithms for digital holography
 

Viewers also liked

Towards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing
Towards Efficient Privacy-preserving Image Feature Extraction in Cloud ComputingTowards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing
Towards Efficient Privacy-preserving Image Feature Extraction in Cloud ComputingSi Chen
 
Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.Ian Sa
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptorswolf
 

Viewers also liked (6)

Towards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing
Towards Efficient Privacy-preserving Image Feature Extraction in Cloud ComputingTowards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing
Towards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing
 
Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Surf and ransac
Surf and ransacSurf and ransac
Surf and ransac
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
 

Similar to Modern Feature Detectors

Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision Chen Sagiv
 
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya
 
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformFadwa Fouad
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverviewMotaz El-Saban
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple featuresHirantha Pradeep
 
Mobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesMobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesUnited States Air Force Academy
 
Feature Tracking of Objects in Underwater Video Sequences
Feature Tracking of Objects in Underwater Video SequencesFeature Tracking of Objects in Underwater Video Sequences
Feature Tracking of Objects in Underwater Video SequencesIDES Editor
 
Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingPreferred Networks
 
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)Saimunur Rahman
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術CHENHuiMei
 
Optimized and efficient deblurring through constraint conditional modelling
Optimized and efficient deblurring through constraint  conditional modellingOptimized and efficient deblurring through constraint  conditional modelling
Optimized and efficient deblurring through constraint conditional modellingnooriasukmaningtyas
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015Jia-Bin Huang
 
Principles of Data Visualization
Principles of Data VisualizationPrinciples of Data Visualization
Principles of Data VisualizationEamonn Maguire
 

Similar to Modern Feature Detectors (20)

Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
 
Content-based Image Retrieval - Eva Mohedano - UPC Barcelona 2018
Content-based Image Retrieval - Eva Mohedano - UPC Barcelona 2018Content-based Image Retrieval - Eva Mohedano - UPC Barcelona 2018
Content-based Image Retrieval - Eva Mohedano - UPC Barcelona 2018
 
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverview
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
 
Mobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesMobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large Repositories
 
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
 
Feature Tracking of Objects in Underwater Video Sequences
Feature Tracking of Objects in Underwater Video SequencesFeature Tracking of Objects in Underwater Video Sequences
Feature Tracking of Objects in Underwater Video Sequences
 
Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable Rendering
 
Lec11 object-re-id
Lec11 object-re-idLec11 object-re-id
Lec11 object-re-id
 
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
 
Optimized and efficient deblurring through constraint conditional modelling
Optimized and efficient deblurring through constraint  conditional modellingOptimized and efficient deblurring through constraint  conditional modelling
Optimized and efficient deblurring through constraint conditional modelling
 
med_poster_spie
med_poster_spiemed_poster_spie
med_poster_spie
 
視訊訊號處理與深度學習應用
視訊訊號處理與深度學習應用視訊訊號處理與深度學習應用
視訊訊號處理與深度學習應用
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 
Principles of Data Visualization
Principles of Data VisualizationPrinciples of Data Visualization
Principles of Data Visualization
 

More from zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Informationzukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featureszukun
 
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...zukun
 
Quoc le tera-scale deep learning
Quoc le   tera-scale deep learningQuoc le   tera-scale deep learning
Quoc le tera-scale deep learningzukun
 

More from zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant features
 
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
 
Quoc le tera-scale deep learning
Quoc le   tera-scale deep learningQuoc le   tera-scale deep learning
Quoc le tera-scale deep learning
 

Modern Feature Detectors

  • 1. Modern features: advances, applications and software •  Why now? •  2005 IJCV paper on “A comparison of affine region detectors” – What has happened since? – Improvements over the classics? •  Release of new software suites for feature detection – VLFeat – Also benchmarks
  • 2. Modern features: advances, applications and software Programme: 2.30 – 2.45 Introduction 2.45 – 3.30 Feature detectors 3.30 – 4.15 Feature descriptors 4.15 – 4.30 Break 4.30 – 5.00 Software 5.00 – 5.30 Benchmarks 5.30 – 6.15 Hands-on software session See webpage for this up to date programme https://sites.google.com/site/eccv12features/
  • 3. Features Detector. •  Definition: A feature detector (extractor) is an algorithm taking an image as input and outputting a set of regions (“local features”). •  “Local Features” are regions, i.e. in principle arbitrary sets of pixels, not necessarily contiguous, which are at least : – distinguishable in an image regardless of viewpoint/illumination – robust to occlusion must be local – Must have a discriminative neighborhood: they are “features” •  Terminology has not stabilised: Local Feature = Interest “Point” = Keypoint = = Feature “Point” = The “Patch” = Distinguished Region = Features = (Transformation) Covariant Region •  Definition: A descriptor is computed on an image region defined by a detector. The descriptor is a representation of the intensity (colour, ….) function on the region.
  • 4. Feature Detectors: Desiderata •  Invariance (or covariance) to a broad class of geometric and photometric transforms •  Efficiency: close to real-time performance •  Quantity/Density of features to cover small object/part of scenes •  Robustness to: –  occlusion and clutter (requires locality) –  to noise, blur, discretization, compression •  Distinctiveness: individual features can be matched to a large database of objects •  Stability over time (to support long-temporal-baseline matching) •  Geometrically accuracy: precise localization •  Generalization to similar objects •  Even coverage, complementarity, number of geometric constraints, … No detector dominates in all aspects, some properties are competing, e.g. level of invariance x speed ECCV 2012 Modern features: … Introduction. 4/30
  • 5. Feature Descriptor. •  Definition: A descriptor is computed on an image region defined by a detector. The descriptor is a representation of the intensity (colour, ….) function on the region. Desiderata for feature descriptors: •  Discriminability •  Robustness to misalignment, illumination, blur, compression, … •  Efficiency: real-time often required •  Compactness: small memory footprint. Very significant on mobile large- scale applications Note: The region on which a descriptor is computed is a called a measurement region. This may be directly the feature detector output or any other function of it (eg. convex hull, triple area region..) ECCV 2012 Modern features: … Introduction. 5/30
  • 6. Application Domains •  Methods based on “Local Features” are the state-of-the-art for number of computer vision problems (mostly those that require local correspondences). •  Suited to instance matching over change in viewpoint, scale, lighting, partial occlusion, region of interest … •  Multiple views of the same scene, e.g. –  Computing epipolar geometry or a homography –  Photo Tourism –  Panoramic mosaic •  Query by example search in large scale image datasets, e.g. –  Google goggles –  Where am I? Match to Streetview –  Total recall •  Copy detection •  Re-acquisition in tracking •  Object category recognition ECCV 2012 Modern features: … Introduction. 6/30
  • 7. Example 1: Wide baseline matching •  Establish correspondence between two (or more) images •  Useful in visual geometry: Camera calibration, 3D reconstruction, Structure and motion estimation, … Local transf: scale/affine – Detector: affine-Harris Descriptor: SIFT ECCV 2012 Modern features: … Introduction. 7/30
  • 8. Example 2: Panoramic mosaic ECCV 2012 Modern features: … Introduction. 8/30
  • 9. Example 2.Image Stitching: Building a Panorama M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003
  • 10. ECCV 2012 Modern features: … Introduction. M. Brown, D. Lowe, B. Hearn, J. Beis 10/30
  • 11. Example 3: 3D reconstruction •  Photo Tourism overview Scene reconstruction Photo Input photographs Explorer Relative camera positions and orientations Point cloud Sparse correspondence ECCV 2012 Modern features: … Introduction. 11/30 Slide: N. Snavely
  • 12. Example 3: 3D reconstruction 57,845 downloaded images, 11,868 registered images. This video: 4,619 images. The Old City of Dubrovnik Building Rome in a Day, Agarwal, Snavely, Simon, Seitz, Szeliski, ICCV 2009 See also [Havlena, Torrii, Knop pand Pajdla, CVPR 2009]. ECCV 2012 Modern features: … Introduction. 12/30
  • 13. Example 4: Query by example search in large scale image datasets Find these objects ...in these images and 1M more Search the web with a visual query … ECCV 2012 Modern features: … Introduction. 13/30
  • 14. Example 5: Google goggles ECCV 2012 Modern features: … Introduction. Slide credit: I. 14/30 Laptev
  • 15. ECCV 2012 Modern features: … Introduction. 15/30
  • 16. Example 6: Where am I? •  Place recognition - retrieval in a structured (on a map) database Query Image database Best match Image indexing with spatial verification Query Expansion (Panoramio, Flickr, … ) Confuser Suppression Only negative training data (from geotags) [Knopp, Sivic, features: … Introduction. http://www.di.ens.fr/willow/research/confusers/ ECCV 2012 Modern Pajdla, ECCV 2010] 16/30
  • 17. Example 8: Re-acquisition in tracking Tracking  target   Input  image   Weight  vector  wi  per  keypoint   Descriptor  dj  per  keypoint   Correspondence  score:    sij  =  <wi  ,  dj>   Hare, Amri, Torr, CVPR 2012 ECCV 2012 Modern features: … Introduction. 17/30
  • 18. Example 8: Re-acquisition in tracking •  Tracking Loop Detect Correspondence generation + PROSAC Hare, Amri, Torr, CVPR 2012 Update Structured SVM + stochastic gradient descent ECCV 2012 Modern features: … Introduction. 18/30
  • 19. Example 8: Re-acquisition in tracking ECCV 2012 Modern features: … Introduction. 19/30
  • 20. Example 8: Re-acquisition in tracking Courtesy of Graz University of Technology ECCV 2012 Modern features: … Introduction. 20/30
  • 21. Example 8: Object category recognition Sliding window detector •  Classifier: SVM with linear kernel •  BOW representation for ROI Example detections for dog Lampert et al CVPR 08: Efficient branch and bound search over all windows ECCV 2012 Modern features: … Introduction. 21/30