SlideShare a Scribd company logo
1 of 51
Download to read offline
Salient Keypoint Selection for Object Representation
Paper ID: 1570232318
Twenty Second National Conference on Communications : NCC
2016
Authors: Prerana Mukherjee, Siddharth Srivastava, Brejesh Lall
Department of Electrical Engineering
Indian Institute of Technology, Delhi
OVERVIEW
Salient Keypoint Selection for Object Representation
• Introduction
• Background
• Proposed Methodology
• Experimental Results and Discussions
• Conclusion
INTRODUCTION
• We propose a keypoint selection technique which utilizes SIFT and KAZE
keypoint detectors, a texture map and Gabor Filter.
• The obtained keypoints are a subset of SIFT and KAZE keypoints on the
original image as well as the texture map.
• These are ranked according to the proposed saliency score based on
three criteria:
• distinctivity,
• detectability
• repeatability
• These keypoints are shown to be effectively able to characterize objects
in an image.
INTRODUCTION
• Selecting relevant keypoints from a set of detected keypoints assists in
reducing:
 the computational complexity
 error propagated due to irrelevant keypoints.
• This would help in application domains where objects are primary
concern such as object classification, detection, segmentation etc.
Motivation
Most matchable keypoints: regions with reasonably high Difference of Gaussian
(DoG) responses. [1]
KAZE features have strong response along the boundary of objects while SIFT
captures shape, texture etc. similar to neuronal response of human vision
system. [6]
KEY CONTRIBUTIONS
• First work using KAZE with SIFT keypoints for keypoint selection aimed
at object characterization and its subsequent use for object matching.
• Salient Keypoint selection of SIFT features on Gabor convolved image for
representation of features inside object boundaries in context of object
characterization.
• Adapt distinctiveness, detectability and repeatability scores [1] for
keypoints to Euclidean space.
Background
• SIFT has been the de-facto choice for keypoint extraction.
• KAZE is a recent feature detection technique which exploits the non
linear scale space to detect keypoints along edges and sharp
discontinuities.
• SIKA: A combination of SIFT and KAZE keypoints has shown
complementary nature of these techniques. Though it shows the
effectiveness of the combination in object classification, we provide a
non-heuristic approach for extracting suitable keypoints from the
image with the requisite properties.
SIKA
• SIKA keypoints [7] are direct combination of SIFT and KAZE keypoints. The
selection consists of either all or a subset of keypoints based on the available
object annotations.
• Suited for Object Classification and similar tasks with available object
annotations for training.
SIKA
SIKA ALL
SIKA Complementary
SIKA: Approach
SIFT vs KAZE vs SIKA
Property SIFT KAZE SIKA
Keypoint Distribution corners boundaries objects
No. of Keypoints Large Relatively fewer Selective (Practically
needs less than 50%
of keypoints as
compared to SIFT
and KAZE)
Scale Space Linear Non linear Both
Descriptor size 128
dimensional
descriptor
64/128
dimensional
descriptor
Respective
Descriptors
Object Classification
[7]
Lags behind
CNN
No where near
CNN
Comparable to CNN
(not always)
Proposed Methodology: An overview
1. Ranked combination: SIFT and KAZE keypoints + keypoints computed
from the texture map produced by Gabor filter.
2. Sharp edges or transitions: key characteristics of objects [3]. SIFT or any
other detector loses out on this crucial boundary information.
3. Supplement the SIFT and KAZE keypoints from original image with the
SIFT keypoints obtained from the texture map using Gabor filter. Saliency
map obtained using [5] is used to threshold out 'weak' keypoints.
KAZE features based on non-linear anisotropic diffusion filtering [4].
Proposed Methodology: Flow
Fig 1. : Flow diagram for the proposed methodology
Keypoint Selection and Ranking
1. Transformations: rotation (π/6, π/3, 2 ∗ π/3), scaling (0.5, 1.5, 2),
cropping (20%, 50%), affine.
Where SKP (i) : saliency score, Dist(KP(i)) : Distinctivity, Det(KP(i)) :
Detectability, Rep(KP(i)) : Repeatability
2. The description of ith keypoint which gives the location (xi , yi) and
response of the keypoint si .
SKP (i) = Dist(KP(i)) + Det(KP(i)) + Rep(KP(i))
KP(i) = {(xi , yi), si}, i = 1...N
Keypoint Selection and Ranking
3. Distinctiveness gives the summation of the Euclidean distances
between every pair of keypoint descriptors in the same image.
Keypoint Selection and Ranking
4. Repeatability gives Euclidean distance (ED) between the keypoint
descriptor in the original image to the keypoint descriptor mapped in
the corresponding transform, t. Here, nTransf is the number of
transformations.
Keypoint Selection and Ranking
5. Detectability gives the summation of the strengths of the keypoint in
the original image and its respective transforms.
Keypoint Selection and Ranking
6. We select the KAZE and SIFT keypoints which have saliency score
greater than the respective mean saliency scores.
where N is the total count of keypoint from respective detector and
µsalscore is mean of the saliency scores.
Texture Map based SIFT keypoints
1. SIFT keypoints are calculated on the original image. Then, the
orientation histogram of the keypoints is constructed. The dominant
orientations are found by binning the keypoint orientations into
prespecified number of bins. The image is then convolved with
Gabor filter using these dominant orientations.
where u denotes the frequency of the sinusoidal function, θ gives the
orientation of the function, σ is the standard deviation of the Gaussian
function.
Texture Map based SIFT keypoints
2. Next, the saliency map [5] is calculated for the original image. For
each keypoint, if the saliency value is greater than the mean saliency
then the keypoint is retained.
where TextureKP denotes the set of keypoints which are salient for
representing the texture. µsalmap denotes the mean of the saliency map.
Algorithm: Ranking Salient keypoints
EXPERIMENTAL RESULTS AND DISCUSSIONS
Datasets:
 Caltech 101: to show the effectiveness of the algo. that the salient
keypoints characterize and represent the objects.
 VGG affine dataset: for object matching.
Object Representation
Object Representation
Fig. 2: Figure showing a) Object annotation b) Saliency Map c) Gabor filtered image
(Texture Map) d) Ranked keypoints inside the object contour
Object Representation
Fig. 3: Texture and Ranked (SIFT and KAZE) keypoints
Object Matching
Object Matching
Fig. 4: Correctly matched keypoints by the proposed
selection strategy: red (KAZE), yellow (SIFT), green
(TextureKP) on the bikes dataset (VGG).
Object Matching
Fig. 5: Average ED vs top N% keypoints of the feature set
CONCLUSION
• Novel keypoint selection scheme based on SIFT and KAZE proposed. The
technique incorporated texture information by finding SIFT keypoints on a
texture map (using Gabor).
• Technique can characterize an object region more efficiently than other
contemporary detectors.
• Less prone to false positives.
• It will help in extending the existing object matching and classification
algorithms.
• Practical applications: object localization, segmentation and many other
domains.
• Holds promise to extend the existing state of the art in many application
areas where objects are involved
[1] W. Hartmann, M. Havlena, and K. Schindler, “Predicting matchability,” in Computer Vision and Pattern Recognition
(CVPR), 2014 IEEE Conference on. IEEE, 2014, pp. 9–16.
[2] S. Buoncompagni, D. Maio, D. Maltoni, and S. Papi, “Saliency-based keypoint selection for fast object detection and
matching,” Pattern Recognition Letters, 2015.
[3] B. Alexe, T. Deselaers, and V. Ferrari, “What is an object?” in Computer Vision and Pattern Recognition (CVPR), 2010
IEEE Conference on. IEEE, 2010, pp. 73–80.
[4] P. Perona and J. Malik, “Scale-space and edge detection using anisotropic diffusion,” Pattern Analysis and Machine
Intelligence, IEEE Transactions on, vol. 12, no. 7, pp. 629–639, 1990.
[5] P. Mukherjee, B. Lall, and A. Shah, “Saliency map based improved segmentation,” in Image Processing (ICIP), 2015 IEEE
International Conference on (Accepted). IEEE, 2015.
[6] P. Alcantarilla, A. Bartoli and A. Davison, “Kaze Features,” In Proceedings of the 12th European conference on Computer
Vision, vol. 6, pp. 214-227, 2012.
[7] Srivastava, Siddharth, Prerana Mukherjee, and Brejesh Lall. "Characterizing objects with SIKA features for multiclass
classification." Applied Soft Computing (2015).
Bibliography
Thank-you!!!
Appendix
Convolve with
Gaussian
Downsample
Step 1: Construction of Scale Space
Scale Invariant Feature Transform: Keypoint
Detection
Gaussian images grouped
by octave.
DoG images grouped
by octave
Choose
consecutive DoG
images
26 neighbours
Optimization Tricks:
1. For non-maxima and
non-minima all points
need not to be
compared
2. First and last images
in the octave need
not be compared
Take pixel if it is local maxima/local minima than all of them.
This is called a KEYPOINT.
Extrema Detection (for each pixel)
• (b) Reject keypoints with low contrast
• (c ) Reject keypoints that are localized along an edge
Step II: Keypoint Localization
• Create gradient histogram for the keypoint
neighbourhood ( 36 bins)
• Neighborhood: a circular Gaussian falloff from
the keypoint center (sigma=1.5 pixels at the
current scale, so the effective neighborhood is
about 9x9)
Step III: Orientation Assignment
Any peak within 80% of the highest peak is used to create a
keypoint with that orientation
Orientation Assignment (Contd…)
Extracted keypoints,
arrows indicating scale
and orientation
• Take 16x16 square window around detected keypoint
• Decompose this into 4x4 tiles
• Compute gradient orientation for each pixel (8 bins)
• Create histogram over edge orientations weighted by magnitude
Adapted from slide by David Lowe
0 2
angle histogram
4x4x8= 128D
Scale Invariant Feature Transform: Keypoint
Description
KAZE: Background
KAZE: Background
KAZE: Background
KAZE: Background
KAZE: Background
equation for building non linear scale space using AOS
KAZE: Keypoint Detection
Comparison between gaussian blurring and nonlinear diffusion
Non linear vs linear scale space
Feature detectionKAZE: Keypoint Detection
Scharr edge filter
The Scharr operator is the most common technique with two kernels used to
estimate the two dimensional second derivatives horizontally and vertically.
The operator for the two direction is given by the following formula:
KAZE: Keypoint Detection
Feature descriptionKAZE: Keypoint Description
KAZE: Keypoint Description

More Related Content

What's hot

Performance Anaysis for Imaging System
Performance Anaysis for Imaging SystemPerformance Anaysis for Imaging System
Performance Anaysis for Imaging SystemVrushali Lanjewar
 
Score-Based Generative Modeling through Stochastic Differential Equations
Score-Based Generative Modeling through Stochastic Differential EquationsScore-Based Generative Modeling through Stochastic Differential Equations
Score-Based Generative Modeling through Stochastic Differential EquationsSangwoo Mo
 
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...CSCJournals
 
Rigorous Pack Edge Detection Fuzzy System
Rigorous Pack Edge Detection Fuzzy SystemRigorous Pack Edge Detection Fuzzy System
Rigorous Pack Edge Detection Fuzzy Systeminventy
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative ModelsMLReview
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingIAEME Publication
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative AttributesVikas Jain
 
Image Denoising Based On Sparse Representation In A Probabilistic Framework
Image Denoising Based On Sparse Representation In A Probabilistic FrameworkImage Denoising Based On Sparse Representation In A Probabilistic Framework
Image Denoising Based On Sparse Representation In A Probabilistic FrameworkCSCJournals
 
[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of FunctionsJaeJun Yoo
 
Domain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density TransformationsDomain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density TransformationsHyunKyu Jeon
 
Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Farzaneh Rezaei
 
Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...
Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...
Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...sipij
 
A Fuzzy Set Approach for Edge Detection
A Fuzzy Set Approach for Edge DetectionA Fuzzy Set Approach for Edge Detection
A Fuzzy Set Approach for Edge DetectionCSCJournals
 
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGAPPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGsipij
 
Kandemir Inferring Object Relevance From Gaze In Dynamic Scenes
Kandemir Inferring Object Relevance From Gaze In Dynamic ScenesKandemir Inferring Object Relevance From Gaze In Dynamic Scenes
Kandemir Inferring Object Relevance From Gaze In Dynamic ScenesKalle
 
Generative Models for General Audiences
Generative Models for General AudiencesGenerative Models for General Audiences
Generative Models for General AudiencesSangwoo Mo
 
PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...
PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...
PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...Jinwon Lee
 

What's hot (20)

V2 v posenet
V2 v posenetV2 v posenet
V2 v posenet
 
Performance Anaysis for Imaging System
Performance Anaysis for Imaging SystemPerformance Anaysis for Imaging System
Performance Anaysis for Imaging System
 
Score-Based Generative Modeling through Stochastic Differential Equations
Score-Based Generative Modeling through Stochastic Differential EquationsScore-Based Generative Modeling through Stochastic Differential Equations
Score-Based Generative Modeling through Stochastic Differential Equations
 
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
Image Segmentation from RGBD Images by 3D Point Cloud Attributes and High-Lev...
 
Rigorous Pack Edge Detection Fuzzy System
Rigorous Pack Edge Detection Fuzzy SystemRigorous Pack Edge Detection Fuzzy System
Rigorous Pack Edge Detection Fuzzy System
 
Ijcatr04041016
Ijcatr04041016Ijcatr04041016
Ijcatr04041016
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative Models
 
Kernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of movingKernel based similarity estimation and real time tracking of moving
Kernel based similarity estimation and real time tracking of moving
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative Attributes
 
Image Denoising Based On Sparse Representation In A Probabilistic Framework
Image Denoising Based On Sparse Representation In A Probabilistic FrameworkImage Denoising Based On Sparse Representation In A Probabilistic Framework
Image Denoising Based On Sparse Representation In A Probabilistic Framework
 
[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions
 
Domain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density TransformationsDomain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density Transformations
 
9.venkata naga vamsi. a
9.venkata naga vamsi. a9.venkata naga vamsi. a
9.venkata naga vamsi. a
 
Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)
 
Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...
Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...
Improvement of Anomaly Detection Algorithms in Hyperspectral Images Using Dis...
 
A Fuzzy Set Approach for Edge Detection
A Fuzzy Set Approach for Edge DetectionA Fuzzy Set Approach for Edge Detection
A Fuzzy Set Approach for Edge Detection
 
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGAPPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
 
Kandemir Inferring Object Relevance From Gaze In Dynamic Scenes
Kandemir Inferring Object Relevance From Gaze In Dynamic ScenesKandemir Inferring Object Relevance From Gaze In Dynamic Scenes
Kandemir Inferring Object Relevance From Gaze In Dynamic Scenes
 
Generative Models for General Audiences
Generative Models for General AudiencesGenerative Models for General Audiences
Generative Models for General Audiences
 
PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...
PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...
PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...
 

Similar to Salient keypoint selection improves object matching

Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesUniversitat Politècnica de Catalunya
 
Image Features Matching and Classification Using Machine Learning
Image Features Matching and Classification Using Machine LearningImage Features Matching and Classification Using Machine Learning
Image Features Matching and Classification Using Machine LearningIRJET Journal
 
Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...ijfcstjournal
 
Object Capturing In A Cluttered Scene By Using Point Feature Matching
Object Capturing In A Cluttered Scene By Using Point Feature MatchingObject Capturing In A Cluttered Scene By Using Point Feature Matching
Object Capturing In A Cluttered Scene By Using Point Feature MatchingIJERA Editor
 
Speeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object RecognitionSpeeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object RecognitionCSCJournals
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptorswolf
 
Web Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual DictionaryWeb Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual Dictionaryijwscjournal
 
Web Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual DictionaryWeb Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual Dictionaryijwscjournal
 
Finding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsFinding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsCSCJournals
 
Introduction to image processing and pattern recognition
Introduction to image processing and pattern recognitionIntroduction to image processing and pattern recognition
Introduction to image processing and pattern recognitionSaibee Alam
 
SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.bhavinecindus
 
A comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalA comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalcsandit
 
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVALA COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVALcscpconf
 
Scrdet++ analysis
Scrdet++ analysisScrdet++ analysis
Scrdet++ analysisNEHA Kapoor
 
Kernel Descriptors for Visual Recognition
Kernel Descriptors for Visual RecognitionKernel Descriptors for Visual Recognition
Kernel Descriptors for Visual RecognitionPriyatham Bollimpalli
 
WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...
WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...
WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...Nexgen Technology
 
A survey on coding binary visual features extracted from video sequences
A survey on coding binary visual features extracted from video sequencesA survey on coding binary visual features extracted from video sequences
A survey on coding binary visual features extracted from video sequencesIRJET Journal
 

Similar to Salient keypoint selection improves object matching (20)

Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
 
Image Features Matching and Classification Using Machine Learning
Image Features Matching and Classification Using Machine LearningImage Features Matching and Classification Using Machine Learning
Image Features Matching and Classification Using Machine Learning
 
Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...
 
Object Capturing In A Cluttered Scene By Using Point Feature Matching
Object Capturing In A Cluttered Scene By Using Point Feature MatchingObject Capturing In A Cluttered Scene By Using Point Feature Matching
Object Capturing In A Cluttered Scene By Using Point Feature Matching
 
Speeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object RecognitionSpeeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object Recognition
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
 
Web Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual DictionaryWeb Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual Dictionary
 
Web Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual DictionaryWeb Image Retrieval Using Visual Dictionary
Web Image Retrieval Using Visual Dictionary
 
K means report
K means reportK means report
K means report
 
Finding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsFinding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster Results
 
Introduction to image processing and pattern recognition
Introduction to image processing and pattern recognitionIntroduction to image processing and pattern recognition
Introduction to image processing and pattern recognition
 
SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.
 
key.net
key.netkey.net
key.net
 
A comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrievalA comparative analysis of retrieval techniques in content based image retrieval
A comparative analysis of retrieval techniques in content based image retrieval
 
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVALA COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL
 
Scrdet++ analysis
Scrdet++ analysisScrdet++ analysis
Scrdet++ analysis
 
Kernel Descriptors for Visual Recognition
Kernel Descriptors for Visual RecognitionKernel Descriptors for Visual Recognition
Kernel Descriptors for Visual Recognition
 
WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...
WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...
WEAKLY SUPERVISED FINE-GRAINED CATEGORIZATION WITH PART-BASED IMAGE REPRESENT...
 
A survey on coding binary visual features extracted from video sequences
A survey on coding binary visual features extracted from video sequencesA survey on coding binary visual features extracted from video sequences
A survey on coding binary visual features extracted from video sequences
 
poster_silhouette
poster_silhouetteposter_silhouette
poster_silhouette
 

More from Prerana Mukherjee

SALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUES
SALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUESSALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUES
SALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUESPrerana Mukherjee
 
SALIENCY MAP BASED IMPROVED SEGMENTATION
SALIENCY MAP BASED IMPROVED SEGMENTATIONSALIENCY MAP BASED IMPROVED SEGMENTATION
SALIENCY MAP BASED IMPROVED SEGMENTATIONPrerana Mukherjee
 
AUTOMATED BALL TRACKING IN TENNIS VIDEO
AUTOMATED BALL TRACKING IN TENNIS VIDEOAUTOMATED BALL TRACKING IN TENNIS VIDEO
AUTOMATED BALL TRACKING IN TENNIS VIDEOPrerana Mukherjee
 
A real-time ball trajectory follower using Robot Operating System
A real-time ball trajectory follower using Robot Operating SystemA real-time ball trajectory follower using Robot Operating System
A real-time ball trajectory follower using Robot Operating SystemPrerana Mukherjee
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildPrerana Mukherjee
 
Adaptive Image Compression Using Saliency and KAZE Features
Adaptive Image Compression Using Saliency and KAZE FeaturesAdaptive Image Compression Using Saliency and KAZE Features
Adaptive Image Compression Using Saliency and KAZE FeaturesPrerana Mukherjee
 
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...Prerana Mukherjee
 
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...Prerana Mukherjee
 
DRIZY- Collaborative Driver Assistance Over Wireless Networks
DRIZY- Collaborative Driver Assistance Over Wireless NetworksDRIZY- Collaborative Driver Assistance Over Wireless Networks
DRIZY- Collaborative Driver Assistance Over Wireless NetworksPrerana Mukherjee
 
imPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate Retrieval
imPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate RetrievalimPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate Retrieval
imPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate RetrievalPrerana Mukherjee
 

More from Prerana Mukherjee (10)

SALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUES
SALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUESSALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUES
SALPROP: SALIENT OBJECT PROPOSALS VIA AGGREGATED EDGE CUES
 
SALIENCY MAP BASED IMPROVED SEGMENTATION
SALIENCY MAP BASED IMPROVED SEGMENTATIONSALIENCY MAP BASED IMPROVED SEGMENTATION
SALIENCY MAP BASED IMPROVED SEGMENTATION
 
AUTOMATED BALL TRACKING IN TENNIS VIDEO
AUTOMATED BALL TRACKING IN TENNIS VIDEOAUTOMATED BALL TRACKING IN TENNIS VIDEO
AUTOMATED BALL TRACKING IN TENNIS VIDEO
 
A real-time ball trajectory follower using Robot Operating System
A real-time ball trajectory follower using Robot Operating SystemA real-time ball trajectory follower using Robot Operating System
A real-time ball trajectory follower using Robot Operating System
 
Enhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wildEnhanced characterness for text detection in the wild
Enhanced characterness for text detection in the wild
 
Adaptive Image Compression Using Saliency and KAZE Features
Adaptive Image Compression Using Saliency and KAZE FeaturesAdaptive Image Compression Using Saliency and KAZE Features
Adaptive Image Compression Using Saliency and KAZE Features
 
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
 
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
ADAPTIVE CRYPTO-STEGANOSYSTEM FOR VIDEOS BASED ON INFORMATION CONTENT AND VIS...
 
DRIZY- Collaborative Driver Assistance Over Wireless Networks
DRIZY- Collaborative Driver Assistance Over Wireless NetworksDRIZY- Collaborative Driver Assistance Over Wireless Networks
DRIZY- Collaborative Driver Assistance Over Wireless Networks
 
imPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate Retrieval
imPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate RetrievalimPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate Retrieval
imPlag: Detecting Image Plagiarism Using Hierarchical Near Duplicate Retrieval
 

Recently uploaded

Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsPooky Knightsmith
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 

Recently uploaded (20)

Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young minds
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 

Salient keypoint selection improves object matching

  • 1. Salient Keypoint Selection for Object Representation Paper ID: 1570232318 Twenty Second National Conference on Communications : NCC 2016 Authors: Prerana Mukherjee, Siddharth Srivastava, Brejesh Lall Department of Electrical Engineering Indian Institute of Technology, Delhi
  • 2. OVERVIEW Salient Keypoint Selection for Object Representation • Introduction • Background • Proposed Methodology • Experimental Results and Discussions • Conclusion
  • 3. INTRODUCTION • We propose a keypoint selection technique which utilizes SIFT and KAZE keypoint detectors, a texture map and Gabor Filter. • The obtained keypoints are a subset of SIFT and KAZE keypoints on the original image as well as the texture map. • These are ranked according to the proposed saliency score based on three criteria: • distinctivity, • detectability • repeatability • These keypoints are shown to be effectively able to characterize objects in an image.
  • 4. INTRODUCTION • Selecting relevant keypoints from a set of detected keypoints assists in reducing:  the computational complexity  error propagated due to irrelevant keypoints. • This would help in application domains where objects are primary concern such as object classification, detection, segmentation etc.
  • 5. Motivation Most matchable keypoints: regions with reasonably high Difference of Gaussian (DoG) responses. [1] KAZE features have strong response along the boundary of objects while SIFT captures shape, texture etc. similar to neuronal response of human vision system. [6]
  • 6. KEY CONTRIBUTIONS • First work using KAZE with SIFT keypoints for keypoint selection aimed at object characterization and its subsequent use for object matching. • Salient Keypoint selection of SIFT features on Gabor convolved image for representation of features inside object boundaries in context of object characterization. • Adapt distinctiveness, detectability and repeatability scores [1] for keypoints to Euclidean space.
  • 7. Background • SIFT has been the de-facto choice for keypoint extraction. • KAZE is a recent feature detection technique which exploits the non linear scale space to detect keypoints along edges and sharp discontinuities. • SIKA: A combination of SIFT and KAZE keypoints has shown complementary nature of these techniques. Though it shows the effectiveness of the combination in object classification, we provide a non-heuristic approach for extracting suitable keypoints from the image with the requisite properties.
  • 8. SIKA • SIKA keypoints [7] are direct combination of SIFT and KAZE keypoints. The selection consists of either all or a subset of keypoints based on the available object annotations. • Suited for Object Classification and similar tasks with available object annotations for training.
  • 11. SIFT vs KAZE vs SIKA Property SIFT KAZE SIKA Keypoint Distribution corners boundaries objects No. of Keypoints Large Relatively fewer Selective (Practically needs less than 50% of keypoints as compared to SIFT and KAZE) Scale Space Linear Non linear Both Descriptor size 128 dimensional descriptor 64/128 dimensional descriptor Respective Descriptors Object Classification [7] Lags behind CNN No where near CNN Comparable to CNN (not always)
  • 12. Proposed Methodology: An overview 1. Ranked combination: SIFT and KAZE keypoints + keypoints computed from the texture map produced by Gabor filter. 2. Sharp edges or transitions: key characteristics of objects [3]. SIFT or any other detector loses out on this crucial boundary information. 3. Supplement the SIFT and KAZE keypoints from original image with the SIFT keypoints obtained from the texture map using Gabor filter. Saliency map obtained using [5] is used to threshold out 'weak' keypoints. KAZE features based on non-linear anisotropic diffusion filtering [4].
  • 13. Proposed Methodology: Flow Fig 1. : Flow diagram for the proposed methodology
  • 14. Keypoint Selection and Ranking 1. Transformations: rotation (π/6, π/3, 2 ∗ π/3), scaling (0.5, 1.5, 2), cropping (20%, 50%), affine. Where SKP (i) : saliency score, Dist(KP(i)) : Distinctivity, Det(KP(i)) : Detectability, Rep(KP(i)) : Repeatability 2. The description of ith keypoint which gives the location (xi , yi) and response of the keypoint si . SKP (i) = Dist(KP(i)) + Det(KP(i)) + Rep(KP(i)) KP(i) = {(xi , yi), si}, i = 1...N
  • 15. Keypoint Selection and Ranking 3. Distinctiveness gives the summation of the Euclidean distances between every pair of keypoint descriptors in the same image.
  • 16. Keypoint Selection and Ranking 4. Repeatability gives Euclidean distance (ED) between the keypoint descriptor in the original image to the keypoint descriptor mapped in the corresponding transform, t. Here, nTransf is the number of transformations.
  • 17. Keypoint Selection and Ranking 5. Detectability gives the summation of the strengths of the keypoint in the original image and its respective transforms.
  • 18. Keypoint Selection and Ranking 6. We select the KAZE and SIFT keypoints which have saliency score greater than the respective mean saliency scores. where N is the total count of keypoint from respective detector and µsalscore is mean of the saliency scores.
  • 19. Texture Map based SIFT keypoints 1. SIFT keypoints are calculated on the original image. Then, the orientation histogram of the keypoints is constructed. The dominant orientations are found by binning the keypoint orientations into prespecified number of bins. The image is then convolved with Gabor filter using these dominant orientations. where u denotes the frequency of the sinusoidal function, θ gives the orientation of the function, σ is the standard deviation of the Gaussian function.
  • 20. Texture Map based SIFT keypoints 2. Next, the saliency map [5] is calculated for the original image. For each keypoint, if the saliency value is greater than the mean saliency then the keypoint is retained. where TextureKP denotes the set of keypoints which are salient for representing the texture. µsalmap denotes the mean of the saliency map.
  • 22. EXPERIMENTAL RESULTS AND DISCUSSIONS Datasets:  Caltech 101: to show the effectiveness of the algo. that the salient keypoints characterize and represent the objects.  VGG affine dataset: for object matching.
  • 24. Object Representation Fig. 2: Figure showing a) Object annotation b) Saliency Map c) Gabor filtered image (Texture Map) d) Ranked keypoints inside the object contour
  • 25. Object Representation Fig. 3: Texture and Ranked (SIFT and KAZE) keypoints
  • 27. Object Matching Fig. 4: Correctly matched keypoints by the proposed selection strategy: red (KAZE), yellow (SIFT), green (TextureKP) on the bikes dataset (VGG).
  • 28. Object Matching Fig. 5: Average ED vs top N% keypoints of the feature set
  • 29. CONCLUSION • Novel keypoint selection scheme based on SIFT and KAZE proposed. The technique incorporated texture information by finding SIFT keypoints on a texture map (using Gabor). • Technique can characterize an object region more efficiently than other contemporary detectors. • Less prone to false positives. • It will help in extending the existing object matching and classification algorithms. • Practical applications: object localization, segmentation and many other domains. • Holds promise to extend the existing state of the art in many application areas where objects are involved
  • 30. [1] W. Hartmann, M. Havlena, and K. Schindler, “Predicting matchability,” in Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on. IEEE, 2014, pp. 9–16. [2] S. Buoncompagni, D. Maio, D. Maltoni, and S. Papi, “Saliency-based keypoint selection for fast object detection and matching,” Pattern Recognition Letters, 2015. [3] B. Alexe, T. Deselaers, and V. Ferrari, “What is an object?” in Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2010, pp. 73–80. [4] P. Perona and J. Malik, “Scale-space and edge detection using anisotropic diffusion,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 12, no. 7, pp. 629–639, 1990. [5] P. Mukherjee, B. Lall, and A. Shah, “Saliency map based improved segmentation,” in Image Processing (ICIP), 2015 IEEE International Conference on (Accepted). IEEE, 2015. [6] P. Alcantarilla, A. Bartoli and A. Davison, “Kaze Features,” In Proceedings of the 12th European conference on Computer Vision, vol. 6, pp. 214-227, 2012. [7] Srivastava, Siddharth, Prerana Mukherjee, and Brejesh Lall. "Characterizing objects with SIKA features for multiclass classification." Applied Soft Computing (2015). Bibliography
  • 33. Convolve with Gaussian Downsample Step 1: Construction of Scale Space Scale Invariant Feature Transform: Keypoint Detection
  • 34. Gaussian images grouped by octave. DoG images grouped by octave
  • 35. Choose consecutive DoG images 26 neighbours Optimization Tricks: 1. For non-maxima and non-minima all points need not to be compared 2. First and last images in the octave need not be compared Take pixel if it is local maxima/local minima than all of them. This is called a KEYPOINT. Extrema Detection (for each pixel)
  • 36. • (b) Reject keypoints with low contrast • (c ) Reject keypoints that are localized along an edge Step II: Keypoint Localization
  • 37. • Create gradient histogram for the keypoint neighbourhood ( 36 bins) • Neighborhood: a circular Gaussian falloff from the keypoint center (sigma=1.5 pixels at the current scale, so the effective neighborhood is about 9x9) Step III: Orientation Assignment
  • 38. Any peak within 80% of the highest peak is used to create a keypoint with that orientation Orientation Assignment (Contd…)
  • 39. Extracted keypoints, arrows indicating scale and orientation
  • 40. • Take 16x16 square window around detected keypoint • Decompose this into 4x4 tiles • Compute gradient orientation for each pixel (8 bins) • Create histogram over edge orientations weighted by magnitude Adapted from slide by David Lowe 0 2 angle histogram 4x4x8= 128D Scale Invariant Feature Transform: Keypoint Description
  • 46. equation for building non linear scale space using AOS KAZE: Keypoint Detection
  • 47. Comparison between gaussian blurring and nonlinear diffusion Non linear vs linear scale space
  • 49. Scharr edge filter The Scharr operator is the most common technique with two kernels used to estimate the two dimensional second derivatives horizontally and vertically. The operator for the two direction is given by the following formula: KAZE: Keypoint Detection