SlideShare a Scribd company logo
1 of 3
Download to read offline
Kernel Descriptors for Visual Recognition
by L.Bo, X.Ren and D.Fox
A Term Paper Report by Priyatham Bollimpalli (10010148)
Summary of the Paper
Popular Computer Vision algorithms like SIFT and HOG compute feature descriptor for an
image. A descriptor for an image is in simple terms, a concise representation of the image
properties which enables them to be used for many practical applications such as object
recognition, scene detection, image matching etc. Inspired from the orientation histogram
approach used in SIFT and HOG, this paper defines kernel orientation histogram and then
designs kernel descriptors for gradient, colour and local binary pattern (shape) using match
kernels. The definition of these kernels resulted in the reduction of granularity of low level
pixel features and made the idea of similaritybetween patches(high level features) come true.
To generate kernels in a computationally feasible manner, first match kernels are
approximated to finite dimension taking a set of finite basis vector from sampled normalized
gradient vectors. Then to reduce the redundancy and generate the compact features, Kernel
Principal Component Analysis is done. It is shown experimentally that the error which results
in these two stages is very less. Now gradient, colour and shape kernel descriptors are
computed more efficiently and in a simple, straight forward way over the images.
Experiment is done on four publicly available datasets: Scene-15, Caltech101, CIFAR10 and
CIFAR10-ImageNet. These datasets are for image classification and Laplacian kernel SVMs is
used in the experiments to classify. It is shown that the gradient kernel descriptor performs
best among the proposed kernel descriptors. All of them perform better than the SIFT
descriptor and other sophisticated feature learning methods.
The main novelty in the paper is that this is the first work done on kernels which is based on
low-level visual feature learning and that shows better performance than very famous
methods which are used as default choice for many applications. But some of the limitations
of this proposed scheme is the high computational time (even after optimizing) compared to
other methods and difficulty in learning pixel attributes from large image collection to
approximate the kernel. But since this area of research is new, alternative kernel functions or
using the existing one in combination of other kernel methods may get around this limitation,
further improving the performance or using in other areas where SIFT is used such as object
tracking, multi-view matching etc.
Details and Explanation of the paper
The gradient orientation at a pixel plays an important role in describing the features of the
image and this concept has been extensively used in many image descriptors. For example,
SIFT descriptor assigns the orientations to 8 bins as depicted below across 4 x 4 block.
Feature vector of each pixel z is defined as F(z) = m(z)๏ค(z) where m(z) is the magnitude of
the gradient and the ith
component of ๏ค(z) is 1 if the gradient falls in ith bin and 0 otherwise.
Soft bin formulation can also be used as ๏ค(z) = max(cos(๏ฆ(z ),ai )9
, 0) where ๏ฆ(z ) is the gradient
and ai is bin center. Over a patch P, histogram of gradients is obtained by
๐นโ„Ž = โˆ‘ ๐‘šฬƒ( ๐‘ง)ฮด(z)๐‘งโˆˆ๐‘ƒ where ๐‘šฬƒ( ๐‘ง) = ๐‘š( ๐‘ง)/โˆšโˆ‘ ๐‘š( ๐‘ง)2 + ๐œ–๐‘งโˆˆ๐‘ƒ (normalized magnitudes)
Intuitively, the similarity between two patches P and Q from different images is defined as
Since there are only inner product in the RHS, kernel functions can be defined between two
pixels and hence kernalized notion of similarity between two patches (as in HOG) is obtained.
But defining the kernel in this way introduces quantization errors and poor performance.
So to capture image variations properly, Gradient match kernel is defined as follows.
Here kp and ko are Gaussian kernels over position of pixel and orientations respectively. To
get more accuracy and for defining in uniform way, the values of pixel positions and
orientations are normalized.
The motivation for defining the gradient match kernel K as product of three kernels is as
follows. First we have to weigh the contribution of each pixel gradient magnitude and
normalized linear kernel is used for this. Then a measure of similarity of gradient orientations
should be included and the last Gaussian kernel kp measures how close two pixels are
spatially. By similar motivation, colour match kernel is defined (c(z) is the colour at z).
In shape kernel, s is the standard deviation of pixel values in the 3 x 3 neighborhood, b (z) is
binary column vector with the pixel value differences in a local window around z. Thus in
Shape Kernel descriptor, the contribution of each local binary pattern s(z) is weighed, and
shape similarity is obtained through local binary patterns b(z).
Features over image patches can be expressed as
Since Gaussian kernels are used, Fgrad(P) has infinite dimensions. Directly applying KPCA may
be computationally infeasible when the number of patches is very large. So first match kernels
are approximated directly by learning ๏ฌnite-dimensional features obtained by projecting
Fgrad(P ) into a set of basis vectors. An example to approximate Gaussian kernel over gradients
to d dimensions is shown below. Here xi are sampled normalized gradient vectors.
Note that the Kronekar product โจ‚ is used to compute the features which still results in large
number of dimensions. Now to achieve fewer compact features, KPCA is done. This makes
the computation time of evaluation practical. The tth
kernel principle component is written as
Finally the gradient kernel descriptor is expressed as shown below. It is shown that the error
incurred in approximating the match kernels in this way is very less.
The gradient (KDES-G), color (KDES-C), and shape (KDES-S) kernel descriptors are compared
to SIFT and several other state of the art object recognition algorithms using four publicly
available datasets of Scene-15, Caltech101, CIFAR10, and CIFAR10-ImageNet. Except in
CIFAR10, Laplacian kernel SVMs are used in the experiments. The summary of the result is
shown below. The combination of the three kernel descriptors is observed to boost the
performance by 2%. Thus we can see that the proposed kernel descriptor outperforms all the
other methods.
Scene-15 Caltech-101
KDES 86.7% KDES 76.4% CDBN[2]
65.5%
SIFT 82.2% SPM [1]
64.4% LCC[4]
73.4%
CIFAR10 KDES 76.0% LCC[4]
74.5%
mcRBM-DBN[3]
71.0% TCNN[5]
73.1%
[1]Lazebnik, Schmid, Ponce, CVPR '06 [2]Lee, Grosse, Ranganath, Ng, ICML '09 [3]Ranzato, Hinton, CVPR '10 [4]Yu,
Zhang, ICML '10 [5]Le, Ngiam, Chen, Chia, Koh, Ng, NIPS '10

More Related Content

What's hot

Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingMLAI2
ย 
Aerial detection part2
Aerial detection part2Aerial detection part2
Aerial detection part2ssuser456ad6
ย 
Modification on Energy Efficient Design of DVB-T2 Constellation De-mapper
Modification on Energy Efficient Design of DVB-T2 Constellation De-mapperModification on Energy Efficient Design of DVB-T2 Constellation De-mapper
Modification on Energy Efficient Design of DVB-T2 Constellation De-mapperIJERA Editor
ย 
Aerial detection1
Aerial detection1Aerial detection1
Aerial detection1ssuser456ad6
ย 
Double Patterning
Double PatterningDouble Patterning
Double PatterningDanny Luk
ย 
Double Patterning (4/2 update)
Double Patterning (4/2 update)Double Patterning (4/2 update)
Double Patterning (4/2 update)Danny Luk
ย 
Double Patterning
Double PatterningDouble Patterning
Double PatterningDanny Luk
ย 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
ย 
Building and road detection from large aerial imagery
Building and road detection from large aerial imageryBuilding and road detection from large aerial imagery
Building and road detection from large aerial imageryShunta Saito
ย 
Comparison of Various RCNN techniques for Classification of Object from Image
Comparison of Various RCNN techniques for Classification of Object from ImageComparison of Various RCNN techniques for Classification of Object from Image
Comparison of Various RCNN techniques for Classification of Object from ImageIRJET Journal
ย 
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...Ramoni Adeogun, PhD
ย 
Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Universitat de Barcelona
ย 
Detection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒ
Detection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒDetection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒ
Detection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒtaeseon ryu
ย 
DNR - Auto deep lab paper review ppt
DNR - Auto deep lab paper review pptDNR - Auto deep lab paper review ppt
DNR - Auto deep lab paper review ppttaeseon ryu
ย 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis PresentationReuben Feinman
ย 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Sunando Sengupta
ย 
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...idescitation
ย 
Convolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approachConvolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approachUniversitat de Barcelona
ย 
Advanced Lighting Techniques Dan Baker (Meltdown 2005)
Advanced Lighting Techniques   Dan Baker (Meltdown 2005)Advanced Lighting Techniques   Dan Baker (Meltdown 2005)
Advanced Lighting Techniques Dan Baker (Meltdown 2005)mobius.cn
ย 

What's hot (19)

Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset Pooling
ย 
Aerial detection part2
Aerial detection part2Aerial detection part2
Aerial detection part2
ย 
Modification on Energy Efficient Design of DVB-T2 Constellation De-mapper
Modification on Energy Efficient Design of DVB-T2 Constellation De-mapperModification on Energy Efficient Design of DVB-T2 Constellation De-mapper
Modification on Energy Efficient Design of DVB-T2 Constellation De-mapper
ย 
Aerial detection1
Aerial detection1Aerial detection1
Aerial detection1
ย 
Double Patterning
Double PatterningDouble Patterning
Double Patterning
ย 
Double Patterning (4/2 update)
Double Patterning (4/2 update)Double Patterning (4/2 update)
Double Patterning (4/2 update)
ย 
Double Patterning
Double PatterningDouble Patterning
Double Patterning
ย 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
ย 
Building and road detection from large aerial imagery
Building and road detection from large aerial imageryBuilding and road detection from large aerial imagery
Building and road detection from large aerial imagery
ย 
Comparison of Various RCNN techniques for Classification of Object from Image
Comparison of Various RCNN techniques for Classification of Object from ImageComparison of Various RCNN techniques for Classification of Object from Image
Comparison of Various RCNN techniques for Classification of Object from Image
ย 
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
ย 
Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...
ย 
Detection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒ
Detection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒDetection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒ
Detection focal loss ๋”ฅ๋Ÿฌ๋‹ ๋…ผ๋ฌธ์ฝ๊ธฐ ๋ชจ์ž„ ๋ฐœํ‘œ์ž๋ฃŒ
ย 
DNR - Auto deep lab paper review ppt
DNR - Auto deep lab paper review pptDNR - Auto deep lab paper review ppt
DNR - Auto deep lab paper review ppt
ย 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
ย 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
ย 
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
ย 
Convolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approachConvolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approach
ย 
Advanced Lighting Techniques Dan Baker (Meltdown 2005)
Advanced Lighting Techniques   Dan Baker (Meltdown 2005)Advanced Lighting Techniques   Dan Baker (Meltdown 2005)
Advanced Lighting Techniques Dan Baker (Meltdown 2005)
ย 

Similar to Kernel Descriptors for Visual Recognition

APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGAPPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGsipij
ย 
A survey on feature descriptors for texture image classification
A survey on feature descriptors for texture image classificationA survey on feature descriptors for texture image classification
A survey on feature descriptors for texture image classificationIRJET Journal
ย 
Oc2423022305
Oc2423022305Oc2423022305
Oc2423022305IJERA Editor
ย 
Currency recognition on mobile phones
Currency recognition on mobile phonesCurrency recognition on mobile phones
Currency recognition on mobile phoneshabeebsab
ย 
Deferred Pixel Shading on the PLAYSTATIONยฎ3
Deferred Pixel Shading on the PLAYSTATIONยฎ3Deferred Pixel Shading on the PLAYSTATIONยฎ3
Deferred Pixel Shading on the PLAYSTATIONยฎ3Slide_N
ย 
Low complexity features for jpeg steganalysis using undecimated dct
Low complexity features for jpeg steganalysis using undecimated dctLow complexity features for jpeg steganalysis using undecimated dct
Low complexity features for jpeg steganalysis using undecimated dctPvrtechnologies Nellore
ย 
Speeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object RecognitionSpeeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object RecognitionCSCJournals
ย 
Probabilistic model based image segmentation
Probabilistic model based image segmentationProbabilistic model based image segmentation
Probabilistic model based image segmentationijma
ย 
Texture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary patternTexture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary patternProjectsatbangalore
ย 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptorswolf
ย 
Object Shape Representation by Kernel Density Feature Points Estimator
Object Shape Representation by Kernel Density Feature Points Estimator Object Shape Representation by Kernel Density Feature Points Estimator
Object Shape Representation by Kernel Density Feature Points Estimator cscpconf
ย 
Conception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdfConception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdfSofianeHassine2
ย 
11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrieval11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrievalAlexander Decker
ย 
3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrieval3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrievalAlexander Decker
ย 
11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networks11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networksAlexander Decker
ย 
3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networks3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networksAlexander Decker
ย 
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...iosrjce
ย 
Image Enhancement Using Filter To Adjust Dynamic Range of Pixels
Image Enhancement Using Filter To Adjust Dynamic Range of PixelsImage Enhancement Using Filter To Adjust Dynamic Range of Pixels
Image Enhancement Using Filter To Adjust Dynamic Range of PixelsIJERA Editor
ย 

Similar to Kernel Descriptors for Visual Recognition (20)

APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGAPPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
ย 
A survey on feature descriptors for texture image classification
A survey on feature descriptors for texture image classificationA survey on feature descriptors for texture image classification
A survey on feature descriptors for texture image classification
ย 
Oc2423022305
Oc2423022305Oc2423022305
Oc2423022305
ย 
Currency recognition on mobile phones
Currency recognition on mobile phonesCurrency recognition on mobile phones
Currency recognition on mobile phones
ย 
Deferred Pixel Shading on the PLAYSTATIONยฎ3
Deferred Pixel Shading on the PLAYSTATIONยฎ3Deferred Pixel Shading on the PLAYSTATIONยฎ3
Deferred Pixel Shading on the PLAYSTATIONยฎ3
ย 
Low complexity features for jpeg steganalysis using undecimated dct
Low complexity features for jpeg steganalysis using undecimated dctLow complexity features for jpeg steganalysis using undecimated dct
Low complexity features for jpeg steganalysis using undecimated dct
ย 
Speeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object RecognitionSpeeded-up and Compact Visual Codebook for Object Recognition
Speeded-up and Compact Visual Codebook for Object Recognition
ย 
Probabilistic model based image segmentation
Probabilistic model based image segmentationProbabilistic model based image segmentation
Probabilistic model based image segmentation
ย 
Texture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary patternTexture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary pattern
ย 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
ย 
Object Shape Representation by Kernel Density Feature Points Estimator
Object Shape Representation by Kernel Density Feature Points Estimator Object Shape Representation by Kernel Density Feature Points Estimator
Object Shape Representation by Kernel Density Feature Points Estimator
ย 
Conception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdfConception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdf
ย 
11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrieval11.graph cut based local binary patterns for content based image retrieval
11.graph cut based local binary patterns for content based image retrieval
ย 
3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrieval3.[18 30]graph cut based local binary patterns for content based image retrieval
3.[18 30]graph cut based local binary patterns for content based image retrieval
ย 
11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networks11.framework of smart mobile rfid networks
11.framework of smart mobile rfid networks
ย 
3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networks3.[13 21]framework of smart mobile rfid networks
3.[13 21]framework of smart mobile rfid networks
ย 
IJET-V2I6P17
IJET-V2I6P17IJET-V2I6P17
IJET-V2I6P17
ย 
B017360516
B017360516B017360516
B017360516
ย 
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
ย 
Image Enhancement Using Filter To Adjust Dynamic Range of Pixels
Image Enhancement Using Filter To Adjust Dynamic Range of PixelsImage Enhancement Using Filter To Adjust Dynamic Range of Pixels
Image Enhancement Using Filter To Adjust Dynamic Range of Pixels
ย 

More from Priyatham Bollimpalli

Meta Machine Learning: Hyperparameter Optimization
Meta Machine Learning: Hyperparameter OptimizationMeta Machine Learning: Hyperparameter Optimization
Meta Machine Learning: Hyperparameter OptimizationPriyatham Bollimpalli
ย 
Science and Ethics: The Manhattan Project during World War II
Science and Ethics: The Manhattan Project during World War IIScience and Ethics: The Manhattan Project during World War II
Science and Ethics: The Manhattan Project during World War IIPriyatham Bollimpalli
ย 
IIT JEE Seat Allocation System
IIT JEE Seat Allocation System IIT JEE Seat Allocation System
IIT JEE Seat Allocation System Priyatham Bollimpalli
ย 
Design and Fabrication of 4-bit processor
Design and Fabrication of  4-bit processorDesign and Fabrication of  4-bit processor
Design and Fabrication of 4-bit processorPriyatham Bollimpalli
ย 
Interface for Finding Close Matches from Translation Memory
Interface for Finding Close Matches from Translation MemoryInterface for Finding Close Matches from Translation Memory
Interface for Finding Close Matches from Translation MemoryPriyatham Bollimpalli
ย 
GCC RTL and Machine Description
GCC RTL and Machine DescriptionGCC RTL and Machine Description
GCC RTL and Machine DescriptionPriyatham Bollimpalli
ย 
The problem of Spatio-Temporal Invariant Points in Videos
The problem of Spatio-Temporal Invariant Points in VideosThe problem of Spatio-Temporal Invariant Points in Videos
The problem of Spatio-Temporal Invariant Points in VideosPriyatham Bollimpalli
ย 
Literature Survey on Interest Points based Watermarking
Literature Survey on Interest Points based WatermarkingLiterature Survey on Interest Points based Watermarking
Literature Survey on Interest Points based WatermarkingPriyatham Bollimpalli
ย 

More from Priyatham Bollimpalli (10)

Meta Machine Learning: Hyperparameter Optimization
Meta Machine Learning: Hyperparameter OptimizationMeta Machine Learning: Hyperparameter Optimization
Meta Machine Learning: Hyperparameter Optimization
ย 
Science and Ethics: The Manhattan Project during World War II
Science and Ethics: The Manhattan Project during World War IIScience and Ethics: The Manhattan Project during World War II
Science and Ethics: The Manhattan Project during World War II
ย 
Auction Portal
Auction PortalAuction Portal
Auction Portal
ย 
IIT JEE Seat Allocation System
IIT JEE Seat Allocation System IIT JEE Seat Allocation System
IIT JEE Seat Allocation System
ย 
Design and Fabrication of 4-bit processor
Design and Fabrication of  4-bit processorDesign and Fabrication of  4-bit processor
Design and Fabrication of 4-bit processor
ย 
Library Management System
Library  Management  SystemLibrary  Management  System
Library Management System
ย 
Interface for Finding Close Matches from Translation Memory
Interface for Finding Close Matches from Translation MemoryInterface for Finding Close Matches from Translation Memory
Interface for Finding Close Matches from Translation Memory
ย 
GCC RTL and Machine Description
GCC RTL and Machine DescriptionGCC RTL and Machine Description
GCC RTL and Machine Description
ย 
The problem of Spatio-Temporal Invariant Points in Videos
The problem of Spatio-Temporal Invariant Points in VideosThe problem of Spatio-Temporal Invariant Points in Videos
The problem of Spatio-Temporal Invariant Points in Videos
ย 
Literature Survey on Interest Points based Watermarking
Literature Survey on Interest Points based WatermarkingLiterature Survey on Interest Points based Watermarking
Literature Survey on Interest Points based Watermarking
ย 

Recently uploaded

UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
ย 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
ย 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
ย 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
ย 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
ย 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Christo Ananth
ย 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
ย 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
ย 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7Call Girls in Nagpur High Profile Call Girls
ย 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
ย 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
ย 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
ย 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
ย 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spaintimesproduction05
ย 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756dollysharma2066
ย 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
ย 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
ย 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
ย 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
ย 

Recently uploaded (20)

UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
ย 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
ย 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
ย 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
ย 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
ย 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
ย 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
ย 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
ย 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
ย 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
ย 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
ย 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
ย 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
ย 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
ย 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
ย 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
ย 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
ย 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
ย 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
ย 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
ย 

Kernel Descriptors for Visual Recognition

  • 1. Kernel Descriptors for Visual Recognition by L.Bo, X.Ren and D.Fox A Term Paper Report by Priyatham Bollimpalli (10010148) Summary of the Paper Popular Computer Vision algorithms like SIFT and HOG compute feature descriptor for an image. A descriptor for an image is in simple terms, a concise representation of the image properties which enables them to be used for many practical applications such as object recognition, scene detection, image matching etc. Inspired from the orientation histogram approach used in SIFT and HOG, this paper defines kernel orientation histogram and then designs kernel descriptors for gradient, colour and local binary pattern (shape) using match kernels. The definition of these kernels resulted in the reduction of granularity of low level pixel features and made the idea of similaritybetween patches(high level features) come true. To generate kernels in a computationally feasible manner, first match kernels are approximated to finite dimension taking a set of finite basis vector from sampled normalized gradient vectors. Then to reduce the redundancy and generate the compact features, Kernel Principal Component Analysis is done. It is shown experimentally that the error which results in these two stages is very less. Now gradient, colour and shape kernel descriptors are computed more efficiently and in a simple, straight forward way over the images. Experiment is done on four publicly available datasets: Scene-15, Caltech101, CIFAR10 and CIFAR10-ImageNet. These datasets are for image classification and Laplacian kernel SVMs is used in the experiments to classify. It is shown that the gradient kernel descriptor performs best among the proposed kernel descriptors. All of them perform better than the SIFT descriptor and other sophisticated feature learning methods. The main novelty in the paper is that this is the first work done on kernels which is based on low-level visual feature learning and that shows better performance than very famous methods which are used as default choice for many applications. But some of the limitations of this proposed scheme is the high computational time (even after optimizing) compared to other methods and difficulty in learning pixel attributes from large image collection to approximate the kernel. But since this area of research is new, alternative kernel functions or using the existing one in combination of other kernel methods may get around this limitation, further improving the performance or using in other areas where SIFT is used such as object tracking, multi-view matching etc.
  • 2. Details and Explanation of the paper The gradient orientation at a pixel plays an important role in describing the features of the image and this concept has been extensively used in many image descriptors. For example, SIFT descriptor assigns the orientations to 8 bins as depicted below across 4 x 4 block. Feature vector of each pixel z is defined as F(z) = m(z)๏ค(z) where m(z) is the magnitude of the gradient and the ith component of ๏ค(z) is 1 if the gradient falls in ith bin and 0 otherwise. Soft bin formulation can also be used as ๏ค(z) = max(cos(๏ฆ(z ),ai )9 , 0) where ๏ฆ(z ) is the gradient and ai is bin center. Over a patch P, histogram of gradients is obtained by ๐นโ„Ž = โˆ‘ ๐‘šฬƒ( ๐‘ง)ฮด(z)๐‘งโˆˆ๐‘ƒ where ๐‘šฬƒ( ๐‘ง) = ๐‘š( ๐‘ง)/โˆšโˆ‘ ๐‘š( ๐‘ง)2 + ๐œ–๐‘งโˆˆ๐‘ƒ (normalized magnitudes) Intuitively, the similarity between two patches P and Q from different images is defined as Since there are only inner product in the RHS, kernel functions can be defined between two pixels and hence kernalized notion of similarity between two patches (as in HOG) is obtained. But defining the kernel in this way introduces quantization errors and poor performance. So to capture image variations properly, Gradient match kernel is defined as follows. Here kp and ko are Gaussian kernels over position of pixel and orientations respectively. To get more accuracy and for defining in uniform way, the values of pixel positions and orientations are normalized. The motivation for defining the gradient match kernel K as product of three kernels is as follows. First we have to weigh the contribution of each pixel gradient magnitude and normalized linear kernel is used for this. Then a measure of similarity of gradient orientations should be included and the last Gaussian kernel kp measures how close two pixels are spatially. By similar motivation, colour match kernel is defined (c(z) is the colour at z). In shape kernel, s is the standard deviation of pixel values in the 3 x 3 neighborhood, b (z) is binary column vector with the pixel value differences in a local window around z. Thus in Shape Kernel descriptor, the contribution of each local binary pattern s(z) is weighed, and shape similarity is obtained through local binary patterns b(z).
  • 3. Features over image patches can be expressed as Since Gaussian kernels are used, Fgrad(P) has infinite dimensions. Directly applying KPCA may be computationally infeasible when the number of patches is very large. So first match kernels are approximated directly by learning ๏ฌnite-dimensional features obtained by projecting Fgrad(P ) into a set of basis vectors. An example to approximate Gaussian kernel over gradients to d dimensions is shown below. Here xi are sampled normalized gradient vectors. Note that the Kronekar product โจ‚ is used to compute the features which still results in large number of dimensions. Now to achieve fewer compact features, KPCA is done. This makes the computation time of evaluation practical. The tth kernel principle component is written as Finally the gradient kernel descriptor is expressed as shown below. It is shown that the error incurred in approximating the match kernels in this way is very less. The gradient (KDES-G), color (KDES-C), and shape (KDES-S) kernel descriptors are compared to SIFT and several other state of the art object recognition algorithms using four publicly available datasets of Scene-15, Caltech101, CIFAR10, and CIFAR10-ImageNet. Except in CIFAR10, Laplacian kernel SVMs are used in the experiments. The summary of the result is shown below. The combination of the three kernel descriptors is observed to boost the performance by 2%. Thus we can see that the proposed kernel descriptor outperforms all the other methods. Scene-15 Caltech-101 KDES 86.7% KDES 76.4% CDBN[2] 65.5% SIFT 82.2% SPM [1] 64.4% LCC[4] 73.4% CIFAR10 KDES 76.0% LCC[4] 74.5% mcRBM-DBN[3] 71.0% TCNN[5] 73.1% [1]Lazebnik, Schmid, Ponce, CVPR '06 [2]Lee, Grosse, Ranganath, Ng, ICML '09 [3]Ranzato, Hinton, CVPR '10 [4]Yu, Zhang, ICML '10 [5]Le, Ngiam, Chen, Chia, Koh, Ng, NIPS '10