Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition

•

1 like•253 views

In Sparsity, Dictionaries and Projections in Machine Learning and Signal Processing, ICML Workshop, Edinburgh, Scotland, 2012.

Technology

Towards Learning Semantically Relevant Dictionary for Visual Category
Recognition
Ashish Gupta, Richard Bowden
Centre for Vision, Speech, and Signal Processing, University of Surrey, Guildford, United Kingdom
Objective
Transform feature space rendered by the local patch
afﬁne invariant feature descriptor to a semantically
relevant space for visual categorisation.
Challenge
Large intra-category visual appearance variation.
Training data: insufﬁcient, noisy, background clutter.
Feature descriptor is high-dimensional, sparsely
populated, and renders highly inter-mixed vectors in
feature space.
Topic ← Words
Feature space is
assumed to have
local semantic
integrity.
Intra-category
appearance variance
ameliorated.
Grouping Scattered Clusters
Analyse Image-Word
co-occurrence
statistics.
Similar occurrence
⇒ semantic
equivalence.
Use co-clustering to
discover word
groups.
Group such words
into topics.
Multiple Sub-Manifolds
visual category ← object part
visual σ2
(object part) is small . d(part1, part2) is large.
Disambiguation by projection to Sub-Manifolds
Separating
inter-mixed
descriptors.
Dual objective
of inter-vector
distance and
sub-manifold
embedding
overcomes
limitation of
hard
partitioning.
Inﬂuence of Co-clustering
Co-clustering aids grouping of semantically
equivalent descriptors (similar co-occurrence
statistics or similar sub-manifold embedding) by
projecting from a higher dimensional space (words) to
lower dimensional space (topics). This effectively
reduces separation between equivalent descriptors,
veriﬁed using a K-NN classiﬁer.
Experiment: Grouping Scattered Clusters
Comparative classiﬁcation performance (F1 score) of
standard clustered dictionary (BoW) vs. grouping
scattered clusters dictionary for all categories of VOC
2010 data set; dictionary size is 1000.
Grouping clusters: different co-clustering methods
Comparison of Information-theoretic (i) and
sum-squared Residue (r) co-clustering methods.
Grouping clusters: inﬂuence of dictionary size
Topics (100,500,1000,5000) ← Words (10,000)
Comparative F1 score, averaged for all categories, for
various datasets.
Experiment: Multiple Sub-Manifold
Comparative classiﬁcation performance (F1 score) of
standard clustered dictionary (BoW) vs.
multi-manifold dictionary (SSRBC) for all categories of
VOC 2010 data set; dictionary size is 100.
Multi-Manifolds: different co-clustering methods
Comparison of Information-theoretic (i) and
sum-squared Residue (r) co-clustering methods.
Towards Semantically Relevant Space
Group semantically similar small clusters.
Multi-manifolds dictionary.
Prune non-discriminative space.
Combine these paradigms.
Summary
The improvement in classiﬁcation performance
supports the hypotheses that semantic relevance of
feature space can be improved by grouping scattered
tiny clusters based on image-word co-occurrence and
learning a dictionary on multiple sub-manifolds, which
disambiguates descriptors by projecting them to
different sub-manifolds. Future work implements
pruning non-discriminative space and combine these
paradigms to render a semantically relevant space.
Acknowledgement
Supported by the EU project Dicta-Sign (FP7/2007-2013) under
Grant No. 231135 and PASCAL 2.
Center for Vision, Speech, and Signal Processing - University of Surrey - Guildford, United Kingdom Mail: a.gupta@surrey.ac.uk WWW: http://www.ee.surrey.ac.uk/cvssp

Viewers also liked

John Nash Pptapompouspanda

Game theorygtush24

Game theoryAbu Bashar

Matlab: Speech Signal AnalysisDataminingTools Inc

Speech Signal ProcessingMurtadha Alsabbagh

Sound analysis and processing with MATLABTan Hoang Luu

Chapter 7 retail managmentsonny recato

Equilibrium in Nash’s mind (with references)Vasil Penchev

Speech signal processing lizyLizy Abraham

Game theoryPT Education, Indore

Nash equilibriumprateek_floyd

Game theoryamaroks

Game TheoryMadhuri Gupta

Game theory and its applicationsEranga Weerasekara

Game theoryPankaj Sabherwal

Introduction to Digital Signal Processingop205

Game theoryDe La Salle University-Manila

Speech recognitionCharu Joshi

An introduction to Game TheoryPaul Trafford

Game Theory PresentationMehdi Ghotbi

Viewers also liked (20)

John Nash Ppt

Game theory

Matlab: Speech Signal Analysis

Speech Signal Processing

Sound analysis and processing with MATLAB

Chapter 7 retail managment

Equilibrium in Nash’s mind (with references)

Speech signal processing lizy

Game theory

Nash equilibrium

Game theory

Game Theory

Game theory and its applications

Game theory

Introduction to Digital Signal Processing

Game theory

Speech recognition

An introduction to Game Theory

Game Theory Presentation

Similar to Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition

Deep Neural Methods for RetrievalBhaskar Mitra

Continuous bag of words cbow word2vec word embedding work .pdfdevangmittal4

Text Mining for LexicographyLeiden University

Doc format.butest

Neural Models for Information RetrievalBhaskar Mitra

Question answer templateThanuw Chaks

Metrics for Evaluating Quality of Embeddings for Ontological Concepts Saeedeh Shekarpour

Challenges in transfer learning in nlpLaraOlmosCamarena

Neural Models for Information RetrievalBhaskar Mitra

AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONijnlc

Visual Category Recognition using Information-Theoretic Co-ClusteringAshish Gupta

Effect of word embedding vector dimensionality on sentiment analysis through ...IAESIJAI

Schema-agnositc queries over large-schema databases: a distributional semanti...Andre Freitas

THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig

An introduction to compositional models in distributional semanticsAndre Freitas

5 Lessons Learned from Designing Neural Models for Information RetrievalBhaskar Mitra

Class14Dr. Cupid Lucid

Improving Text Categorization with Semantic Knowledge in Wikipediachjshan

Current Approaches in Search Result DiversificationMario Sangiorgio

Similar to Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition (20)

Deep Neural Methods for Retrieval

Continuous bag of words cbow word2vec word embedding work .pdf

Text Mining for Lexicography

Doc format.

Neural Models for Information Retrieval

Question answer template

Metrics for Evaluating Quality of Embeddings for Ontological Concepts

Challenges in transfer learning in nlp

Neural Models for Information Retrieval

AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION

Visual Category Recognition using Information-Theoretic Co-Clustering

Effect of word embedding vector dimensionality on sentiment analysis through ...

Schema-agnositc queries over large-schema databases: a distributional semanti...

THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES

An introduction to compositional models in distributional semantics

5 Lessons Learned from Designing Neural Models for Information Retrieval

Class14

Improving Text Categorization with Semantic Knowledge in Wikipedia

Current Approaches in Search Result Diversification

Recently uploaded

How to convert PDF to text with Nanonetsnaman860154

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

Understanding the Laravel MVC ArchitecturePixlogix Infotech

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

How to Remove Document Management Hurdles with X-Docs?XfilesPro

Pigging Solutions in Pet Food ManufacturingPigging Solutions

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Recently uploaded (20)

How to convert PDF to text with Nanonets

Unblocking The Main Thread Solving ANRs and Frozen Frames

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...

GenCyber Cyber Security Day Presentation

Benefits Of Flutter Compared To Other Frameworks

Understanding the Laravel MVC Architecture

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...

Breaking the Kubernetes Kill Chain: Host Path Mount

Presentation on how to chat with PDF using ChatGPT code interpreter

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

How to Remove Document Management Hurdles with X-Docs?

Pigging Solutions in Pet Food Manufacturing

[2024]Digital Global Overview Report 2024 Meltwater.pdf

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition

1. Towards Learning Semantically Relevant Dictionary for Visual Category Recognition Ashish Gupta, Richard Bowden Centre for Vision, Speech, and Signal Processing, University of Surrey, Guildford, United Kingdom Objective Transform feature space rendered by the local patch affine invariant feature descriptor to a semantically relevant space for visual categorisation. Challenge Large intra-category visual appearance variation. Training data: insufficient, noisy, background clutter. Feature descriptor is high-dimensional, sparsely populated, and renders highly inter-mixed vectors in feature space. Topic ← Words Feature space is assumed to have local semantic integrity. Intra-category appearance variance ameliorated. Grouping Scattered Clusters Analyse Image-Word co-occurrence statistics. Similar occurrence ⇒ semantic equivalence. Use co-clustering to discover word groups. Group such words into topics. Multiple Sub-Manifolds visual category ← object part visual σ2 (object part) is small . d(part1, part2) is large. Disambiguation by projection to Sub-Manifolds Separating inter-mixed descriptors. Dual objective of inter-vector distance and sub-manifold embedding overcomes limitation of hard partitioning. Influence of Co-clustering Co-clustering aids grouping of semantically equivalent descriptors (similar co-occurrence statistics or similar sub-manifold embedding) by projecting from a higher dimensional space (words) to lower dimensional space (topics). This effectively reduces separation between equivalent descriptors, verified using a K-NN classifier. Experiment: Grouping Scattered Clusters Comparative classification performance (F1 score) of standard clustered dictionary (BoW) vs. grouping scattered clusters dictionary for all categories of VOC 2010 data set; dictionary size is 1000. Grouping clusters: different co-clustering methods Comparison of Information-theoretic (i) and sum-squared Residue (r) co-clustering methods. Grouping clusters: influence of dictionary size Topics (100,500,1000,5000) ← Words (10,000) Comparative F1 score, averaged for all categories, for various datasets. Experiment: Multiple Sub-Manifold Comparative classification performance (F1 score) of standard clustered dictionary (BoW) vs. multi-manifold dictionary (SSRBC) for all categories of VOC 2010 data set; dictionary size is 100. Multi-Manifolds: different co-clustering methods Comparison of Information-theoretic (i) and sum-squared Residue (r) co-clustering methods. Towards Semantically Relevant Space Group semantically similar small clusters. Multi-manifolds dictionary. Prune non-discriminative space. Combine these paradigms. Summary The improvement in classification performance supports the hypotheses that semantic relevance of feature space can be improved by grouping scattered tiny clusters based on image-word co-occurrence and learning a dictionary on multiple sub-manifolds, which disambiguates descriptors by projecting them to different sub-manifolds. Future work implements pruning non-discriminative space and combine these paradigms to render a semantically relevant space. Acknowledgement Supported by the EU project Dicta-Sign (FP7/2007-2013) under Grant No. 231135 and PASCAL 2. Center for Vision, Speech, and Signal Processing - University of Surrey - Guildford, United Kingdom Mail: a.gupta@surrey.ac.uk WWW: http://www.ee.surrey.ac.uk/cvssp

Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (20)

Similar to Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition

Similar to Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition (20)

Recently uploaded

Recently uploaded (20)

Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition