SlideShare a Scribd company logo
1 of 1
Download to read offline
Towards Learning Semantically Relevant Dictionary for Visual Category
Recognition
Ashish Gupta, Richard Bowden
Centre for Vision, Speech, and Signal Processing, University of Surrey, Guildford, United Kingdom
Objective
Transform feature space rendered by the local patch
affine invariant feature descriptor to a semantically
relevant space for visual categorisation.
Challenge
Large intra-category visual appearance variation.
Training data: insufficient, noisy, background clutter.
Feature descriptor is high-dimensional, sparsely
populated, and renders highly inter-mixed vectors in
feature space.
Topic ← Words
Feature space is
assumed to have
local semantic
integrity.
Intra-category
appearance variance
ameliorated.
Grouping Scattered Clusters
Analyse Image-Word
co-occurrence
statistics.
Similar occurrence
⇒ semantic
equivalence.
Use co-clustering to
discover word
groups.
Group such words
into topics.
Multiple Sub-Manifolds
visual category ← object part
visual σ2
(object part) is small . d(part1, part2) is large.
Disambiguation by projection to Sub-Manifolds
Separating
inter-mixed
descriptors.
Dual objective
of inter-vector
distance and
sub-manifold
embedding
overcomes
limitation of
hard
partitioning.
Influence of Co-clustering
Co-clustering aids grouping of semantically
equivalent descriptors (similar co-occurrence
statistics or similar sub-manifold embedding) by
projecting from a higher dimensional space (words) to
lower dimensional space (topics). This effectively
reduces separation between equivalent descriptors,
verified using a K-NN classifier.
Experiment: Grouping Scattered Clusters
Comparative classification performance (F1 score) of
standard clustered dictionary (BoW) vs. grouping
scattered clusters dictionary for all categories of VOC
2010 data set; dictionary size is 1000.
Grouping clusters: different co-clustering methods
Comparison of Information-theoretic (i) and
sum-squared Residue (r) co-clustering methods.
Grouping clusters: influence of dictionary size
Topics (100,500,1000,5000) ← Words (10,000)
Comparative F1 score, averaged for all categories, for
various datasets.
Experiment: Multiple Sub-Manifold
Comparative classification performance (F1 score) of
standard clustered dictionary (BoW) vs.
multi-manifold dictionary (SSRBC) for all categories of
VOC 2010 data set; dictionary size is 100.
Multi-Manifolds: different co-clustering methods
Comparison of Information-theoretic (i) and
sum-squared Residue (r) co-clustering methods.
Towards Semantically Relevant Space
Group semantically similar small clusters.
Multi-manifolds dictionary.
Prune non-discriminative space.
Combine these paradigms.
Summary
The improvement in classification performance
supports the hypotheses that semantic relevance of
feature space can be improved by grouping scattered
tiny clusters based on image-word co-occurrence and
learning a dictionary on multiple sub-manifolds, which
disambiguates descriptors by projecting them to
different sub-manifolds. Future work implements
pruning non-discriminative space and combine these
paradigms to render a semantically relevant space.
Acknowledgement
Supported by the EU project Dicta-Sign (FP7/2007-2013) under
Grant No. 231135 and PASCAL 2.
Center for Vision, Speech, and Signal Processing - University of Surrey - Guildford, United Kingdom Mail: a.gupta@surrey.ac.uk WWW: http://www.ee.surrey.ac.uk/cvssp

More Related Content

Viewers also liked

Game theory
Game theoryGame theory
Game theorygtush24
 
Sound analysis and processing with MATLAB
Sound analysis and processing with MATLABSound analysis and processing with MATLAB
Sound analysis and processing with MATLABTan Hoang Luu
 
Chapter 7 retail managment
Chapter   7 retail managmentChapter   7 retail managment
Chapter 7 retail managmentsonny recato
 
Equilibrium in Nash’s mind (with references)
Equilibrium in Nash’s mind (with references)Equilibrium in Nash’s mind (with references)
Equilibrium in Nash’s mind (with references)Vasil Penchev
 
Speech signal processing lizy
Speech signal processing lizySpeech signal processing lizy
Speech signal processing lizyLizy Abraham
 
Game theory
Game theoryGame theory
Game theoryamaroks
 
Game theory and its applications
Game theory and its applicationsGame theory and its applications
Game theory and its applicationsEranga Weerasekara
 
Introduction to Digital Signal Processing
Introduction to Digital Signal ProcessingIntroduction to Digital Signal Processing
Introduction to Digital Signal Processingop205
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
An introduction to Game Theory
An introduction to Game TheoryAn introduction to Game Theory
An introduction to Game TheoryPaul Trafford
 
Game Theory Presentation
Game Theory PresentationGame Theory Presentation
Game Theory PresentationMehdi Ghotbi
 

Viewers also liked (20)

John Nash Ppt
John Nash PptJohn Nash Ppt
John Nash Ppt
 
Game theory
Game theoryGame theory
Game theory
 
Game theory
Game theoryGame theory
Game theory
 
Matlab: Speech Signal Analysis
Matlab: Speech Signal AnalysisMatlab: Speech Signal Analysis
Matlab: Speech Signal Analysis
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
Sound analysis and processing with MATLAB
Sound analysis and processing with MATLABSound analysis and processing with MATLAB
Sound analysis and processing with MATLAB
 
Chapter 7 retail managment
Chapter   7 retail managmentChapter   7 retail managment
Chapter 7 retail managment
 
Equilibrium in Nash’s mind (with references)
Equilibrium in Nash’s mind (with references)Equilibrium in Nash’s mind (with references)
Equilibrium in Nash’s mind (with references)
 
Speech signal processing lizy
Speech signal processing lizySpeech signal processing lizy
Speech signal processing lizy
 
Game theory
Game theoryGame theory
Game theory
 
Nash equilibrium
Nash equilibriumNash equilibrium
Nash equilibrium
 
Game theory
Game theoryGame theory
Game theory
 
Game Theory
Game TheoryGame Theory
Game Theory
 
Game theory and its applications
Game theory and its applicationsGame theory and its applications
Game theory and its applications
 
Game theory
Game theoryGame theory
Game theory
 
Introduction to Digital Signal Processing
Introduction to Digital Signal ProcessingIntroduction to Digital Signal Processing
Introduction to Digital Signal Processing
 
Game theory
Game theoryGame theory
Game theory
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
An introduction to Game Theory
An introduction to Game TheoryAn introduction to Game Theory
An introduction to Game Theory
 
Game Theory Presentation
Game Theory PresentationGame Theory Presentation
Game Theory Presentation
 

Similar to Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition

Deep Neural Methods for Retrieval
Deep Neural Methods for RetrievalDeep Neural Methods for Retrieval
Deep Neural Methods for RetrievalBhaskar Mitra
 
Continuous bag of words cbow word2vec word embedding work .pdf
Continuous bag of words cbow word2vec word embedding work .pdfContinuous bag of words cbow word2vec word embedding work .pdf
Continuous bag of words cbow word2vec word embedding work .pdfdevangmittal4
 
Text Mining for Lexicography
Text Mining for LexicographyText Mining for Lexicography
Text Mining for LexicographyLeiden University
 
Doc format.
Doc format.Doc format.
Doc format.butest
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information RetrievalBhaskar Mitra
 
Question answer template
Question answer templateQuestion answer template
Question answer templateThanuw Chaks
 
Metrics for Evaluating Quality of Embeddings for Ontological Concepts
Metrics for Evaluating Quality of Embeddings for Ontological Concepts Metrics for Evaluating Quality of Embeddings for Ontological Concepts
Metrics for Evaluating Quality of Embeddings for Ontological Concepts Saeedeh Shekarpour
 
Challenges in transfer learning in nlp
Challenges in transfer learning in nlpChallenges in transfer learning in nlp
Challenges in transfer learning in nlpLaraOlmosCamarena
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information RetrievalBhaskar Mitra
 
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONAN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONijnlc
 
Visual Category Recognition using Information-Theoretic Co-Clustering
Visual Category Recognition using Information-Theoretic Co-ClusteringVisual Category Recognition using Information-Theoretic Co-Clustering
Visual Category Recognition using Information-Theoretic Co-ClusteringAshish Gupta
 
Effect of word embedding vector dimensionality on sentiment analysis through ...
Effect of word embedding vector dimensionality on sentiment analysis through ...Effect of word embedding vector dimensionality on sentiment analysis through ...
Effect of word embedding vector dimensionality on sentiment analysis through ...IAESIJAI
 
Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Andre Freitas
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
An introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semanticsAn introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semanticsAndre Freitas
 
5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information Retrieval5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information RetrievalBhaskar Mitra
 
Improving Text Categorization with Semantic Knowledge in Wikipedia
Improving Text Categorization with Semantic Knowledge in WikipediaImproving Text Categorization with Semantic Knowledge in Wikipedia
Improving Text Categorization with Semantic Knowledge in Wikipediachjshan
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationMario Sangiorgio
 

Similar to Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition (20)

Deep Neural Methods for Retrieval
Deep Neural Methods for RetrievalDeep Neural Methods for Retrieval
Deep Neural Methods for Retrieval
 
Continuous bag of words cbow word2vec word embedding work .pdf
Continuous bag of words cbow word2vec word embedding work .pdfContinuous bag of words cbow word2vec word embedding work .pdf
Continuous bag of words cbow word2vec word embedding work .pdf
 
Text Mining for Lexicography
Text Mining for LexicographyText Mining for Lexicography
Text Mining for Lexicography
 
Doc format.
Doc format.Doc format.
Doc format.
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
 
Question answer template
Question answer templateQuestion answer template
Question answer template
 
Metrics for Evaluating Quality of Embeddings for Ontological Concepts
Metrics for Evaluating Quality of Embeddings for Ontological Concepts Metrics for Evaluating Quality of Embeddings for Ontological Concepts
Metrics for Evaluating Quality of Embeddings for Ontological Concepts
 
Challenges in transfer learning in nlp
Challenges in transfer learning in nlpChallenges in transfer learning in nlp
Challenges in transfer learning in nlp
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
 
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATIONAN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
 
Visual Category Recognition using Information-Theoretic Co-Clustering
Visual Category Recognition using Information-Theoretic Co-ClusteringVisual Category Recognition using Information-Theoretic Co-Clustering
Visual Category Recognition using Information-Theoretic Co-Clustering
 
Effect of word embedding vector dimensionality on sentiment analysis through ...
Effect of word embedding vector dimensionality on sentiment analysis through ...Effect of word embedding vector dimensionality on sentiment analysis through ...
Effect of word embedding vector dimensionality on sentiment analysis through ...
 
Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
An introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semanticsAn introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semantics
 
5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information Retrieval5 Lessons Learned from Designing Neural Models for Information Retrieval
5 Lessons Learned from Designing Neural Models for Information Retrieval
 
Class14
Class14Class14
Class14
 
Improving Text Categorization with Semantic Knowledge in Wikipedia
Improving Text Categorization with Semantic Knowledge in WikipediaImproving Text Categorization with Semantic Knowledge in Wikipedia
Improving Text Categorization with Semantic Knowledge in Wikipedia
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result Diversification
 

Recently uploaded

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

Towards Learning a Semantically Relevant Dictionary for Visual Category Recognition

  • 1. Towards Learning Semantically Relevant Dictionary for Visual Category Recognition Ashish Gupta, Richard Bowden Centre for Vision, Speech, and Signal Processing, University of Surrey, Guildford, United Kingdom Objective Transform feature space rendered by the local patch affine invariant feature descriptor to a semantically relevant space for visual categorisation. Challenge Large intra-category visual appearance variation. Training data: insufficient, noisy, background clutter. Feature descriptor is high-dimensional, sparsely populated, and renders highly inter-mixed vectors in feature space. Topic ← Words Feature space is assumed to have local semantic integrity. Intra-category appearance variance ameliorated. Grouping Scattered Clusters Analyse Image-Word co-occurrence statistics. Similar occurrence ⇒ semantic equivalence. Use co-clustering to discover word groups. Group such words into topics. Multiple Sub-Manifolds visual category ← object part visual σ2 (object part) is small . d(part1, part2) is large. Disambiguation by projection to Sub-Manifolds Separating inter-mixed descriptors. Dual objective of inter-vector distance and sub-manifold embedding overcomes limitation of hard partitioning. Influence of Co-clustering Co-clustering aids grouping of semantically equivalent descriptors (similar co-occurrence statistics or similar sub-manifold embedding) by projecting from a higher dimensional space (words) to lower dimensional space (topics). This effectively reduces separation between equivalent descriptors, verified using a K-NN classifier. Experiment: Grouping Scattered Clusters Comparative classification performance (F1 score) of standard clustered dictionary (BoW) vs. grouping scattered clusters dictionary for all categories of VOC 2010 data set; dictionary size is 1000. Grouping clusters: different co-clustering methods Comparison of Information-theoretic (i) and sum-squared Residue (r) co-clustering methods. Grouping clusters: influence of dictionary size Topics (100,500,1000,5000) ← Words (10,000) Comparative F1 score, averaged for all categories, for various datasets. Experiment: Multiple Sub-Manifold Comparative classification performance (F1 score) of standard clustered dictionary (BoW) vs. multi-manifold dictionary (SSRBC) for all categories of VOC 2010 data set; dictionary size is 100. Multi-Manifolds: different co-clustering methods Comparison of Information-theoretic (i) and sum-squared Residue (r) co-clustering methods. Towards Semantically Relevant Space Group semantically similar small clusters. Multi-manifolds dictionary. Prune non-discriminative space. Combine these paradigms. Summary The improvement in classification performance supports the hypotheses that semantic relevance of feature space can be improved by grouping scattered tiny clusters based on image-word co-occurrence and learning a dictionary on multiple sub-manifolds, which disambiguates descriptors by projecting them to different sub-manifolds. Future work implements pruning non-discriminative space and combine these paradigms to render a semantically relevant space. Acknowledgement Supported by the EU project Dicta-Sign (FP7/2007-2013) under Grant No. 231135 and PASCAL 2. Center for Vision, Speech, and Signal Processing - University of Surrey - Guildford, United Kingdom Mail: a.gupta@surrey.ac.uk WWW: http://www.ee.surrey.ac.uk/cvssp