Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and Classification
1. Spatially coherent latent topic model for concurrent object segmentation and classification Authors: Liangliang Cao, Li Fei-Fei Presenter: Shao-Chuan Wang
2. Outline Motivation A Review on Graphical Models Today’s topic: the paper Their Results
3. Motivation: Real-world problems are often full of “noise” Bags of words (local features): the spatial relationships of objects are ignored (has its limits) When classifying a test image, what is its “subject”? Flag? Banner? People? Sports field? From Prof. Fei-Fei’s ICCV09 tutorial slide
4. Outline Motivation A Review on Graphical Models Today’s topic: the paper Their Results
5. Generative vs Discriminative Generative model: model p(x, y), or equivalently p(x|y)p(y) Discriminative model: model p(y|x) directly [Plots of p(x|y) and p(y|x) over the data x omitted] From Prof. Antonio Torralba’s course slide
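To make the distinction concrete, here is a minimal 1-D sketch (the toy data, learning rate, and all other parameter choices are my own, not from the slides): the generative route fits p(x|y) per class and applies Bayes’ rule, while the discriminative route fits p(y|x) directly with logistic regression.

```python
import numpy as np

rng = np.random.default_rng(0)
# Two 1-D classes of toy data (hypothetical, for illustration only)
x = np.concatenate([rng.normal(20, 5, 200), rng.normal(50, 5, 200)])
y = np.concatenate([np.zeros(200), np.ones(200)])
xs = (x - x.mean()) / x.std()          # standardise for the gradient steps

def generative_predict(q):
    """Generative route: fit a Gaussian p(x|y) per class, apply Bayes' rule."""
    scores = []
    for c in (0, 1):
        xc = x[y == c]
        mu, var = xc.mean(), xc.var()
        scores.append(np.log((y == c).mean())          # log p(y=c)
                      - 0.5 * np.log(2 * np.pi * var)  # log p(x|y=c)
                      - (q - mu) ** 2 / (2 * var))
    return int(np.argmax(scores))

# Discriminative route: model p(y|x) directly
# (logistic regression fit by plain gradient descent)
w = b = 0.0
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-(w * xs + b)))
    w -= 0.5 * np.mean((p - y) * xs)
    b -= 0.5 * np.mean(p - y)

def discriminative_predict(q):
    qs = (q - x.mean()) / x.std()
    return int(1.0 / (1.0 + np.exp(-(w * qs + b))) > 0.5)
```

Both routes agree on easy queries; the difference is what they model, not (here) what they predict.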
6. Generative model: an example Naïve Bayesian model (c: class, w: visual words) [Bayesian network: class node c with child nodes w1 … wn] Once we have learnt the distributions p(c) and p(w|c), we can classify a query image by Bayes’ rule
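The Naïve Bayes classifier over visual words can be sketched as follows; the toy codebook, counts, and class names are hypothetical, and Laplace smoothing is added (my choice) to avoid zero probabilities:

```python
import numpy as np

vocab = 4  # codebook size (hypothetical)
train = {  # class -> bags of visual-word indices (toy data)
    "face": [[0, 0, 1, 2], [0, 1, 1, 0]],
    "car":  [[3, 3, 2, 3], [2, 3, 3, 1]],
}

# Learn p(w|c) with Laplace (+1) smoothing, and the class prior p(c)
word_probs, class_priors = {}, {}
total_docs = sum(len(docs) for docs in train.values())
for c, docs in train.items():
    counts = np.ones(vocab)            # +1 smoothing
    for doc in docs:
        for w in doc:
            counts[w] += 1
    word_probs[c] = counts / counts.sum()
    class_priors[c] = len(docs) / total_docs

def classify(words):
    # argmax_c [ log p(c) + sum_i log p(w_i|c) ]  (naive independence)
    scores = {c: np.log(class_priors[c])
                 + sum(np.log(word_probs[c][w]) for w in words)
              for c in train}
    return max(scores, key=scores.get)
```

Note the naive assumption: the words w1 … wn are conditionally independent given the class c, which is exactly the star-shaped Bayesian network on the slide.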
7. Generative model: another example Gaussian mixture model How do we infer from unlabeled data, even if we know the underlying structure of the probability distribution?
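The standard answer to that question is expectation–maximization (EM): alternate soft assignments of points to components (E-step) with parameter re-estimation (M-step). A minimal 1-D, two-component sketch (toy data and initialisation choices are my own):

```python
import numpy as np

rng = np.random.default_rng(1)
# Unlabeled 1-D data drawn from two Gaussians (the labels are hidden)
data = np.concatenate([rng.normal(0, 1, 300), rng.normal(6, 1, 300)])

mu = np.array([data.min(), data.max()])   # crude initialisation
var = np.array([1.0, 1.0])
pi = np.array([0.5, 0.5])
for _ in range(50):
    # E-step: responsibility of each component for each point
    dens = pi * np.exp(-(data[:, None] - mu) ** 2 / (2 * var)) \
           / np.sqrt(2 * np.pi * var)
    resp = dens / dens.sum(axis=1, keepdims=True)
    # M-step: re-estimate mixing weights, means, and variances
    nk = resp.sum(axis=0)
    pi = nk / len(data)
    mu = (resp * data[:, None]).sum(axis=0) / nk
    var = (resp * (data[:, None] - mu) ** 2).sum(axis=0) / nk
```

After a few iterations the estimated means settle near the true component means (0 and 6), even though no point was ever labeled.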
20. Spatial Latent Topic Model (Unsupervised) Multinomial with a Dirichlet prior Maximize the log-likelihood an optimization problem: a closed-form solution is intractable
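As a hedged sketch of the generative process (the notation below follows standard LDA-style topic models and is not copied verbatim from the paper): θ is drawn from a Dirichlet prior with parameter α, each region r draws a latent topic z_r from θ, and the region’s appearance w_r is drawn from that topic’s multinomial:

```latex
p(\mathbf{w}, \mathbf{z}, \theta \mid \alpha, \beta)
  = p(\theta \mid \alpha) \prod_{r} p(z_r \mid \theta)\, p(w_r \mid z_r, \beta)
```

Maximizing the log-likelihood requires integrating over θ and summing over all topic assignments z, which is why no closed-form solution exists.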
21. Variational Message Passing (Winn 2005) The coupling of the hidden variables θ, α, β makes the maximization intractable Instead, maximize a lower bound of L Goal: find a tractable Q(H) that closely approximates the true posterior distribution P(H|V) (the decomposition holds for any distribution Q) ← or, equivalently, minimize KL(Q||P)
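The lower-bound decomposition referred to here can be written explicitly; for any distribution Q(H) over the hidden variables H, with V the observed variables,

```latex
\ln p(V) \;=\;
\underbrace{\sum_{H} Q(H) \ln \frac{p(H, V)}{Q(H)}}_{\mathcal{L}(Q)}
\;+\;
\underbrace{\sum_{H} Q(H) \ln \frac{Q(H)}{p(H \mid V)}}_{\mathrm{KL}(Q \,\|\, P)}
```

Since KL(Q‖P) ≥ 0, we have L(Q) ≤ ln p(V), so maximizing the tractable bound L(Q) over Q is equivalent to minimizing KL(Q‖P), with equality exactly when Q(H) = P(H|V).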
22. Variational Message Passing (Winn 2005) Further factorization assumptions (Jordan et al., 1999; Jaakkola, 2001; Parisi, 1988) restrict the family of distributions Q: Q(H) factorizes over groups of hidden variables, and the bound splits into an expected log-joint term plus an entropy term
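This mean-field factorization, and the fixed-point update it yields for each factor, can be sketched as:

```latex
Q(H) = \prod_i Q_i(H_i), \qquad
\ln Q_j^{\ast}(H_j) = \mathbb{E}_{Q,\; i \neq j}\!\left[\ln p(H, V)\right] + \mathrm{const}
```

Each factor Q_j is updated in turn using expectations under all the other factors, which is what gives the method its message-passing flavour.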
24. Spatial Latent Topic Model (Supervised) θ now becomes a C x K matrix, i.e. θ depends on the observed class c For a query image I_d, find its most probable category c:
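The classification rule on this slide can be written as follows, where λ̂ and α̂ denote the learned parameters (the exact conditioning is a sketch in the slide’s notation, not copied from the paper):

```latex
c^{\ast} = \arg\max_{c} \; p(I_d \mid c, \hat{\lambda}, \hat{\alpha})
```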
25. Process Training step: maximize the total likelihood of the training images with respect to λ, α, θ and z_r The learned λ, α are then fixed Testing phase, for a query image I_d: estimate its θ_d and z_r For the classification task, take its most probable latent topic’s category For the segmentation task, merge regions that share the same z_r
26. Outline Motivation A Review on Graphical Models Today’s topic: the paper Their Results
28. Experimental Results Supervised segmentation Dataset: 13 classes of natural scenes # of training images: 100 # of topics: 60 # of categories: 13
29. Experimental Results Supervised classification Dataset: 28 classes from Caltech 101 # of training images: 30 # of test images: 30 # of topics in category: 28 # of topics in clutter: 34 6 background classes are left unlabeled