SlideShare a Scribd company logo
1 of 26
Large Scale Recognition and Retrieval
What does the world look like? Scaling to billions of images Object Recognition for large-scale search Focus on scaling rather than understanding image High level image statistics
Content-Based Image Retrieval Variety of simple/hand-designed cues: Color and/or Texture histograms, Shape, PCA, etc. Various distance metrics Earth Movers Distance (Rubner et al. ‘98) QBIC from IBM (1999) Blobworld, Carson et al. 2002
Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors
Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors
Matching features incategory-level recognition
Comparing sets of local features Previous strategies:  ,[object Object],	[Schmid, Lowe, Tuytelaars et al.] ,[object Object],	[Rubner et al., Belongie et al., Gold & Rangarajan, Wallraven & Caputo, Berg et al., Zhang et al.,…] ,[object Object],	[Csurka et al., Sivic & Zisserman, Lazebnik & Ponce] Slide credit: Kristen Grauman
Pyramid match kernel optimal partial matching [Grauman & Darrell, ICCV 2005] Optimal match:  O(m3) Pyramid match: O(mL) m = # features L = # levels in pyramid Slide credit: Kristen Grauman
Pyramid match: main idea Feature space partitions serve to “match” the local descriptors within successively wider regions. descriptor space Slide credit: Kristen Grauman
Pyramid match: main idea Histogram intersection counts number of possible matches at a given partitioning. Slide credit: Kristen Grauman
Computing the partial matching Earth Mover’s Distance [Rubner, Tomasi, Guibas 1998] Hungarian method [Kuhn, 1955] Greedy matching		            … Pyramid match		      	 [Grauman and Darrell, ICCV 2005] for sets with              features of dimension
Recognition on the ETH-80 Complexity Kernel Match [Wallraven et al.] Pyramid match Recognition accuracy (%) Testing time (s) Mean number of features per set (m) Mean number of features per set (m) Slide credit: Kristen Grauman
Pyramid match kernel: examples ofextensions and applications by other groups Spatial Pyramid Match Kernel Lazebnik, Schmid, Ponce, 2006. Dual-space Pyramid Matching Hu et al., 2007.  Representing Shape with a Pyramid Kernel   Bosch & Zisserman, 2007. Scene recognition Shape representation Medical image classification Slide credit: Kristen Grauman L=1 L=2 L=0
Pyramid match kernel: examples of extensions and applications by other groups wave sit down From Omnidirectional Images to Hierarchical Localization, Murillo et al. 2007. Single View Human Action Recognition using Key Pose Matching, Lv & Nevatia, 2007. Spatio-temporal  Pyramid Matching for Sports Videos, Choi et al., 2008. Action recognition Video indexing Robot localization Slide : Kristen Grauman
Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors
Learning how to compare images ,[object Object]
Number of existing techniques for metric learningdissimilar similar [Weinberger et al. 2004, Hertz et al. 2004, Frome et al. 2007, Varma & Ray 2007, Kumar et al. 2007]
Problem-specific knowledge Example sources of similarity constraints Fully labeled image databases Partially labeled image databases
Locality Sensitive Hashing (LSH) Gionis, A. & Indyk, P. & Motwani, R. (1999) ,[object Object]
Quantize each projection with few bits101 0 Descriptor in high D space 1 0 1 1 0
Fast Image Search for Learned MetricsJain, Kulis, & Grauman, CVPR 2008  Learn a Malhanobis metric for LSH h(   ) = h(   ) h(   ) ≠h(   ) Less likely to split pairs like those with similarity constraint More likely to split pairs like those with dissimilarity constraint  Slide : Kristen Grauman
Results: Flickr dataset Query time:  Error rate ,[object Object]
Categorize scene based on nearest exemplars
Base metric: Ling & Soatto’s Proximity Distribution Kernel (PDK)slower search                 faster search 30% of data                      2% of data Slide : Kristen Grauman
Results: Flickr dataset Query time:  Error rate ,[object Object]
Categorize scene based on nearest exemplars

More Related Content

What's hot

A Quantitative analysis and performance study for similarity search methods i...
A Quantitative analysis and performance study for similarity search methods i...A Quantitative analysis and performance study for similarity search methods i...
A Quantitative analysis and performance study for similarity search methods i...
Jungyeol
 
Real-Time Volumetric Tests (EG 2008)
Real-Time Volumetric Tests (EG 2008)Real-Time Volumetric Tests (EG 2008)
Real-Time Volumetric Tests (EG 2008)
Matthias Trapp
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
Pierre de Lacaze
 

What's hot (20)

A Quantitative analysis and performance study for similarity search methods i...
A Quantitative analysis and performance study for similarity search methods i...A Quantitative analysis and performance study for similarity search methods i...
A Quantitative analysis and performance study for similarity search methods i...
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...
 
Depth estimation using deep learning
Depth estimation using deep learningDepth estimation using deep learning
Depth estimation using deep learning
 
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
 
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
 
Seed net automatic seed generation with deep reinforcement learning for robus...
Seed net automatic seed generation with deep reinforcement learning for robus...Seed net automatic seed generation with deep reinforcement learning for robus...
Seed net automatic seed generation with deep reinforcement learning for robus...
 
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrievalMind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
 
A brief introduction to extracting information from images
A brief introduction to extracting information from imagesA brief introduction to extracting information from images
A brief introduction to extracting information from images
 
poster
posterposter
poster
 
CLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATACLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATA
 
Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...
Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...
Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...
 
Real-Time Volumetric Tests (EG 2008)
Real-Time Volumetric Tests (EG 2008)Real-Time Volumetric Tests (EG 2008)
Real-Time Volumetric Tests (EG 2008)
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
 
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learning
 
Object detection - RCNNs vs Retinanet
Object detection - RCNNs vs RetinanetObject detection - RCNNs vs Retinanet
Object detection - RCNNs vs Retinanet
 
V2 v posenet
V2 v posenetV2 v posenet
V2 v posenet
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
 
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
 
Automatic Building detection for satellite Images using IGV and DSM
Automatic Building detection for satellite Images using IGV and DSMAutomatic Building detection for satellite Images using IGV and DSM
Automatic Building detection for satellite Images using IGV and DSM
 
MultiModal Retrieval Image
MultiModal Retrieval ImageMultiModal Retrieval Image
MultiModal Retrieval Image
 

Viewers also liked

CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...
CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...
CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...
zukun
 
Tuto cvpr part1
Tuto cvpr part1Tuto cvpr part1
Tuto cvpr part1
zukun
 
CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...
CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...
CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...
zukun
 
NIPS2008: tutorial: statistical models of visual images
NIPS2008: tutorial: statistical models of visual imagesNIPS2008: tutorial: statistical models of visual images
NIPS2008: tutorial: statistical models of visual images
zukun
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2
zukun
 
ECCV2010: distance function and metric learning part 2
ECCV2010: distance function and metric learning part 2ECCV2010: distance function and metric learning part 2
ECCV2010: distance function and metric learning part 2
zukun
 

Viewers also liked (6)

CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...
CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...
CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...
 
Tuto cvpr part1
Tuto cvpr part1Tuto cvpr part1
Tuto cvpr part1
 
CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...
CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...
CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...
 
NIPS2008: tutorial: statistical models of visual images
NIPS2008: tutorial: statistical models of visual imagesNIPS2008: tutorial: statistical models of visual images
NIPS2008: tutorial: statistical models of visual images
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2
 
ECCV2010: distance function and metric learning part 2
ECCV2010: distance function and metric learning part 2ECCV2010: distance function and metric learning part 2
ECCV2010: distance function and metric learning part 2
 

Similar to Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

20110220 computer vision_eruhimov_lecture01
20110220 computer vision_eruhimov_lecture0120110220 computer vision_eruhimov_lecture01
20110220 computer vision_eruhimov_lecture01
Computer Science Club
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning Project
Adeyemi Fowe
 
Representative Previous Work
Representative Previous WorkRepresentative Previous Work
Representative Previous Work
butest
 
Representative Previous Work
Representative Previous WorkRepresentative Previous Work
Representative Previous Work
butest
 
image processing image processing image processing
image processing  image processing  image processingimage processing  image processing  image processing
image processing image processing image processing
SportsAcademy1
 
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
grssieee
 
Computer vision,,summer training programme
Computer vision,,summer training programmeComputer vision,,summer training programme
Computer vision,,summer training programme
Praveen Pandey
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
zukun
 

Similar to Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes (20)

20110220 computer vision_eruhimov_lecture01
20110220 computer vision_eruhimov_lecture0120110220 computer vision_eruhimov_lecture01
20110220 computer vision_eruhimov_lecture01
 
Soundarya m.sc
Soundarya m.scSoundarya m.sc
Soundarya m.sc
 
Multimedia searching
Multimedia searchingMultimedia searching
Multimedia searching
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning Project
 
Multilabel Image Retreval Using Hashing
Multilabel Image Retreval Using HashingMultilabel Image Retreval Using Hashing
Multilabel Image Retreval Using Hashing
 
Representative Previous Work
Representative Previous WorkRepresentative Previous Work
Representative Previous Work
 
Representative Previous Work
Representative Previous WorkRepresentative Previous Work
Representative Previous Work
 
image processing image processing image processing
image processing  image processing  image processingimage processing  image processing  image processing
image processing image processing image processing
 
Wang midterm-defence
Wang midterm-defenceWang midterm-defence
Wang midterm-defence
 
Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)
Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015) Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)
Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)
 
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
 
OBJECT DETECTION AND RECOGNITION: A SURVEY
OBJECT DETECTION AND RECOGNITION: A SURVEYOBJECT DETECTION AND RECOGNITION: A SURVEY
OBJECT DETECTION AND RECOGNITION: A SURVEY
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
 
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
 
Semantics In Digital Photos A Contenxtual Analysis
Semantics In Digital Photos A Contenxtual AnalysisSemantics In Digital Photos A Contenxtual Analysis
Semantics In Digital Photos A Contenxtual Analysis
 
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
 
A Review on Matching For Sketch Technique
A Review on Matching For Sketch TechniqueA Review on Matching For Sketch Technique
A Review on Matching For Sketch Technique
 
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
 
Computer vision,,summer training programme
Computer vision,,summer training programmeComputer vision,,summer training programme
Computer vision,,summer training programme
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
 

More from zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
zukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
zukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
zukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
zukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
zukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
zukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
zukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
zukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
zukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
zukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
zukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
zukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
zukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
zukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
zukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
zukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
zukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
zukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
zukun
 

More from zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Recently uploaded

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Recently uploaded (20)

Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 

Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

  • 1. Large Scale Recognition and Retrieval
  • 2. What does the world look like? Scaling to billions of images Object Recognition for large-scale search Focus on scaling rather than understanding image High level image statistics
  • 3. Content-Based Image Retrieval Variety of simple/hand-designed cues: Color and/or Texture histograms, Shape, PCA, etc. Various distance metrics Earth Movers Distance (Rubner et al. ‘98) QBIC from IBM (1999) Blobworld, Carson et al. 2002
  • 4. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors
  • 5. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors
  • 7.
  • 8. Pyramid match kernel optimal partial matching [Grauman & Darrell, ICCV 2005] Optimal match: O(m3) Pyramid match: O(mL) m = # features L = # levels in pyramid Slide credit: Kristen Grauman
  • 9. Pyramid match: main idea Feature space partitions serve to “match” the local descriptors within successively wider regions. descriptor space Slide credit: Kristen Grauman
  • 10. Pyramid match: main idea Histogram intersection counts number of possible matches at a given partitioning. Slide credit: Kristen Grauman
  • 11. Computing the partial matching Earth Mover’s Distance [Rubner, Tomasi, Guibas 1998] Hungarian method [Kuhn, 1955] Greedy matching … Pyramid match [Grauman and Darrell, ICCV 2005] for sets with features of dimension
  • 12. Recognition on the ETH-80 Complexity Kernel Match [Wallraven et al.] Pyramid match Recognition accuracy (%) Testing time (s) Mean number of features per set (m) Mean number of features per set (m) Slide credit: Kristen Grauman
  • 13. Pyramid match kernel: examples ofextensions and applications by other groups Spatial Pyramid Match Kernel Lazebnik, Schmid, Ponce, 2006. Dual-space Pyramid Matching Hu et al., 2007. Representing Shape with a Pyramid Kernel Bosch & Zisserman, 2007. Scene recognition Shape representation Medical image classification Slide credit: Kristen Grauman L=1 L=2 L=0
  • 14. Pyramid match kernel: examples of extensions and applications by other groups wave sit down From Omnidirectional Images to Hierarchical Localization, Murillo et al. 2007. Single View Human Action Recognition using Key Pose Matching, Lv & Nevatia, 2007. Spatio-temporal Pyramid Matching for Sports Videos, Choi et al., 2008. Action recognition Video indexing Robot localization Slide : Kristen Grauman
  • 15. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors
  • 16.
  • 17. Number of existing techniques for metric learningdissimilar similar [Weinberger et al. 2004, Hertz et al. 2004, Frome et al. 2007, Varma & Ray 2007, Kumar et al. 2007]
  • 18. Problem-specific knowledge Example sources of similarity constraints Fully labeled image databases Partially labeled image databases
  • 19.
  • 20. Quantize each projection with few bits101 0 Descriptor in high D space 1 0 1 1 0
  • 21. Fast Image Search for Learned MetricsJain, Kulis, & Grauman, CVPR 2008 Learn a Malhanobis metric for LSH h( ) = h( ) h( ) ≠h( ) Less likely to split pairs like those with similarity constraint More likely to split pairs like those with dissimilarity constraint Slide : Kristen Grauman
  • 22.
  • 23. Categorize scene based on nearest exemplars
  • 24. Base metric: Ling & Soatto’s Proximity Distribution Kernel (PDK)slower search faster search 30% of data 2% of data Slide : Kristen Grauman
  • 25.
  • 26. Categorize scene based on nearest exemplars
  • 27. Base metric: Ling & Soatto’s Proximity Distribution Kernel (PDK)slower search faster search 30% of data 2% of data Slide : Kristen Grauman
  • 28. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors
  • 29. Semantic Hashing [Salakhutdinov & Hinton, 2007] for text documents Query Image Semantic HashFunction Address Space Binary code Images in database Query address Semantically similar images Quite differentto a (conventional)randomizing hash
  • 30. Exploring different choices of semantic hash function Torralba, Fergus, Weiss, CVPR 2008 <10μs Binary code Image 1 3. RBM 2. BoostSSC 1.LSH Retrieved images <1ms Semantic Hash Query Image Gist descriptor Compute Gist ~1ms (in Matlab)
  • 31.
  • 32. LabelMe retrieval comparison 32-bit learned codes do as well as 512-dim real-valued input descriptor Learning methods outperform LSH % of 50 true neighbors in retrieval set 0 2,000 10,000 20,0000 Size of retrieval set
  • 33.
  • 34. Jain, Kulis, & Grauman, CVPR 2008: Learn Malhanobis distance for LSH.
  • 35. Torralba, Fergus, Weiss, CVPR 2008: Directly learn mapping from image to binary code.
  • 36. Use Hamming distance (binary codes) for speed
  • 37. Learning metric really helps over plain LSH
  • 38. Learning only applied to metric, not representation

Editor's Notes

  1. Goal – match image to others in order to name locationMake point of high dimensions here.Smaller gap with ML because also serves as dim reduction, so need fewer bits. Here num bits is the same for both. PDK is higher d than PMK. So we didn’t observe this gap discrepancy for ML PMK hash because num bits used was enough for both.To evaluate our learned hash functions when the base kernel is theproximity distribution kernel (PDK), we performed experiments with a dataset of 5400 images of 18 different tourist attractions from the photo-sharing site Flickr. We took three cities inEurope that have major tourist attractions: Rome, London, and Paris. The tourist sites for eachcity were taken from the top attractions in www.TripAdvisor.com under the headings Religioussite, Architectural building, Historic site, Opera, Museum, and Theater. Overall, the list yielded18 classes: eight from Rome, five from London, and five from Paris. The classes are: Arc deTriomphe, Basilica San Pietro, Castel SantAngelo, Colosseum, Eiffel Tower, Globe Theatre,Hotel des Invalides, House of Parliament, Louvre, Notre Dame Cathedral, Pantheon, PiazzaCampidoglio, Roman Forum, Santa Maria Maggiore, Spanish Steps, St. Pauls Cathedral, Tower
  2. Left side – 30% of data searched. Right side – 2-3% searched