Large-Scale Visual Recognition Tools and Techniques

•

1 like•351 views

This document summarizes a tutorial on large-scale visual recognition. The tutorial aims to provide tools for handling large datasets, including scalable image representations like VLAD and Fisher Vectors, and efficient matching and learning techniques. It also shows how large-scale retrieval and classification are converging, with retrieval becoming more machine learning-based and classification more cost-aware. Finally, it demonstrates that large-scale visual recognition does not require huge resources, with examples of searching 100 million images in 250ms and training the 2010 ImageNet challenge in days on a single server.

Engineering

Large-scale visual recognition
Conclusion
Florent Perronnin, XRCE
Hervé Jégou, INRIA
CVPR tutorial
June 16, 2012

Goals of this tutorial
Provide tools to handle large-scale datasets:
!! image representations: scaling the BOV, including higher order statistics (VLAD, FV)
!! scalable matching/learning: compression, approx. search, SGD, explicit embedding

Viewers also liked

6 large-scale-learning.pptxmustafa sarac

AWS essentials EC2mustafa sarac

Lecture 05 gerard medioni - tensor voting: fundamentals and recent progressmustafa sarac

Scaling to Millions of Simultaneous Connections by Rick Reed from WhatsAppmustafa sarac

Lecture 01 frank dellaert - 3 d reconstruction and mapping: a factor graph ...mustafa sarac

1 introduction.pptxmustafa sarac

Big data & advanced analytics in Telecom: A multi-billion-dollar revenue oppo...mustafa sarac

Banking and fintechmustafa sarac

Viewers also liked (8)

6 large-scale-learning.pptx

AWS essentials EC2

Lecture 05 gerard medioni - tensor voting: fundamentals and recent progress

Scaling to Millions of Simultaneous Connections by Rick Reed from WhatsApp

Lecture 01 frank dellaert - 3 d reconstruction and mapping: a factor graph ...

1 introduction.pptx

Big data & advanced analytics in Telecom: A multi-billion-dollar revenue oppo...

Banking and fintech

Similar to Large-Scale Visual Recognition Tools and Techniques

Fast Distributed Online Classification DataWorks Summit/Hadoop Summit

4 new-patch-agggregation.pptxmustafa sarac

Technical computing in JuliaJiahao Chen

Fast Distributed Online ClassificationPrasad Chalasani

3 bagofwords.pptmustafa sarac

Pose Extraction for Real-Time Workout Assistant: Milestone 3Zachary Christmas

Exploring French Job Ads, Lynn ChernyPôle Systematic Paris-Region

19th Athens Big Data Meetup - 2nd Talk - NLP: From news recommendation to wor...Athens Big Data

Compact and Distinctive Visual Vocabularies for Efficient Multimedia Data Ind...Symeon Papadopoulos

Tesl Ontario 2010 DMPTJohn Allan

C-ing the FutureWayne Hodgins

Adobe Air Application case study - nycoders.org 0509Andrew Hunt

Webinar: Deep Learning with H2OSri Ambati

Json tutorialMohammed Bilal

Scalable Learning Technologies for Big Data MiningGerard de Melo

Imago demo day storyboardFederico Arboleda

[212]big models without big data using domain specific deep networks in data-...NAVER D2

Teacher Workshop September 2011 Using Moodle Part 2networkcoordination

CVPR2022 paper reading - Balanced multimodal learning - All Japan Computer Vi...Antonio Tejero de Pablos

Video+Language: From Classification to DescriptionGoergen Institute for Data Science

Similar to Large-Scale Visual Recognition Tools and Techniques (20)

Fast Distributed Online Classification

4 new-patch-agggregation.pptx

Technical computing in Julia

Fast Distributed Online Classification

3 bagofwords.ppt

Pose Extraction for Real-Time Workout Assistant: Milestone 3

Exploring French Job Ads, Lynn Cherny

19th Athens Big Data Meetup - 2nd Talk - NLP: From news recommendation to wor...

Compact and Distinctive Visual Vocabularies for Efficient Multimedia Data Ind...

Tesl Ontario 2010 DMPT

C-ing the Future

Adobe Air Application case study - nycoders.org 0509

Webinar: Deep Learning with H2O

Json tutorial

Scalable Learning Technologies for Big Data Mining

Imago demo day storyboard

[212]big models without big data using domain specific deep networks in data-...

Teacher Workshop September 2011 Using Moodle Part 2

CVPR2022 paper reading - Balanced multimodal learning - All Japan Computer Vi...

Video+Language: From Classification to Description

Recently uploaded

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis

What are the advantages and disadvantages of membrane structures.pptxwendy cai

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat

IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst

HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat

9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Low Rate Call Girls In Saket, Delhi NCR

IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

Biology for Computer Engineers Course Handout.pptxDeepakSakkari2

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal

College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

Introduction and different types of Ethernet.pptxupamatechverse

GDSC ASEB Gen AI study jams presentationGDSCAESB

Recently uploaded (20)

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts

247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...

What are the advantages and disadvantages of membrane structures.pptx

(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service

IVE Industry Focused Event - Defence Sector 2024

HARMONY IN THE NATURE AND EXISTENCE - Unit-IV

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...

9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf

IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts

Biology for Computer Engineers Course Handout.pptx

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts

(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...

College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik

Introduction and different types of Ethernet.pptx

GDSC ASEB Gen AI study jams presentation

Large-Scale Visual Recognition Tools and Techniques

1. Large-scale visual recognition Conclusion Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012

2. Goals of this tutorial Provide tools to handle large-scale datasets: !! image representations: scaling the BOV, including higher order statistics (VLAD, FV) !! scalable matching/learning: compression, approx. search, SGD, explicit embedding

3. Goals of this tutorial Provide tools to handle large-scale datasets: !! image representations: scaling the BOV, including higher order statistics (VLAD, FV) !! scalable matching/learning: compression, approx. search, SGD, explicit embedding Show convergence of large-scale retrieval and classification: !! retrieval: more and more machine learning !! classification: more and more cost aware

4. Goals of this tutorial Provide tools to handle large-scale datasets: !! image representations: scaling the BOV, including higher order statistics (VLAD, FV) !! scalable matching/learning: compression, approx. search, SGD, explicit embedding Show convergence of large-scale retrieval and classification: !! retrieval: more and more machine learning !! classification: more and more cost aware Show that LSVR does not necessarily require gigantic resources: !! searching in 100M images in 250ms on a single processor !! train from scratch ILSVRC 2010 in a few days on a single server

Large-Scale Visual Recognition Tools and Techniques

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (8)

Similar to Large-Scale Visual Recognition Tools and Techniques

Similar to Large-Scale Visual Recognition Tools and Techniques (20)

More from mustafa sarac

More from mustafa sarac (20)

Recently uploaded

Recently uploaded (20)

Large-Scale Visual Recognition Tools and Techniques