5. Literature
Google search engine:
“Learning Fine-grained Image Similarity with Deep Ranking” (2014)
Visual search and recommendation system from Flipkart, India’s largest e-commerce company:
“Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce” (2017)
6. Main idea for image search
Images are projected into a Euclidean embedding space, such that the more similar two images are, the smaller the distance between them in that space.
Our goal is to learn an embedding function f(.) that assigns smaller distances to more similar pairs:
Consider three images pᵢ, pᵢ⁺ and pᵢ⁻. If pᵢ and pᵢ⁺ are more similar than pᵢ and pᵢ⁻, then

‖f(pᵢ) − f(pᵢ⁺)‖ < ‖f(pᵢ) − f(pᵢ⁻)‖
Consider a sample of triplets (pᵢ, pᵢ⁺, pᵢ⁻); the function f(.) can be learned by minimizing the following loss function:

max(0, ‖f(pᵢ) − f(pᵢ⁺)‖ − ‖f(pᵢ) − f(pᵢ⁻)‖)
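The loss above can be sketched directly in NumPy. This is a minimal illustration, not the slides' implementation; note that in practice a small margin term is often added inside the hinge, omitted here to match the formula above.

```python
import numpy as np

def triplet_loss(f_q, f_pos, f_neg):
    """Hinge loss on one triplet of embedding vectors.

    f_q, f_pos, f_neg: embeddings of the query, positive and
    negative images. The loss is zero when the positive is already
    closer to the query than the negative.
    """
    d_pos = np.linalg.norm(f_q - f_pos)  # distance to the similar image
    d_neg = np.linalg.norm(f_q - f_neg)  # distance to the dissimilar image
    return max(0.0, d_pos - d_neg)

# A well-ranked triplet (positive closer than negative) gives zero loss:
q = np.array([0.0, 0.0])
pos = np.array([0.1, 0.0])
neg = np.array([1.0, 0.0])
print(triplet_loss(q, pos, neg))  # 0.0
```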
7. CNN architecture
The triplet sampling characterizes the relative similarity relationship between three images.
Query image pᵢ, positive image pᵢ⁺ and negative image pᵢ⁻ are fed independently into three identical deep neural networks.
The ranking layer evaluates the loss of the triplet. It has no parameters.
The network parameters are learned with the classical backpropagation algorithm so as to minimize the ranking loss function.
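The "three identical networks" setup can be sketched as follows. This is a toy NumPy sketch: a single linear layer stands in for the deep network, but the key point survives, namely that one shared set of weights serves all three branches, and that the ranking layer itself holds no parameters.

```python
import numpy as np

rng = np.random.default_rng(0)

# One set of weights shared by the three branches: feeding the query,
# positive and negative images through the *same* function f is what
# "three identical networks" amounts to in practice.
W = rng.standard_normal((4, 2))  # toy linear embedding, 4-d input -> 2-d

def f(x):
    """Shared embedding branch (stand-in for the deep network)."""
    return x @ W

def ranking_layer(e_q, e_pos, e_neg):
    """Parameter-free ranking layer: it only evaluates the triplet loss."""
    return max(0.0, np.linalg.norm(e_q - e_pos) - np.linalg.norm(e_q - e_neg))

p, p_pos, p_neg = rng.standard_normal((3, 4))
loss = ranking_layer(f(p), f(p_pos), f(p_neg))
```

During training, the gradient of `loss` flows back into the single shared `W`, so all three branches stay identical.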
8. CNN architecture
16-layer VGG net: captures abstract, high-level features of the input image
Shallow conv layers 1 and 2: capture fine-grained details of the input image
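The deep and shallow pathways have to be fused into a single embedding. The slides do not spell out the fusion step, so the sketch below assumes one common scheme: l2-normalize each pathway before concatenating, then renormalize, so that no single pathway dominates the distances.

```python
import numpy as np

def fuse(deep, shallow1, shallow2):
    """Fuse the deep pathway with the two shallow conv pathways.

    Each vector is l2-normalized before concatenation so the
    pathways contribute on a comparable scale (an assumed scheme,
    not necessarily the one used in these slides).
    """
    l2 = lambda v: v / np.linalg.norm(v)
    return l2(np.concatenate([l2(deep), l2(shallow1), l2(shallow2)]))

# Toy pathway outputs; real pathway dimensions would be much larger.
embedding = fuse(np.ones(8), np.ones(4), np.ones(4))
```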
9. And with few resources?
[Figure: the three branches output the embeddings f(pᵢ), f(pᵢ⁺) and f(pᵢ⁻)]
Training a full network is computationally expensive.
Pre-trained classification networks can be fine-tuned to address the image search problem:
TensorFlow is well suited to downloading and modifying pre-trained networks.
The following results are computed from the pre-trained Inception-V3 model.
[Diagram: a new layer of ReLU neurons added on top of the bottleneck; only one step of backpropagation]
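The cheap fine-tuning idea — keep the pre-trained network frozen and update only a small new top layer — can be sketched as follows. This is a toy NumPy sketch with random stand-ins for the 2048-dimensional Inception-V3 bottleneck features, and a single logistic neuron in place of the new layer, purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen bottleneck features (random stand-ins for Inception-V3
# outputs, which are 2048-dimensional) and toy binary labels.
# Only the new top layer's weights w are updated; the rest of the
# network stays fixed.
X = rng.standard_normal((8, 2048))
y = rng.integers(0, 2, size=8).astype(float)

w = np.zeros(2048)                 # weights of the new top layer
p = 1.0 / (1.0 + np.exp(-X @ w))   # forward pass (sigmoid output)
grad = X.T @ (p - y) / len(y)      # logistic-loss gradient w.r.t. w only
w -= 0.1 * grad                    # one step of backpropagation
```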
10. And with few resources?
To check that Inception model bottlenecks can be adapted to visual search, we first look at the nearest images in the bottleneck space:
Results seem promising for images with a white background:
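The nearest-image lookup in bottleneck space is a plain nearest-neighbor search. A minimal NumPy sketch, with tiny 2-d vectors standing in for the real 2048-d bottlenecks:

```python
import numpy as np

def nearest_images(query, catalog, k=5):
    """Indices of the k catalog images whose bottleneck vectors are
    closest to the query's, by Euclidean distance."""
    dists = np.linalg.norm(catalog - query, axis=1)
    return np.argsort(dists)[:k]

# Toy 2-d "bottlenecks"; real Inception-V3 bottlenecks are 2048-d.
catalog = np.array([[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]])
print(nearest_images(np.array([0.9, 0.1]), catalog, k=2))  # [1 0]
```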
11. And with few resources?
To check that Inception model bottlenecks can be adapted to visual search, we first look at the nearest images in the bottleneck space:
Results need to be improved for “real life” pictures:
12. And with few resources?
We use a selection of 18,000 products (2 × (3,000 seats + 3,000 armchairs + 3,000 sofas)) from the Cdiscount catalog.
Inside-category triplet: the query and positive images are pictures of the same product, while the negative image is taken from another product in the same category.
Outside-category triplet: the query and positive images are taken from products of the same category, while the negative image represents a product from another category.
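The two sampling strategies can be sketched as follows. The nested-dict catalog layout (category → product → image paths) is an assumption for illustration, not Cdiscount's actual data format.

```python
import random

def sample_triplet(catalog, mode):
    """Draw one (query, positive, negative) triplet of image paths.

    `catalog` maps category -> {product_id: [image paths]} (an
    assumed layout). `mode` is "inside" or "outside".
    """
    category = random.choice(sorted(catalog))
    product = random.choice(sorted(catalog[category]))
    if mode == "inside":
        # Same product for query/positive; negative from another
        # product of the same category.
        query, positive = random.sample(catalog[category][product], 2)
        other = random.choice([p for p in sorted(catalog[category]) if p != product])
        negative = random.choice(catalog[category][other])
    else:  # "outside"
        # Same category (different products) for query/positive;
        # negative from another category.
        query = random.choice(catalog[category][product])
        other = random.choice([p for p in sorted(catalog[category]) if p != product])
        positive = random.choice(catalog[category][other])
        other_cat = random.choice([c for c in sorted(catalog) if c != category])
        neg_product = random.choice(sorted(catalog[other_cat]))
        negative = random.choice(catalog[other_cat][neg_product])
    return query, positive, negative

catalog = {
    "seat": {"s1": ["seat/s1/a.jpg", "seat/s1/b.jpg"],
             "s2": ["seat/s2/a.jpg", "seat/s2/b.jpg"]},
    "sofa": {"f1": ["sofa/f1/a.jpg", "sofa/f1/b.jpg"],
             "f2": ["sofa/f2/a.jpg", "sofa/f2/b.jpg"]},
}
q, pos, neg = sample_triplet(catalog, "inside")
```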
13. And with few resources?
Lots of “bad” triplets…
…lead to “bad” learning…