The document describes using deep neural networks to find duplicate products in a large database. It discusses implementing a siamese network with contrastive loss to learn image representations and classify product pairs as duplicates. The model is improved through data cleaning with perceptual hashing, hyperparameter tuning, using recent papers' techniques like triplet loss and L2 normalization, and training on more data. While neural networks can solve complex problems, the author notes they still require interpretation and may try to cheat, so more testing is needed.
John Maxwell, Data Scientist, Nordstrom at MLconf Seattle 2017
John Maxwell, a data scientist at Nordstrom, did his graduate work in international development economics, focusing on field experiments. He has since led research projects in Indonesia and Ethiopia related to microenterprise, developed large mathematical simulation models used for investment decisions by WSDOT, built dynamic pricing algorithms at Thriftbooks.com, and led the development of Nordstrom’s open source a/b testing service: Elwin. He currently focuses on contextual multi-armed bandit problems and machine learning infrastructure at Nordstrom.
Abstract summary
Solving the Contextual Multi-Armed Bandit Problem at Nordstrom:
The contextual multi-armed bandit problem, also known as associative reinforcement learning or bandits with side information, is a useful formulation of the multi-armed bandit problem that takes into account information about arms and users when deciding which arm to pull. The barrier to entry for both understanding and implementing contextual multi-armed bandits in production is high. The literature in this field pulls from disparate sources including (but not limited to) classical statistics, reinforcement learning, and information theory. Because of this, finding material that fills the gap between very basic explanations and academic journal articles is challenging. The goal of this talk is to provide those lacking intermediate materials as well as an example implementation. Specifically, I will explain key findings from some of the more cited papers in the contextual bandit literature, discuss the minimum requirements for implementation, and give an overview of a production system for solving contextual multi-armed bandit problems.
This presentation briefly defines machine learning and its types of algorithms. After that, two algorithms are presented: first the naive Bayes classifier for text classification, and then k-means for clustering, including some strategies to improve results.
Computer Vision: Correlation, Convolution, and Gradient - Ahmed Gad
Three important operations in computer vision are explained, with each one implemented in Python.
All three operations are broadly similar in that they follow the same general steps, with some subtle differences; the main difference is the mask being used.
Learn about Hitchhiker Trees from David Greenberg: a new functional, immutable, persistent variation of a fractal tree. In these slides, we'll learn how to understand immutable data structures and a variety of trees, introducing new concepts as we build up to the hitchhiker tree.
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ... - Simplilearn
This presentation on Machine Learning will help you understand what clustering is, K-Means clustering, a flowchart to understand K-Means clustering along with a demo showing clustering of cars into brands, what logistic regression is, the logistic regression curve, the sigmoid function, and a demo on how to classify a tumor as malignant or benign based on its features. Machine Learning algorithms can help computers play chess, perform surgeries, and get smarter and more personal. K-Means and logistic regression are two widely used Machine Learning algorithms which we are going to discuss in this video. Logistic regression is used to estimate discrete values (usually binary values like 0/1) from a set of independent variables. It helps to predict the probability of an event by fitting data to a logit function. It is also called logit regression. K-means clustering is an unsupervised learning algorithm; in this case, you don't have labeled data, unlike in supervised learning. You have a set of data that you want to group into clusters, meaning objects that are similar in nature and characteristics are put together. This is what k-means clustering is all about. Now, let us get started and understand K-Means clustering and logistic regression in detail.
Below topics are explained in this Machine Learning tutorial part 2:
1. Clustering
- What is clustering?
- K-Means clustering
- Flowchart to understand K-Means clustering
- Demo - Clustering of cars based on brands
2. Logistic regression
- What is logistic regression?
- Logistic regression curve & Sigmoid function
- Demo - Classify a tumor as malignant or benign based on features
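The sigmoid function mentioned above maps any real-valued score to a probability between 0 and 1, which is how logistic regression produces binary predictions. A minimal sketch (the weights below are illustrative, not fitted to any data):

```python
import math

def sigmoid(z):
    # Maps any real-valued score to a probability in (0, 1).
    return 1.0 / (1.0 + math.exp(-z))

def predict(x, w, b):
    # A logistic regression prediction is sigmoid(w . x + b);
    # classify as 1 when the probability crosses 0.5.
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if sigmoid(z) >= 0.5 else 0
```

The same decision rule underlies the malignant/benign tumor demo: features go in, a probability comes out, and the 0.5 threshold turns it into a class label.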
About Simplilearn Machine Learning course:
A form of artificial intelligence, Machine Learning is revolutionizing the world of computing as well as all people’s digital interactions. Machine Learning powers such innovative automated technologies as recommendation engines, facial recognition, fraud protection and even self-driving cars. This Machine Learning course prepares engineers, data scientists and other professionals with the knowledge and hands-on skills required for certification and job competency in Machine Learning.
We recommend this Machine Learning training course for the following professionals in particular:
1. Developers aspiring to be a data scientist or Machine Learning engineer
2. Information architects who want to gain expertise in Machine Learning algorithms
3. Analytics professionals who want to work in Machine Learning or artificial intelligence
4. Graduates looking to build a career in data science and Machine Learning
Learn more at: https://www.simplilearn.com/
Encryption and decryption are both methods used to ensure the secure passing of messages and other sensitive documents and information. The encryption process plays a major role in our technologically advanced lives. Encryption basically means converting the message into a coded or scrambled form. Advanced Encryption Standard (AES) is a specification for the encryption of electronic data. It has been adopted by the U.S. government and is now used worldwide. AES is a symmetric-key algorithm, meaning the same key is used for both encrypting and decrypting the data. This paper describes a method to enhance the block and key length of conventional AES.
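The symmetric-key property described above, the same key both encrypts and decrypts, can be illustrated with a toy XOR stream cipher. This is emphatically not AES and offers no real security; production AES should always come from a vetted cryptography library. The sketch only demonstrates the symmetry:

```python
def xor_cipher(data: bytes, key: bytes) -> bytes:
    # XOR each byte with the repeating key; applying the same key twice
    # recovers the plaintext, which is the defining symmetric-key property.
    return bytes(b ^ key[i % len(key)] for i, b in enumerate(data))

ciphertext = xor_cipher(b"secret message", b"k3y")
plaintext = xor_cipher(ciphertext, b"k3y")  # same key, same function, decrypts
```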
Intuitive introduction with easy-to-understand explanation of fundamental concepts in machine learning and neural networks. No prior machine learning or computing experience required.
Working with Fashion Models - PyDataLondon 2016 - Eddie Bell
PyDataLondon 2016 presentation
Fashion is a visual medium, so it makes sense for our models of fashion to include visual features. In this presentation, I'll describe how we've built a general-purpose visual fashion representation using CNNs. The network is multi-task (multiple labels per image), multi-image (multiple images per label) and it runs on multiple GPUs.
I'll visually explore what is going on inside the black box of a neural network and discover how a fashion-specific model sees the world differently from generic visual models. Lastly, I'll demonstrate multi-modal applications of the representation learned by the model.
Dmitry Selivanov, OK.RU. Finding Similar Items in high-dimensional spaces: L... - Mail.ru Group
Dmitry presented Locality Sensitive Hashing, a method for reducing the dimensionality of high-dimensional data. The MinHash algorithm was examined in detail, using the search for similar text documents as an example.
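The MinHash idea from that talk can be sketched minimally in Python: the fraction of signature slots on which two documents agree estimates their Jaccard similarity. The mask-based hash functions and the signature length here are illustrative assumptions, not the talk's exact construction:

```python
import random

def minhash_signature(tokens, num_hashes=64, seed=42):
    # Each random 64-bit mask, XOR'd into Python's built-in hash, acts as a
    # distinct hash function; the signature keeps the minimum value per function.
    rnd = random.Random(seed)
    masks = [rnd.getrandbits(64) for _ in range(num_hashes)]
    return [min(hash(t) ^ m for t in tokens) for m in masks]

def estimated_jaccard(sig_a, sig_b):
    # Fraction of agreeing slots approximates the Jaccard similarity
    # of the underlying token sets.
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)
```

In an LSH pipeline, these fixed-length signatures would then be banded and bucketed so that only likely-similar documents are compared directly.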
Primodels review: basics of fashion modeling - Primodels
Primodels is a distinguished name in the field of scouting model development and its success can easily be understood by its achievements in the sector.
There are various types of models, and each should have their own strengths to get into their desired modelling field. The above slide shows some of the models and various categories in modelling.
Sippin: A Mobile Application Case Study presented at Techfest Louisville - Dawn Yankeelov
"Sippin: A Mobile Application Case Study," was presented at Techfest Louisville 2017 hosted by the Technology Association of Louisville Kentucky on Aug. 16th-17th.
The goal of this report is to present our biometry and security course's project: face recognition on the Labeled Faces in the Wild dataset using convolutional neural networks with the GraphLab framework.
Semi-Supervised Insight Generation from Petabyte Scale Text Data - Tech Triveni
Existing state-of-the-art supervised methods in Machine Learning require large amounts of annotated data to achieve good performance and generalization. However, manually constructing such a training data set with sentiment labels is a labor-intensive and time-consuming task. With the proliferation of data acquisition in domains such as images, text and video, the rate at which we acquire data is greater than the rate at which we can label them. Techniques that reduce the amount of labeled data needed to achieve competitive accuracies are of paramount importance for deploying scalable, data-driven, real-world solutions.
At Envestnet | Yodlee, we have deployed several advanced state-of-the-art Machine Learning solutions that process millions of data points on a daily basis with very stringent service level commitments. A key aspect of our Natural Language Processing solutions is semi-supervised learning (SSL): a family of methods that also make use of unlabeled data for training – typically a small amount of labeled data with a large amount of unlabeled data. Pure supervised solutions fail to exploit the rich syntactic structure of the unlabeled data to improve decision boundaries. There is an abundance of published work in the field, but few papers have succeeded in showing significantly better results than state-of-the-art supervised learning. Often, methods have simplifying assumptions that fail to transfer to real-world scenarios. There is a lack of practical guidelines for deploying effective SSL solutions. We attempt to bridge that gap by sharing our learnings from successful SSL models deployed in production.
Deep Learning: concepts and use cases (October 2018) - Julien SIMON
An introduction to Deep Learning theory
Neurons & Neural Networks
The Training Process
Backpropagation
Optimizers
Common network architectures and use cases
Convolutional Neural Networks
Recurrent Neural Networks
Long Short Term Memory Networks
Generative Adversarial Networks
Getting started
Levelwise PageRank with Loop-Based Dead End Handling Strategy: SHORT REPORT ... - Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
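For reference, the Monolithic PageRank that the report compares against is a power iteration over the whole graph, with dead ends (vertices with no out-edges) redistributing their rank uniformly. A hedged pure-Python sketch; the damping factor and iteration count here are conventional assumptions, not the report's settings:

```python
def pagerank(graph, damping=0.85, iters=50):
    # graph: vertex -> list of out-neighbours.
    nodes = list(graph)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iters):
        # Teleport term shared by every vertex.
        nxt = {v: (1 - damping) / n for v in nodes}
        for v, outs in graph.items():
            if outs:
                share = damping * rank[v] / len(outs)
                for u in outs:
                    nxt[u] += share
            else:
                # Dead end: spread this vertex's rank uniformly,
                # so total rank mass is conserved.
                for u in nodes:
                    nxt[u] += damping * rank[v] / n
        rank = nxt
    return rank
```

Levelwise PageRank runs this same iteration one strongly connected component level at a time, which is why it requires the dead-end handling above as a precondition.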
Opendatabay - Open Data Marketplace.pptx - Opendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay's AI-driven features streamline the data workflow. Finding the data you need shouldn't be complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with dedicated, AI-generated synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2... - pchutichetpong
M Capital Group (“MCG”) expects demand, and the changing shape of supply, to be driven by institutional investment rotating out of offices and into work from home (“WFH”), alongside the ever-expanding need for data storage as global internet usage grows, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
Adjusting primitives for graph: SHORT REPORT / NOTES - Subhajit Sahu
These notes cover primitives for graph algorithms such as PageRank. Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is compact and efficient to traverse.
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
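The CSR representation mentioned in these notes packs every adjacency list into one flat edge array plus a per-vertex offset array. A minimal sketch (the helper names are hypothetical, for illustration only):

```python
def to_csr(adjacency):
    # adjacency: list of neighbour lists, vertex ids 0..n-1.
    # CSR stores all edges contiguously; offsets[v]..offsets[v+1]
    # delimits vertex v's neighbours in the flat edge array.
    offsets, edges = [0], []
    for neighbours in adjacency:
        edges.extend(neighbours)
        offsets.append(len(edges))
    return offsets, edges

def neighbours_of(offsets, edges, v):
    # O(1) slice lookup of a vertex's adjacency list.
    return edges[offsets[v]:offsets[v + 1]]
```

The contiguous layout is what makes CSR friendly to both sequential CPU access and coalesced GPU reads in the experiments above.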
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
9-12. Get some perspective, the lazy way
Step 1: Train a neural network for classification
Step 2: Label one example per class
Step 3: Train a simple model, e.g. an SVM
Step 4: Label samples that confuse the model
Step 5: Repeat steps 3 and 4 until bored
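The lazy-way loop above can be sketched as follows. To keep the sketch dependency-free, a nearest-centroid classifier stands in for the SVM of step 3, the embeddings are assumed to come from the network trained in step 1, and measuring "confusing" as a small margin between the two nearest class centroids is an assumption:

```python
def dist(a, b):
    # Euclidean distance between two embedding vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

def centroids(labeled):
    # labeled: {class_label: [embedding, ...]} -> one mean vector per class
    # (the "simple model" trained on the few labeled examples).
    return {lab: [sum(dim) / len(vecs) for dim in zip(*vecs)]
            for lab, vecs in labeled.items()}

def most_confusing(cents, unlabeled, k=1):
    # Step 4: a small gap between the two closest class centroids means
    # the model is unsure; those samples are worth labeling next.
    def margin(x):
        d = sorted(dist(x, c) for c in cents.values())
        return d[1] - d[0]
    return sorted(unlabeled, key=margin)[:k]
```

Steps 3 and 4 then alternate: retrain the centroids with each newly labeled batch and query the next most-confusing samples, until bored.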
31-35. How to train a neural network for duplicate detection
Step 1: Read a paper on face verification [Chopra05]
Step 2: Implement a siamese network
Step 3: Watch the loss decrease
Step 4: Look at the results
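A siamese network in the style of [Chopra05] is trained with a contrastive loss over pairs of embeddings: duplicates are pulled together, non-duplicates pushed apart until they clear a margin. A minimal NumPy sketch of the loss only (the margin value is an assumption, and the embeddings would come from the shared-weight network):

```python
import numpy as np

def contrastive_loss(emb_a, emb_b, is_duplicate, margin=1.0):
    # emb_a, emb_b: (batch, dim) embeddings from the two siamese branches.
    # is_duplicate: 1.0 for duplicate pairs, 0.0 for non-duplicates.
    d = np.linalg.norm(emb_a - emb_b, axis=1)
    # Duplicates: penalise any distance (squared).
    pos = is_duplicate * d ** 2
    # Non-duplicates: penalise only pairs closer than the margin.
    neg = (1 - is_duplicate) * np.maximum(margin - d, 0.0) ** 2
    return float(np.mean(pos + neg))
```

At inference time, the learned embedding distance is thresholded to classify a product pair as duplicate or not.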
40-41. Cleaning with phash and corroboration
Match visually identical images with phash
High false positives and false negatives
For a pair of products, consider the corroboration between multiple images
47-50. That was an old paper
Step 1: Read two more recent papers [Wang14, Schroff15]
Step 2: Implement a triplet loss network
Step 3: Watch the loss decrease
Step 4: Visualise the detected duplicates
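The triplet loss of [Wang14, Schroff15] compares an anchor against a positive (duplicate) and a negative (non-duplicate), asking that the anchor sit closer to the positive by at least a margin. A NumPy sketch with the L2 normalization mentioned in the summary (the margin value is an assumption):

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    # All inputs: (batch, dim) embeddings. L2-normalise so distances
    # live on the unit hypersphere, as in FaceNet [Schroff15].
    def l2n(x):
        return x / np.linalg.norm(x, axis=1, keepdims=True)
    a, p, n = l2n(anchor), l2n(positive), l2n(negative)
    d_ap = np.sum((a - p) ** 2, axis=1)  # anchor-positive distance
    d_an = np.sum((a - n) ** 2, axis=1)  # anchor-negative distance
    # Hinge: only triplets that violate the margin contribute.
    return float(np.mean(np.maximum(d_ap - d_an + margin, 0.0)))
```

Unlike the pairwise contrastive loss, the triplet form optimises relative ordering directly, which tends to produce embeddings better suited to ranking candidate duplicates.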
92. References
S. Chopra, R. Hadsell, and Y. LeCun. Learning a Similarity Metric Discriminatively, with Application to Face Verification. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), vol. 1, pp. 539-546. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1467314
Jiang Wang, Yang Song, Thomas Leung, Chuck Rosenberg, Jingbin Wang, James Philbin, Bo Chen, and Ying Wu. Learning Fine-grained Image Similarity with Deep Ranking. 2014. http://arxiv.org/abs/1404.4661v1
Florian Schroff, Dmitry Kalenichenko, and James Philbin. FaceNet: A Unified Embedding for Face Recognition and Clustering. 2015. http://arxiv.org/abs/1503.03832v3