1) The document describes a personal matching recommendation system called TinderBox that uses face detection, eigenface analysis, and k-nearest-neighbors classification.
2) It explains how the Viola-Jones algorithm detects faces, how eigenface analysis projects face images into vectors, and how k-nearest neighbors serves as the recommendation algorithm.
3) The system collects user samples to build a model and determines suitable matches for each user, based on eigenface similarity and k-nearest-neighbors classification.
2. Overview
• We will cover the Viola-Jones Face Detection Algorithm
• The Eigenface Algorithm and PCA (Principal Component Analysis)
• The K-Nearest Neighbor Classification Algorithm
http://en.wikipedia.org/wiki/K-nearest_neighbors_algorithm
3. Overview 2
• What is TinderBox?
TinderBox is a Tinder-based application!
4. Then, what is Tinder?
• A very simple, very compressed dating system
Tinder is a social dating application.
11. How would it be possible?
• Viola-Jones – Face Detection
• Eigenface – Project Images to a Vector Matrix
• K-Nearest Neighbor – Recommendation Algorithm
12. Viola Jones Face Detection Algorithm
“Robust Real-Time Face Detection”
Paul Viola, Microsoft Research
Michael J. Jones, Mitsubishi Electric Research Laboratories
2001
“Training is slow, but detection is very fast”
13. Viola Jones Face Detection Algorithm
• Uses rectangle filters
f(x): sum of pixel values inside a rectangle (computed in constant time from the ‘integral image’)
x: pixel value (usually in the 0 ~ 255 range, gray scale)
ϑ: threshold (constant)
α: usually 1
β: usually 0 (polarity)
t: index of the t-th classifier
i: index of the pixel
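The integral image mentioned above can be sketched in a few lines of NumPy; the 4x4 image below is made-up toy data, and `rect_sum` shows why any rectangle sum costs only four lookups:

```python
import numpy as np

def integral_image(img):
    """Cumulative sum over rows then columns; entry (y, x) holds the
    sum of all pixels in img[0:y+1, 0:x+1]."""
    return img.cumsum(axis=0).cumsum(axis=1)

def rect_sum(ii, y0, x0, y1, x1):
    """Sum of img[y0:y1, x0:x1] (exclusive ends) via the integral image."""
    total = ii[y1 - 1, x1 - 1]
    if y0 > 0:
        total -= ii[y0 - 1, x1 - 1]
    if x0 > 0:
        total -= ii[y1 - 1, x0 - 1]
    if y0 > 0 and x0 > 0:
        total += ii[y0 - 1, x0 - 1]
    return total

img = np.arange(16, dtype=np.int64).reshape(4, 4)  # toy gray-scale "image"
ii = integral_image(img)
assert rect_sum(ii, 1, 1, 3, 3) == img[1:3, 1:3].sum()
```

A rectangle feature is then just a difference of such sums (e.g. a light region minus a dark region), which is what makes detection fast.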
14. Viola Jones Face Detection Algorithm
• Defining what ‘one’ weak classifier will be
A variety of ht(x)’s
h(x): 0/1 polarity function
b: bias
ϑ: classifier weight
t: index of the t-th classifier
15. Viola Jones Face Detection Algorithm
• Boosting correctness with a cascaded classifier
• In TinderBox, this algorithm is used for cropping faces
“Training is slow, but detection is very fast”
The trained cascade is usually saved as a constant ‘.xml’ file.
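The cascade idea (reject non-face windows early with cheap stages, pass survivors to stronger stages) can be sketched as below; the stage scores and thresholds are toy stand-ins, not the trained values stored in the .xml file:

```python
def cascade_detect(stage_score_fns, stage_thresholds, window):
    """Run cascade stages in order; reject as soon as one stage fails.
    Most non-face windows die in the first (cheapest) stages, which is
    why detection is fast even though training is slow."""
    for score_fn, threshold in zip(stage_score_fns, stage_thresholds):
        if score_fn(window) < threshold:
            return False  # rejected early: not a face
    return True  # survived every stage: treated as a face

# toy stages: each "score" is just the window mean, standing in for a
# boosted sum of weak classifiers
stages = [lambda w: sum(w) / len(w)] * 3
print(cascade_detect(stages, [0.2, 0.4, 0.6], [0.9, 0.8, 0.7]))  # True
print(cascade_detect(stages, [0.2, 0.4, 0.6], [0.1, 0.1, 0.1]))  # False
```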
16. Eigen Face – Convert Images to Vectors
“Face Recognition Using Eigenfaces”
Matthew A. Turk / Alex P. Pentland
Vision and Modeling Group, The Media Laboratory
Massachusetts Institute of Technology
1991, IEEE
17. Eigen Face – Convert Images to Vectors
• Basic Idea
Face recognition is a simple matching problem,
but the N x N-dimensional image space is very high-dimensional.
Let’s map the N x N matrix into a lower-dimensional space.
How can we find a suitable lower-dimensional space?
“Maybe PCA will be a great solution”
PCA: Principal Component Analysis
18. Eigen Face – Image Matrix
(Training Samples)
• Consider an N x K image. Eigenface first converts it into a gray-scale
image, so the N x K matrix’s values lie in the 0 ~ 255 range.
19. Eigen Face – Find Mean Matrix
(Training Samples)
To find the central vector of the sample images, we sum all sample
images pixel by pixel and divide by M (the number of samples):
Ψ = (1/M) Σᵢ Γi
Why? Because that makes it possible to construct the covariance
matrix needed for PCA dimensionality reduction.
Average face vector: Ψ
i-th image vector: Γi
20. Eigen Face – Find Mean Matrix
(Training Samples)
Average face vector: Ψ
i-th image vector: Γi
Now shift each image vector so its deviation is measured from the origin:
ϕi = Γi - Ψ
Now let’s find the covariance matrix C.
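The two steps above (mean face Ψ, then centering ϕi = Γi - Ψ) can be sketched in NumPy; the three flattened "face vectors" below are made-up sample data:

```python
import numpy as np

# M = 3 toy face images, each already flattened to a vector (Gamma_i)
samples = np.array([[100., 120., 130.],
                    [110., 100., 140.],
                    [ 90., 110., 150.]])

psi = samples.mean(axis=0)   # average face vector: Psi = (1/M) * sum(Gamma_i)
phi = samples - psi          # centered vectors: phi_i = Gamma_i - Psi

print(psi)        # [100. 110. 140.]
print(phi.sum())  # 0.0 -- centered vectors sum to zero by construction
```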
21. Eigen Face – Find Mean Matrix
(Training Samples)
Now let’s find the covariance matrix C. With A = [ϕ1 … ϕM], we have C = AAᵀ.
We have to find the eigenvectors ui of C for the projection (because of
what an eigenvector means: they are the directions of maximum variance).
However, the AAᵀ matrix (N² x N²) is too large! (M is the number of training samples.)
Don’t give up!
Let’s consider AᵀA’s eigenvectors vi (M x 1 vectors) instead; AᵀA is only
M x M, so they are cheap to compute.
What is an eigenvector?: http://darkpgmr.tistory.com/105
How to compute it?: http://www.vision.jhu.edu/teaching/vision08/Handouts/case_study_pca1.pdf
22. Eigen Face – Find Mean Matrix
(Training Samples)
Don’t give up!
Consider AᵀA’s eigenvector vi (an M x 1 vector): AᵀA vi = μi vi. This is
computable, since AᵀA is only M x M.
Then? Multiply both sides by A:
AAᵀ(Avi) = μi(Avi), i.e. CAvi = μiAvi (since C = AAᵀ)
This is the same idea that underlies the Singular Value Decomposition.
24. Eigen Face – Find Mean Matrix
(Training Samples)
Then? CAvi = μiAvi (since C = AAᵀ).
Now let’s write Avi = ωi (an anonymous vector ωi).
CAvi = μiAvi then becomes Cωi = μiωi.
Isn’t that exactly an eigenvalue and an eigenvector?
25. Eigen Face – Find Mean Matrix
(Training Samples)
Isn’t that exactly an eigenvalue and an eigenvector? Therefore,
ωi = ui = Avi
“Which means we can get the N² x N² matrix’s eigenvectors (ui)
from the M x M (much smaller) matrix’s eigenvectors (vi).”
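The trick above, lifting the small matrix's eigenvectors to the big one's via ui = Avi, can be verified numerically; the data here is random stand-in data (100 "pixels", M = 5 samples), not real faces:

```python
import numpy as np

rng = np.random.default_rng(0)
N2, M = 100, 5                     # N^2 pixels, M training samples
A = rng.standard_normal((N2, M))   # columns are centered face vectors phi_i

# Small problem: eigenvectors v_i of A^T A (only M x M)
mu, V = np.linalg.eigh(A.T @ A)

# Lift: u_i = A v_i are eigenvectors of the big C = A A^T (N^2 x N^2)
U = A @ V

# Check C u_i = mu_i u_i for the largest eigenpair
C = A @ A.T
i = np.argmax(mu)
assert np.allclose(C @ U[:, i], mu[i] * U[:, i])
```

The check works because C(Avi) = A(AᵀA vi) = A(μi vi) = μi(Avi), exactly the derivation on the slides above.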
27. Eigen Face – Find Mean Matrix
(Training Samples)
“When we get an anonymous image, the user selects the ‘like’ label or the
‘nope’ one. Then we can get a K-dimensional face vector as a labeled
sample feature.”
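Getting that K-dimensional feature is a single projection of the centered image onto the top-K eigenfaces. A sketch with random stand-in data (the eigenface matrix and images here are not real trained values):

```python
import numpy as np

rng = np.random.default_rng(1)
n_pixels, K = 100, 8
eigenfaces = rng.standard_normal((n_pixels, K))  # columns: top-K u_i (stand-ins)
mean_face = rng.standard_normal(n_pixels)        # Psi (stand-in)
new_image = rng.standard_normal(n_pixels)        # cropped face, flattened

# K-dimensional feature: weight of the centered image on each eigenface
feature = eigenfaces.T @ (new_image - mean_face)
print(feature.shape)  # (8,) -- one sample feature, labeled 'like' or 'nope'
```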
28. K-Nearest Neighbor Algorithm
for polarity classification (1 or 0)
• K-Nearest Neighbor Classification (K-NN for short)
“An introduction to kernel and nearest-neighbor
nonparametric regression”
Altman, N. S., Cornell University
1992, The American Statistician
29. K-NN is not K-means
• K-NN is supervised. We decide an unknown sample x’s label by voting
among the labels of its k nearest samples. The distance can be the
Euclidean distance, the absolute difference, the maximum distance, or
the Minkowski distance.
• K-means is unsupervised. We pick k points as initial cluster centers.
Iteratively, each sample is assigned to the cluster whose center is at
the smallest Euclidean distance, and each cluster’s center is recomputed
from its members (until the center coordinates are stable).
It is different from clustering!
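The K-NN voting step described above can be sketched as follows; the 2-D training points and their 0/1 (‘nope’/‘like’) labels are toy data:

```python
import numpy as np

def knn_predict(train_x, train_y, query, k=3):
    """Label the query by majority vote among the labels of its
    k nearest training samples (Euclidean distance)."""
    dists = np.linalg.norm(train_x - query, axis=1)
    nearest = np.argsort(dists)[:k]
    votes = train_y[nearest]
    return int(votes.sum() * 2 > k)  # majority of 0/1 labels

train_x = np.array([[0., 0.], [0., 1.], [1., 0.], [5., 5.], [5., 6.]])
train_y = np.array([1, 1, 1, 0, 0])  # 1 = 'like', 0 = 'nope'
print(knn_predict(train_x, train_y, np.array([0.5, 0.5]), k=3))  # 1
```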
30. K-NN’s distance example
Sources from: http://www-01.ibm.com/support/knowledgecenter/SSLVMB_21.0.0/com.ibm.spss.statistics.help/alg_knn.htm?lang=ko
32. Minkowski distance
• Similar to the Euclidean distance, but generalized: p = 1 gives the
Manhattan distance, p = 2 the Euclidean distance.
The distance of order p is
D(x, y) = (Σᵢ |xi - yi|^p)^(1/p)
As p approaches infinity, we obtain the Chebyshev distance.
Sources from: http://en.wikipedia.org/wiki/Minkowski_distance
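The whole family fits in one function; p = 1 gives Manhattan, p = 2 Euclidean, and p = ∞ Chebyshev (the vectors below are toy values):

```python
import math

def minkowski(x, y, p):
    """Order-p Minkowski distance: (sum |x_i - y_i|^p)^(1/p).
    p = inf degenerates to the Chebyshev (maximum) distance."""
    diffs = [abs(a - b) for a, b in zip(x, y)]
    if math.isinf(p):
        return max(diffs)
    return sum(d ** p for d in diffs) ** (1 / p)

x, y = (0, 0), (3, 4)
print(minkowski(x, y, 1))         # 7.0 (Manhattan)
print(minkowski(x, y, 2))         # 5.0 (Euclidean)
print(minkowski(x, y, math.inf))  # 4   (Chebyshev)
```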