Personalised Recommendations in E-Commerce

THE WORLD’S
LEADING ONLINE
HEALTH & BEAUTY
DESTINATION

BUSINESS MODEL
300M+ LIFESTYLE CONSUMERS GLOBALLY

PROPRIETARY DATA
TECHNOLOGY PLATFORM

HEALTH

BEAUTY

HIGH REPEAT,
HIGH MARGIN & LOW RETURNS

CUSTOMER METRICS

5

MILLION CUSTOMERS

2.5

MILLION SHIPPED IN
LAST 12 MONTHS

140

MILLION VISITS PER
ANNUM

8

MILLION ORDERS
PER ANNUM

TALENT

25
30
75
280

DATA/TECHNOLOGY
PERSONNEL

AVERAGE AGE OF
EMPLOYEES

AVERAGE AGE OF
DIVISIONAL HEADS

# OF GRADS HIRED
IN LTM

SCALABILITY:
VISITS

ORDERS

AVERAGE ORDER
VALUE

400

22
50

42

46

2009

21

8

140

2

2013E

2017E

Visits (Millions)

2009

2013E

2017E

Orders (Millions)

2009

2013E

AOV (£)

2017E

SCALABILITY: HOW
INVESTMENT IN
TALENT

INVESTMENT IN
CAPACITY
22

350

£40M
INVESTMENT
155

8

2

33

2009

2013E

2017E

Tech Headcount

2009

2013E

2017E

Orders (Millions)

Personalised
Recommendations for
E-Commerce
Wing Yung Chan

Applications of Computing
in Industry

Recommendation
=
“Suggestion to do
something”
INTRODUCTION

Watch a film
Read a book
Buy a drink
Visit a museum
INTRODUCTION

Types of
recommendations…

INTRODUCTION

Restaurant Waiter:
•
•
•
•

Special of the day
My personal favourites
Bestsellers
Do you like Chicken?
INTRODUCTION

Clothes Store Attendant:
•
•
•
•

New-in/Seasonal Highlights
Special Offers/Discounts
Bestsellers
Are you looking for anything in
particular?
INTRODUCTION

Quick Taxonomy
Business self-interest:
New-in, specials,…
General preference:
Bestsellers
More personal:
Search
INTRODUCTION

None of these are personal
except search.

But search requires effort
and a way to articulate what
you need/want.
INTRODUCTION

Solution:
Add in assumptions about
Personal Preference to
create Personalised
Recommendations
INTRODUCTION

Assumption #1
You are like your friends.

INTRODUCTION

Assumption #2
You are like people who do
similar things to you.

INTRODUCTION

Assumption #3
You like things that are similar to things
you already do.

INTRODUCTION

Assumption #4
You are influenced by experts or the
experiences of others.

INTRODUCTION

How can we program this?

INTRODUCTION

Problem Statement
An entry is filled with the number of times a user has bought an
item.
It could also be the rating given to an item.
ITEMS

1
A

2

3

5

1

6

1

1

B
USERS

4

C

1

D

1

E
F

1

1

1

PROBLEM STATEMENT

Problem Statement
There are usually too many items and/or too many users.
The matrix is large but very sparse. It’s also incomplete.

ITEMS

1
A

2

3

5

1

6

1

1

B
USERS

4

C

1

D

1

E
F

1

1

1

PROBLEM STATEMENT

Audience Check
Before we get into algorithms, just a quick
check:
• Machine Learning
• Singular Value Decomposition/Latent Semantic
Analysis
• Clustering/Community Detection
• Page Rank and Random-Walks
• Item-Based Collaborative Filtering

ALGORITHMS

Machine Learning : A program whose results
improve with data.
Aim: predict a score for each user-product so that
we can pick the best products to recommend to a
user.
Data: The previous interactions between users
and products.

ALGORITHMS

But we need to find products
for users. So we need users
and products to be mapped
to the same space.

ALGORITHMS

Singular Value Decomposition
documents

latent

documents
M

WxP

=

M
terms

terms

M
W

P

M

ALGORITHMS

Now we just replace terms
with users and documents
with products.

ALGORITHMS

Singular Value Decomposition
products

latent

products
M

WxP

=

M

users

users

M
W

P

M

ALGORITHMS

However…
Traditional SVD methods fail when the
matrix is incomplete.

ALGORITHMS

Alternating Least Squares
1. Solve for p, using random values
for q.
2. Then solve for q, using latest
version of p.
3. Repeat to solve for p using latest
version of q, etc.

ALGORITHMS

SGD: usually faster.

ALS: can be easily parallelizable.
These methods give up the
Singular, Orthonormal guarantee of
SVD but work with incomplete data.

ALGORITHMS

Item-based Collaborative Filtering
“Amazon.com recommendations: item-to-item
collaborative filtering”, 2003.
“Customers Who Bought This Also Bought That”

ALGORITHMS

Item-Based CF
1000

2

100

ALGORITHMS

Item-Based CF
A

B

C

D

A

-

3

10

7

B

3

-

5

6

C

10

5

-

6

D

12

5

6

-

A

B

C

D

A

-

0.15

0.50

0.35

B

0.21

-

0.36

0.43

C

0.48

0.24

-

0.29

D

0.52

0.22

0.26

-

SCORE

NSCORE

ALGORITHMS

Item-Based CF Algorithm
We can then recommend products to customers as follows:
For each customer:
For each product bought by the customer:
Find the top N recommended products
Take the top M products, ordered by the sum of the scores
This is efficient: O(U + I) Space
1. We store N recommended products per product
Size = (Items * N)
2. We store the products bought by each customer:
Usually this is a low constant
Size = Users * M (M small)

ALGORITHMS

Item-CF collapses the UserProduct matrix into a ProductProduct graph.
But is this the only way to do it?

ALGORITHMS

Graph-Based
The user-product matrix looks like an adjacency matrix for a bipartite
graph, where entries specify edges. In fact, this makes intuitive sense
too.
A

1

B

2

C

3

D

4

E

5

F

6

ALGORITHMS

Graph-Based
Maybe if B and E are similar, we should be recommending Product 3 to
B and Product 4 to E. This line of thinking is called Neighbourhood
methods.
A

1

B

2

C

3

D

4

E

5

F

6

ALGORITHMS

Graph-Based
We can randomly walk the graph, starting at a User Node and walk an
odd number of steps so we always end up at a Product.

A

1

B

2

C

3

D

4

E

5

F

6

ALGORITHMS

Problems
1. Cold Start (Users): Users may not have bought many products.
2. Cold Start (Products): A Product may not have been bought very
often (or could be new so had no chance to be bought).
This causes a problem for both Latent methods and Neighbourhood
methods.
Possible solutions:
1. Link users to other users using social or demographic data.
2. Link products to other products using taxonomy information like
brand, category, description.

ALGORITHMS

Augmented Graph
Recommending for a new customer G:
A

1

B

2

C

3

D

4

E

5

F

6

G

ALGORITHMS

Augmented Graph
Adding brand connections for 1 and 3.
A

1

B

2

C

3

D

4

E

5

F

6

X

ALGORITHMS

Personalised PageRank
PPR for a given start node S.
If you reach a node u,
Then move to one of the adjacent nodes v with probability (1-a), and
back to S with probability a.
If there are N adjacent nodes, then pick one of them according to the
distribution of weights on the edges.
This can’t be done efficiently using the typical Power method for Global
PageRank because it would require N iterations of the Power method.

ALGORITHMS

This can be approximated
efficiently on a single machine
using DrunkardMob.
See “DrunkardMob: billions of random
walks on just a PC”, by Aapo Kyrola.

ALGORITHMS

Item-Based CF can be approximated by a 2-step
Random Walk starting from each Product.
We can even do User Clustering or Community
Detection by doing Random Walks starting at
users and taking an even number of steps.

ALGORITHMS

Competition Time!
Can you write the best Recommendation Algorithm?
The Hut Group is offering £5,000 and paid Summer
Internships to the best team (up to 5 people) of University
Students who can make the best recommendations.
Dataset:

• 2.2M rows
• 150,000 Customers
• 500 Products

Rec Challenge 2013

Thanks

Go to www.thehutchallenge.com and enter your
e-mail address today!

wingyung.chan@thehutgroup.com
Twitter: @MrWingChan
Rec Challenge 2013

Personalised Recommendations in E-Commerce

Recommended

Recommended

More Related Content

Similar to Personalised Recommendations in E-Commerce

Similar to Personalised Recommendations in E-Commerce (20)

Recently uploaded

Recently uploaded (20)

Personalised Recommendations in E-Commerce

Editor's Notes