Memcached

MEMCACHED
Sam Warmuth & Dex Delfrate

11/12/09

THE PROBLEM

• MySQL is great for small/medium usage, but can’t scale to today’s demands

• Some things can be done for reads, but writes don’t scale

HOW DO WE FIX IT?
• Scale MySQL

• Better indexes

• Buy bigger HDs

• Buy faster CPUs

• One master for writes, several replicas for reads

OR
• Cache the queries

CACHING

• Querying the database on every request is overkill

• Data might be loaded thousands of times between changes

CACHING

• Every time you make a query that might be repeated, save it in RAM

• Next time someone asks for that info, use the cached version

HOW MUCH FASTER IS RAM?

RAM: 83 ns
Hard disk: 13.7 ms

RAM is about 165,000 times faster than disk access
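
The 165,000 figure follows directly from the two access times on the slide. A quick check of the arithmetic, with both values converted to nanoseconds (the constant names are just for illustration):

```python
# Access times quoted on the slide, both expressed in nanoseconds.
RAM_ACCESS_NS = 83
DISK_ACCESS_NS = 13_700_000  # 13.7 ms

ratio = DISK_ACCESS_NS / RAM_ACCESS_NS
print(f"Disk access is about {ratio:,.0f} times slower than RAM")
```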

WHAT IS MEMCACHED?

• Sits in RAM

• Dumb caching layer

• One gigantic hash table

BASIC LAYOUT
User Requests

Application Framework

Memcached
Server

MySQL/Postgres
Server

HASHING

• How Hashing Works

• hash(X)=2
• hash(Y)=1
• hash(Z)=4

Hash Index
0
1  Y
2  X
3
4  Z
5

MULTIPLE SERVERS

• Modulo Division

• Client calculates hash, runs modulo to figure out which server

• 3 servers, hash value 4 % 3 = 1 (2nd server)

Hash Index
0      Server 1
1  Y   Server 2
2  X   Server 3
3      Server 1
4  Z   Server 2
5      Server 3
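
A minimal sketch of the client-side selection described on this slide. The choice of CRC32 as the hash function and the server names are assumptions for illustration; real memcached clients may hash differently.

```python
import zlib


def pick_server(key: str, servers: list[str]) -> str:
    """Hash the key on the client, then use modulo to choose a server."""
    hash_value = zlib.crc32(key.encode("utf-8"))  # assumed hash function
    return servers[hash_value % len(servers)]


servers = ["Server 1", "Server 2", "Server 3"]

# The slide's example: hash value 4 with 3 servers gives index 1,
# which is the second server in the list.
print(servers[4 % 3])

# The same key always maps to the same server while the list is unchanged.
print(pick_server("user:42", servers))
```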

What happens when you want to add a server or a server goes down?

WHEN A SERVER GOES DOWN

• Accesses of Server 2’s data will miss every time.

Index
0      Server 1
1  Y   Server 2
2  X   Server 3
3      Server 1
4  Z   Server 2
5      Server 3
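
Dropping the dead server from the list and taking the modulo over the remaining two does not help much either. The sketch below, which assumes hash values are spread evenly and uses an illustrative helper name, counts how many hash values change servers when Server 2 is removed.

```python
def server_for(hash_value: int, servers: list[str]) -> str:
    """Modulo placement, as on the previous slide."""
    return servers[hash_value % len(servers)]


before = ["Server 1", "Server 2", "Server 3"]
after = ["Server 1", "Server 3"]  # Server 2 has been removed

hashes = range(6000)  # stand-in for evenly distributed hash values

# Keys that lived on the failed server (unavoidably lost).
lost = sum(1 for h in hashes if server_for(h, before) == "Server 2")
# Keys whose server changes once the modulus drops from 3 to 2.
moved = sum(1 for h in hashes if server_for(h, before) != server_for(h, after))

print(f"on failed server: {lost / len(hashes):.0%}")
print(f"remapped overall: {moved / len(hashes):.0%}")
```

Only a third of the keys were on the failed server, yet two thirds end up on a different server, so many entries that were still cached become misses as well.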

CONSISTENT HASHING

• Hash the servers

• For each lookup, ﬁnd the server with the nearest hash

4 Servers: A, B, C, D
h(A) = 11
h(B) = 5
h(C) = 8
h(D) = 3

[Figure: hash ring with positions 0 to 11; server A sits at 11, D at 3, B at 5, and C at 8]

IF A SERVER GOES DOWN
[Figure: the hash ring with servers A, D, C, and B]
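
A minimal sketch of the lookup on the 12-position ring, using the server hashes from the slides. It assumes "nearest" means the first server at or after the key's position, wrapping around at the end of the ring; the function name is illustrative.

```python
import bisect

RING_SIZE = 12
# Server positions from the slides: h(A) = 11, h(B) = 5, h(C) = 8, h(D) = 3
ring = {11: "A", 5: "B", 8: "C", 3: "D"}


def lookup(key_hash: int, ring: dict[int, str]) -> str:
    """Return the first server at or after the key's ring position."""
    positions = sorted(ring)
    i = bisect.bisect_left(positions, key_hash % RING_SIZE)
    return ring[positions[i % len(positions)]]  # wrap past the last server


before = {k: lookup(k, ring) for k in range(RING_SIZE)}

# Take server B down and repeat every lookup.
without_b = {pos: name for pos, name in ring.items() if name != "B"}
after = {k: lookup(k, without_b) for k in range(RING_SIZE)}

changed = [k for k in range(RING_SIZE) if before[k] != after[k]]
print(changed)  # only the positions that B used to serve
```

Under this assumption, only the keys that B served move (to C, the next server on the ring); every other key stays where it was.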

CONSISTENT HASHING
• Pros

• If one server goes down, all of its requests move on to the next server

• Seamlessly add new servers

• Cons

• Hashes distribute mostly evenly, but could cluster

• Solution: split servers into multiple sub-server nodes

[Figure: hash ring with positions 0 to 11; each server is split into two nodes (A1, A2, B1, B2, C1, C2, D1, D2) scattered around the ring]
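
A rough sketch of the sub-server idea: each server is hashed several times under different labels, so it owns several small arcs of the ring instead of one large one. The MD5-based hash, the label scheme ("A1", "A2", ...), and the function names are assumptions made for this example.

```python
import bisect
import hashlib
from collections import Counter


def ring_hash(label: str) -> int:
    """Map a string to a ring position (assumed: first 4 bytes of MD5)."""
    return int.from_bytes(hashlib.md5(label.encode("utf-8")).digest()[:4], "big")


def build_ring(servers: list[str], nodes_per_server: int) -> list[tuple[int, str]]:
    """Place nodes_per_server nodes for each server on the ring, sorted by position."""
    return sorted(
        (ring_hash(f"{server}{i}"), server)
        for server in servers
        for i in range(1, nodes_per_server + 1)
    )


def lookup(ring: list[tuple[int, str]], key: str) -> str:
    """Find the first node at or after the key's position, wrapping around."""
    positions = [pos for pos, _ in ring]
    i = bisect.bisect_left(positions, ring_hash(key)) % len(ring)
    return ring[i][1]


servers = ["A", "B", "C", "D"]
ring = build_ring(servers, nodes_per_server=2)  # A1, A2, B1, B2, ...

# Count how many of 10,000 sample keys land on each server.
load = Counter(lookup(ring, f"key:{n}") for n in range(10_000))
print(load)
```

Raising nodes_per_server generally evens out the per-server counts, at the cost of a larger ring to search.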

SCALING

• Not only is memcached fast, it scales horizontally.

WHAT IS SCALING?

There are two kinds of scaling:

Vertical & Horizontal

VERTICAL SCALING

• “Traditional” scaling
• Transitions are very complicated
• The old machine is useless afterward.
• There’s a cap on how big you can go.


User Requests

Router

Server A

HORIZONTAL SCALING

• More servers create more capacity.

• Transparent to the application

• No single point of failure




User Requests

Router

Server A     Server B

USING MEMCACHED

v = myDB.query( DB query )

USING MEMCACHED

v = memcachedClient.get(key)
if (v == null)
    v = myDB.query( DB query )
    memcachedClient.set(key, v)
end
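
The pseudocode above can be sketched as runnable Python. DictCache and CountingDB are hypothetical stand-ins written for this example (they are not a real memcached client or database driver); only the get/set/query call pattern comes from the slide.

```python
class DictCache:
    """Hypothetical in-process stand-in for a memcached client."""

    def __init__(self):
        self._data = {}

    def get(self, key):
        return self._data.get(key)  # None signals a cache miss

    def set(self, key, value):
        self._data[key] = value


class CountingDB:
    """Hypothetical database stand-in that counts how often it is queried."""

    def __init__(self, rows):
        self._rows = dict(rows)
        self.queries = 0

    def query(self, key):
        self.queries += 1
        return self._rows.get(key)


def cached_query(cache, db, key):
    """Try the cache first; on a miss, query the database and store the result."""
    v = cache.get(key)
    if v is None:
        v = db.query(key)
        if v is not None:
            cache.set(key, v)
    return v


cache = DictCache()
db = CountingDB({"user:1": "Sam"})

first = cached_query(cache, db, "user:1")   # miss: goes to the database
second = cached_query(cache, db, "user:1")  # hit: served from the cache
print(first, second, db.queries)
```

The database is queried once even though the value is read twice; every later read of the same key is served from the cache.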

WRITING DATA

• When updating data, either update or delete the cached entry

• If deleted, it’ll be cached again on the next request
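
A minimal sketch of the delete-on-write option, using plain dictionaries as stand-ins for both the database and the cache. The function names are illustrative, not part of any memcached client.

```python
def read(cache: dict, db: dict, key: str):
    """Serve from the cache, filling it from the database on a miss."""
    if key not in cache:
        cache[key] = db[key]
    return cache[key]


def write(cache: dict, db: dict, key: str, value) -> None:
    """Update the database, then drop the stale cache entry."""
    db[key] = value
    cache.pop(key, None)  # the next read will repopulate it


db = {"user:1": "old name"}
cache = {}

print(read(cache, db, "user:1"))   # fills the cache with "old name"
write(cache, db, "user:1", "new name")
print("user:1" in cache)           # the stale entry is gone
print(read(cache, db, "user:1"))   # re-cached with the new value
```

Deleting rather than updating keeps the write path simple: the read path already knows how to rebuild the entry.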

WHO USES IT?

• 350 million active users

• Avg. user is on 55 minutes/day

• Very connected data

• Everyone loads everyone else’s data all the time

• 1750 requests per second

RESULTS

• 95% cache hit rate

• DB queries take ~5 ms, memcached hits ~0.5 ms
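
These numbers can be combined into an estimated average read time. The sketch assumes that a miss costs one memcached lookup plus one DB query; the slide does not state how misses are counted, so treat the result as a rough estimate.

```python
HIT_RATE = 0.95
CACHE_MS = 0.5  # memcached hit, from the slide
DB_MS = 5.0     # DB query, from the slide

# Assumption: a miss pays for the failed cache lookup and then the DB query.
miss_ms = CACHE_MS + DB_MS
average_ms = HIT_RATE * CACHE_MS + (1 - HIT_RATE) * miss_ms

print(f"average read: {average_ms:.2f} ms")
print(f"speedup over DB only: {DB_MS / average_ms:.1f}x")
```

Under that assumption the average read takes about 0.75 ms, compared with 5 ms when every request goes to the database.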

WHO USES IT?

• More than 100 million videos, 1 billion views/day


• Memcached handles all thumbnails, descriptions, etc.

WHO USES IT?

• Over 3 million articles, 7 billion page views/month

• Saves every version of articles in the database, so cache works great

• Memcached split over hundreds of servers.

REFERENCES

• http://memcached.org/

• http://download.tangent.org/talks/Memcached%20Study.pdf

• http://code.google.com/p/memcached/wiki/Start

• http://en.wikipedia.org/wiki/Memcached

• http://adam.blog.heroku.com/past/2009/7/6/sql_databases_dont_scale/
