12. That doesn’t sound so hard
We don’t know when sessions end
There’s a lot of data
It’s all done in (close to) real time
13. Numbers
200 GB of logs
100 million data points
per day
~300 metrics per data point
= 6000 updates / s at peak
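As a sanity check on those figures, the average ingest rate works out to roughly 1,200 data points per second (the 6,000 updates/s number is the stated peak, not derived here). A quick sketch:

```python
# Back-of-the-envelope rates from the figures above.
data_points_per_day = 100_000_000
metrics_per_point = 300
seconds_per_day = 86_400

avg_points_per_sec = data_points_per_day / seconds_per_day
print(round(avg_points_per_sec))  # ~1157 data points/s on average
```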
14. How we use(d) MongoDB
“Virtual memory” to offload data while we wait
for sessions to finish
Short-term storage (<48 hours) for batch jobs,
replays and manual analysis
Metrics storage
15. Why we use MongoDB
Schemalessness makes things so much easier;
the data we collect changes as we come up
with new ideas
Sharding makes it possible to scale writes
Secondary indexes and rich query language are
great features (for the metrics store)
It’s just… nice
23. 2nd iteration
using scans for two-step assembling
Instead of updating, save each fragment, then
scan over _id to assemble sessions
24. 2nd iteration
using scans for two-step assembling
Outcome: less lock contention, but still poor
performance. We also realised we couldn’t
remove data fast enough
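The two-step scheme can be sketched in a few lines of Python (the session IDs and fragment layout are hypothetical, and a plain list stands in for the collection): fragments whose _id sorts by (session, sequence) come back contiguously, so one ordered scan reassembles every session.

```python
from itertools import groupby

# Hypothetical fragment records: _id encodes (session_id, fragment_no),
# so a scan in _id order yields all fragments of a session contiguously.
fragments = [
    {"_id": ("sess-a", 0), "data": "start"},
    {"_id": ("sess-b", 0), "data": "start"},
    {"_id": ("sess-a", 1), "data": "end"},
]

def assemble(fragments):
    """Step 2: scan in _id order and stitch each session back together."""
    ordered = sorted(fragments, key=lambda f: f["_id"])
    return {
        session_id: [f["data"] for f in group]
        for session_id, group in groupby(ordered, key=lambda f: f["_id"][0])
    }

print(assemble(fragments))  # {'sess-a': ['start', 'end'], 'sess-b': ['start']}
```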
29. 4th iteration
sharding
To get around the global write lock and get
higher write performance we moved to a
sharded cluster.
Outcome: higher write performance, lots of
problems, lots of ops time spent debugging
36. 5th iteration
moving things to separate clusters
We saw very different loads on the shards and
realised we had databases with very different
usage patterns, some of which made
autosharding ineffective. We moved these off
the cluster.
Outcome: a more balanced and stable cluster
40. 6th iteration
monster machines
We ran into new problems removing data and
needed some room to breathe and think
Solution: upgraded the servers to High-
Memory Quadruple Extra Large (with cheese).
43. 7th iteration
partitioning (again) and pre-chunking
We rewrote the database layer to write to a
new database each day, and we created all
chunks in advance. We also decreased the size
of our documents by a lot.
Outcome: no more problems removing data.
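A minimal sketch of both ideas, assuming a metrics_YYYYMMDD naming scheme and a uniform integer key space (both hypothetical; in a real cluster the split points would be fed to the split admin command before writes begin):

```python
from datetime import date

def daily_db_name(day: date, prefix: str = "metrics") -> str:
    # One database per day lets old data be dropped wholesale:
    # dropping a database is far cheaper than deleting documents.
    return f"{prefix}_{day:%Y%m%d}"

def chunk_split_points(n_chunks: int, key_space: int = 2**32) -> list:
    # Pre-computed split points so all chunks exist before the day's
    # writes start, avoiding splits and migrations during peak load.
    step = key_space // n_chunks
    return [i * step for i in range(1, n_chunks)]

print(daily_db_name(date(2011, 9, 1)))  # metrics_20110901
print(chunk_split_points(4))            # [1073741824, 2147483648, 3221225472]
```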
47. 8th iteration
realize when you have the wrong tool
Transient data might not need all the bells and
whistles.
Outcome: Redis gave us 100x performance in
the assembling step
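The assembling step maps naturally onto Redis lists: RPUSH each fragment under a session key, read the whole list back when the session ends, then DELETE. A sketch with an in-memory stand-in for the Redis client (the method names mirror redis-py; the session key is hypothetical):

```python
from collections import defaultdict

class FakeRedis:
    """In-memory stand-in mirroring the redis-py calls used below."""
    def __init__(self):
        self._lists = defaultdict(list)
    def rpush(self, key, value):
        self._lists[key].append(value)
    def lrange(self, key, start, stop):
        stop = None if stop == -1 else stop + 1
        return self._lists[key][start:stop]
    def delete(self, key):
        self._lists.pop(key, None)

r = FakeRedis()

# Fragments arrive one at a time, keyed by session.
r.rpush("session:42", "frag-1")
r.rpush("session:42", "frag-2")

# When the session is declared finished, assemble and clean up.
assembled = r.lrange("session:42", 0, -1)
r.delete("session:42")
print(assembled)  # ['frag-1', 'frag-2']
```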
49. 9th iteration
rinse and repeat
We now have the same scaling issues later in
the chain.
Outcome: Upcoming rewrite to make writes/
updates more efficient
Redis was actually slower
53. Tips
EC2
You have three copies of your data, do you
really need EBS?
Instance store disks are included in the price
and they have predictable performance.
m1.xlarge comes with 1.7 TB of storage.
54. Tips
Avoid bulk inserts
Very dangerous if there’s a possibility of
duplicate key errors
It’s not fixed in 2.0 even though the driver has a
flag for it.
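The hazard is that a bulk insert aborts at the first duplicate key, silently dropping everything after it in the batch. A toy illustration, with a dict standing in for the collection's unique _id index:

```python
class DuplicateKeyError(Exception):
    pass

def bulk_insert(collection: dict, docs: list) -> None:
    # Mimics the hazard: the batch stops at the first duplicate key,
    # so documents after it are never written.
    for doc in docs:
        if doc["_id"] in collection:
            raise DuplicateKeyError(doc["_id"])
        collection[doc["_id"]] = doc

store = {"a": {"_id": "a"}}
try:
    bulk_insert(store, [{"_id": "b"}, {"_id": "a"}, {"_id": "c"}])
except DuplicateKeyError:
    pass
print("c" in store)  # False: "c" was silently lost to the aborted batch
```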
55. Tips
Safe mode
Run every Nth insert in safe mode
This will give you warnings when bad things
happen, like failovers
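One way to sketch the "every Nth insert in safe mode" idea (the helper is hypothetical; with a real driver the boolean would be passed as the insert's safe/write-concern flag):

```python
def safe_every_nth(n: int):
    # Returns a predicate: issue insert i with safe mode (getLastError)
    # only when i is a multiple of n, trading latency for early warning.
    i = 0
    def use_safe() -> bool:
        nonlocal i
        i += 1
        return i % n == 0
    return use_safe

use_safe = safe_every_nth(100)
flags = [use_safe() for _ in range(300)]
print(sum(flags))  # 3 safe-mode inserts out of 300
```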