Building a High-Performance Distributed Task Queue on MongoDB

Chapman: Building a
Distributed Job Queue
in MongoDB
Rick Copeland
@rick446
rick@synapp.io

Getting to Know One
Another
Rick

Roadmap
Define the problem
Schema design & operations
Types of tasks
Reducing Polling
•
•
•
•

Requirements
Digital Ocean
Rackspace US
Rackspace UK
SMTP
Server
SMTP
Server
SMTP
Server
SMTP
Server
SMTP
Server
SMTP
Server
App
Server

Requirements
Group
check_smtp
Analyze
Results
Update
Reports
Pipeline

Requirements
MongoDB
(of course)

Basic Ideas
msg
Chapman Insecure
msg
msg
msg
msg
msg
Task
Task
Worker
Process

Job Queue Schema:
Message
{ _id: ObjectId(...),
task_id: ObjectId(...),
slot: 'run',
s: {
status: 'ready',
ts: ISODateTime(...),
q: 'chapman',
pri: 10,
w: '----------',
},
args: Binary(...),
kwargs: Binary(...),
send_args: Binary(...),
send_kwargs: Binary(...)
}
Task method to
be run
Destination
Task
Sched
SynchroMessage
arguments

Job Queue Schema:
TaskState
{ _id: ObjectId(...),
type: 'Group',
parent_id: ObjectId(...),
on_complete: ObjectId(...),
mq: [ObjectId(...), ...],
status: 'pending',
options: {
queue: 'chapman',
priority: 10,
immutable: false,
ignore_result: true,
}
result: Binary(...),
data: {...}
}
Python class
registered for
task
Parent task
(if any)
Message to be
sent on
completion

Message State: Naive
Approach
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message
Reserve
Message
Try to
lock task
Un-
reserve
Message

Message States:
Reserve Message
findAndModify (‘ready’)
s.state => ‘q1’
s.w => worker_id
$push _id onto task’s mq field
If msg is first in mq, s.state =>
‘busy’
Start processing
•
•
•
•
•
•
ready
q1
busy
q2
next
pending

Message States:
Reserve Message
findAndModify (‘next’)
s.state => ‘busy’
s.w => worker_id
start processing
•
•
•
•
ready
q1
busy
q2
next
pending

Message States:
Retire Message
findAndModify TaskState
$pull message _id from ‘mq’
findAndModify new first message
in ‘mq’ if its s.state is in [‘q1’, ‘q2’]
s.state => ‘next’
•
•
•
•
ready
q1
busy
q2
next
pending

Task States
States mainly
advisory
success,failure
transitions trigger
on_complete
message
‘chained’ is a tail-
call optimization
•
•
•
pending
active
chainedfailuresuccess

Basic Tasks:
FunctionTask
Simplest task: run a function to
completion, set the result to the return
value
If a ‘Suspend’ exception is raised, move
the task to ‘chained’ status
Other exceptions set task to ‘failure’, save
•
•
•
@task(ignore_result=True, priority=50)
def track(event, user_id=None, **kwargs):
log.info('track(%s, %s...)', event, user_id)
# ...

Digression: Task
Chaining
Task state set to ‘chained’
New “Chain” task is created that will
Call the “next” task
When the “next” task completes, also
complete the “current” task
•
•
•
•
@task(ignore_result=True,
priority=50)
def function_task(*args, **kwargs):
# ...
Chain.call(some_other_task)

Composite Tasks
on_complete message for each subtask
with slot=retire_subtask, specifying subtask
position & the result of the subtask
Different composite tasks implement ‘run’
and ‘retire_subtask’ differently
•
•
task_state.update(
{ '_id': subtask_id },
{ $set: {
'parent_id': composite_id,
'data.composite_position': position,
'options.ignore_result': false
}}
)

Composite Task:
Pipeline
Run
Send a ‘run’ message to the subtask with
position=0
Retire_subtask(position, result)
Send a ‘run’ message with the previous
result to the subtask with position =
(position+1), OR retire the Pipeline if no
more tasks
•
•
•
•

Composite Task:
Group
Run
Send a ‘run’ message to all subtasks
Retire_subtask(position, result)
Decrement the num_waiting counter
If num_waiting is 0, retire the group
Collect subtask results, complete
•
•
•
•
•
•

Reducing Polling
Reserving messages is expensive
Use Pub/Sub system instead
Publish to the channel whenever a
message is ready to be handled
Each worker subscribes to the channel
•
•
•
•

Pub/Sub for MongoDB
Capped
Collection
Fixed size
Fast inserts
“Tailable”
cursors
•
•
•
Tailable
Cursor

Getting a Tailable
Cursor
def get_cursor(collection, topic_re, await_data=True):
options = { 'tailable': True }
if await_data:
options['await_data'] = True
cur = collection.find(
{ 'k': topic_re },
**options)
cur = cur.hint([('$natural', 1)]) # ensure we don't use any indexes
return cur
import re, time
while True:
cur = get_cursor(
db.capped_collection,
re.compile('^foo'),
await_data=True)
for msg in cur:
do_something(msg)
time.sleep(0.1)
Holds open cursor for a
while
Make curso
Don’t use indexes
Still some polling
producer, so do
too fast

Building in retry...
def get_cursor(collection, topic_re, last_id=-1, await_data=True):
spec = {
'id': { '$gt': last_id }, # only new messages
'k': topic_re }
if await_data:
cur = collection.find(spec, **options)
return cur
Integer au

Building auto-increment
class Sequence(object):
...
def next(self, sname, inc=1):
doc = self._db[self._name].find_and_modify(
query={'_id': sname},
update={'$inc': { 'value': inc } },
upsert=True,
new=True)
return doc['value']
Atomically $inc the
dedicated document

Ludicrous Speed
from pymongo.cursor import _QUERY_OPTIONS
def get_cursor(collection, topic_re, last_id=-1, await_data=True):
spec = {
'ts': { '$gt': last_id }, # only new messages
'k': topic_re }
if await_data:
cur = collection.find(spec, **options)
if await:
cur = cur.add_option(_QUERY_OPTIONS['oplog_replay'])
return cur
id ==> ts
Co-opt the
oplog_repla
option

Questions?
Rick Copeland
rick@synapp.io
@rick446

Building a High-Performance Distributed Task Queue on MongoDB

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Building a High-Performance Distributed Task Queue on MongoDB

Similar to Building a High-Performance Distributed Task Queue on MongoDB (20)

More from MongoDB

More from MongoDB (20)

Building a High-Performance Distributed Task Queue on MongoDB