Workers and Event Processors that Scale
August 2015 Meetup

WHAT? WHY?
• Image processing
• Video encoding
• Data processing
• Report generation
• Order processing
• And so on…

TRADITIONAL METHOD
1. Create & secure page in monolith
2. Set up cron to call the URL on a schedule, or just offload the
process on demand within the application.
3. Watch as resources spike. Worry about when to
run tasks so performance is not impacted.

WORKER PATTERNS
• Small independent unit of work
• Think concurrently
• Optimal worker durations
• Scheduling & Quarterbacking
• Pass IDs, be thin
• Log everything
• Use message queues

SMALL INDEPENDENT UNIT OF WORK
Keep workers task-specific and as light as possible, with one purpose.
MUCH easier to maintain and scale.

THINK CONCURRENTLY
Design the system as a collection of non-interacting workers that may be
executed in parallel.
Independent and self-sufficient.
Loosely coupled, no shared state.
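
A minimal sketch of this idea in Python. The `resize_image` task and the image IDs are hypothetical and not from the talk; the point is only that each call depends on nothing but its own argument, so a pool can run the calls in parallel.

```python
from concurrent.futures import ThreadPoolExecutor


def resize_image(image_id: int) -> str:
    """Hypothetical unit of work: uses only its argument, touches no shared state."""
    return f"image-{image_id}-resized"


def run_all(image_ids: list[int], max_workers: int = 4) -> list[str]:
    # Each call is independent, so the pool may run them in parallel.
    # pool.map returns results in the same order as the inputs.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(resize_image, image_ids))


if __name__ == "__main__":
    print(run_all([1, 2, 3]))
```

Because no worker reads or writes anything another worker owns, adding capacity is a matter of raising `max_workers` or adding machines.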

OPTIMAL TASK DURATIONS
Long enough to justify the setup cost, but short enough to quickly fix and
retry on errors.
Too short and the setup/teardown overhead is not worth it. If tasks are
short, batch several together.
Too long increases the opportunity for failure and the complexity of retry.
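
One way to batch short tasks, sketched in Python. The helper names `batches` and `process_batch` are illustrative assumptions, and the batch size is something you would tune against your own setup cost.

```python
from typing import Iterator, Sequence


def batches(task_ids: Sequence[int], size: int) -> Iterator[Sequence[int]]:
    """Yield consecutive slices of at most `size` task IDs."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(task_ids), size):
        yield task_ids[start:start + size]


def process_batch(batch: Sequence[int]) -> int:
    """Stand-in worker body: setup would run once here, then each short task."""
    handled = 0
    for _task_id in batch:
        handled += 1  # real work for one task would go here
    return handled


if __name__ == "__main__":
    for batch in batches([1, 2, 3, 4, 5], size=2):
        print(batch, process_batch(batch))
```

A failed batch can be retried as a whole, which is why the batch should stay small enough that a retry is cheap.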

SCHEDULING & QUARTERBACKING
Create a small number of scheduled jobs, and have each one queue up many
concurrent jobs.
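
A rough sketch of the quarterback idea using only the Python standard library. An in-process `queue.Queue` stands in for a real message queue here, and the function names and order IDs are assumptions made for illustration.

```python
import queue
import threading


def quarterback(order_ids, jobs):
    """Scheduled entry point: does no heavy work, only queues one job per order."""
    for order_id in order_ids:
        jobs.put(order_id)


def worker(jobs, results):
    """Drain the queue until it is empty, handling one job at a time."""
    while True:
        try:
            order_id = jobs.get_nowait()
        except queue.Empty:
            return
        results.put(f"processed order {order_id}")  # stand-in for real work


def run(order_ids, worker_count=3):
    jobs, results = queue.Queue(), queue.Queue()
    quarterback(order_ids, jobs)
    threads = [
        threading.Thread(target=worker, args=(jobs, results))
        for _ in range(worker_count)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # All workers have finished, so the results queue can be drained safely.
    out = []
    while not results.empty():
        out.append(results.get())
    return sorted(out)


if __name__ == "__main__":
    print(run([1, 2, 3]))
```

The scheduled job stays cheap and predictable; the amount of concurrent work is controlled by how many workers drain the queue.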

PASS IDS, BE THIN
Pass the minimum amount of information required for the worker/processor
to run. Then retrieve the full data while running.
• Better performance while queuing up tasks
• Get latest data
• Much easier to maintain state on that object
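
A small sketch of a thin payload in Python. The `ORDERS` dictionary is a stand-in for a database and, like the function names, is an assumption for illustration only.

```python
import json

# Stand-in for a database; in a real system this would be a query.
ORDERS = {42: {"id": 42, "status": "pending", "total": 19.99}}


def enqueue_payload(order_id: int) -> str:
    """Build the queued message: only the ID, never the whole record."""
    return json.dumps({"order_id": order_id})


def handle(payload: str) -> dict:
    """Worker side: load the current record by ID at run time, then act on it."""
    order_id = json.loads(payload)["order_id"]
    order = ORDERS[order_id]  # fetch the latest state, not a stale copy
    order["status"] = "processed"
    return order


if __name__ == "__main__":
    print(handle(enqueue_payload(42)))
```

If the order changes between queuing and processing, the worker still sees the current version, because the payload never carried a copy of it.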

LOG EVERYTHING
Log everything using a cloud logger or other global logger.
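
As a rough illustration, here is one way to log every step of a task through a single shared logger with Python's standard `logging` module. The logger name, the `task_id` field, and the `run_task` function are assumptions; a cloud logging service's handler could be passed to `configure` in place of the stream handler.

```python
import logging
import sys

logger = logging.getLogger("workers")  # one shared logger name for every worker


def configure(handler: logging.Handler) -> None:
    """Attach a handler; a cloud logging handler could be swapped in here."""
    handler.setFormatter(
        logging.Formatter("%(levelname)s task=%(task_id)s %(message)s")
    )
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)


def run_task(task_id: int) -> None:
    # The adapter stamps every record from this task with its ID.
    log = logging.LoggerAdapter(logger, {"task_id": task_id})
    log.info("started")
    try:
        if task_id < 0:
            raise ValueError("negative task id")
        log.info("finished")
    except ValueError:
        log.exception("failed")  # logs at ERROR level with the traceback


if __name__ == "__main__":
    configure(logging.StreamHandler(sys.stdout))
    run_task(7)
```

Tagging each line with the task ID makes it possible to follow one job across many concurrent workers in a single log stream.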

USE MESSAGE QUEUES
Plan for concurrency issues at scale.

SOLUTIONS
We are going to focus on cloud solutions

Feature                  AWS Lambda                   IronWorker
Event-based triggers     Yes                          Yes
Timeout                  60 sec (rounded to 100 ms)   1 hour (rounded to 1 sec)
Startup time             Milliseconds                 Few seconds, variable
Languages                Node.js & Java               All major
Memory size              64-1536 MB                   320-2048 MB
Disk size                512 MB                       10 GB
Max payload size         6 MB                         64 KB
Clouds                   AWS                          AWS, Rackspace, Azure, private
Dedicated clusters       No                           Yes
On-premise               No                           Yes
Built-in scheduling      No                           Yes
Simple function editor   Yes                          No
Free tier                Yes                          5 concurrent & 10 hours

ADDITIONAL ITEMS (IRONWORKER)
• The CLI has additional options to fine-tune how the worker should
process (max-concurrency, retries, retries-delay, delay, timeout,
scheduling, and multiple environments)
• Configuration variables via JSON or YAML files
• Multiple invoke methods
  • CLI
  • Webhooks
  • Via IronMQ
  • REST API

ADDITIONAL ITEMS (AWS LAMBDA)
• Deployment automation
• Multiple invoke methods
  • Kinesis or DynamoDB stream
  • S3 event
  • CloudTrail events
  • API Gateway
  • REST API
• Gotcha: You might need a remote build server.

QUESTIONS?
https://github.com/GrNodeDev/ExchangeRate
