Realtime web2012

•Download as PPTX, PDF•

6 likes•2,041 views

Timothy Fitz

Technology

Talking to the browser
High concurrency
Scaling up

3 HARD PROBLEMS

Talking to the browser
• Short Polling
• Long Polling
• WebSocket
• Flash Socket

High Concurrency
• Blocking I/O
– Thread per process
– Tops out at 200 to 1k connections
• Non-blocking I/O
– One process, one thread
– 10k to 100k connections

Non-blocking I/O Servers
• Python
– Twisted
– Tornado
– gevent
• Not python
– Node.js
– Erlang something

Twisted
• Pro
– Can talk every protocol ever
– Oldest and most widely used in production
• Con
– Overkill for web-only tasks
– Not simple

Tornado
• Pro
– Simple
– Does HTTP stuff simply
• Con
– Might not interface with what you need
• Confusing
– You can run Tornado (HTTP layer) on top of
Twisted (networking layer)

gevent
• Pro
– Coroutines are a better model than callbacks
– As such, very easy to write complicated logic
• Con
– Least well documented
– Least consensus on best practices
– New, uncertain about production readiness

Node.js
• Pro
– Best documentation by far
– Socket.IO abstracts away browser communication
• Con
– Can’t share logic between Django app
– New, but has fairly large install base

Erlang
• Pro
– Hands down best for complex realtime tasks
– Forces you to think about concurrency/scale
– Abstracts away the network
– Old and reliable
• Con
– Forces you to think about concurrency/scale
– Can’t share logic between Django app
– High spin-up cost (functional, concurrency driven)

Just one
Frontend nodes x Backend nodes
More architecture decisions!

SCALING UP!

Just one
• Everything in memory
• Django nodes talk directly to box
• Spare for availability
• Failover = realtime data loss
– Make realtime 100% redundant

Probably good enough!
– WARNING: NAPKIN MATH
– 10k daily visits * 10.0min avg visit
= 70 average concurrent users
– One box can easily be built out to handle 3-5k
= Roughly 450k-700k daily visits

Frontend nodes x Backend nodes
• Frontend handle users / connections
• Backend handles channels

More architecture decisions!
• In memory backend
– Redis Pub/Sub
– ZeroMQ
– Roll your own
• Persisted to Disk:
– ActiveMQ
– RabbitMQ
– Amazon SQS

Redis Pub/Sub
• Simplest to setup
• Simplest model
• SUBSCRIBE channel_name
• PUBLISH channel_name “Hello World!”

ZeroMQ
• Publish/Subscribe semantics
• Request/Response
• Push/Pull (round robin)
• Extremely fast

Roll your own
• Same language as your frontend
– (Twisted/Node/Whatever)
• Only do this if you have per-channel business
logic
– You probably don’t.
• Erlang maps really really well to this domain.

Full Stack Services
• REST APIs to push to the browser
• http://pusher.com
• http://beaconpush.com

Canvas

Amazon ELB Nginx + Twisted Redis

Final Recommendations
• Need python? Twisted
• Don’t? Node.js/SocketIO
• Need scale/reliability? Redis backend.
• Complex? Going big? Erlang all the way.

Further Reading
• IMVU IMQ talk http://www.slideshare.net/JonWatte/message-queuing-
on-a-large-scale-imvus-stateful-realtime-message-queue
• Twilio talk on gevent + zeromq (given by Jeff Lindsay, highly recomended):
http://www.twilio.com/conference/video/distributed-systems-with-
gevent-and-zeromq
• Last.fm scaling Eralng/Mochiweb to 1 million concurrent connections on
one machine: http://www.metabrew.com/article/a-million-user-comet-
application-with-mochiweb-part-1
• The original Comet blog post: http://infrequently.org/2006/03/comet-low-
latency-data-for-the-browser/
• Django + Socket.IO + gevent:
http://codysoyland.com/2011/feb/6/evented-django-part-one-socketio-
and-gevent/

What's hot

Rabbits, indians and... Symfony meets queueing brokersGaetano Giunta

Ruby Concurrency RealitiesMike Subelsky

Cluster Fudge: Recipes for WordPress in the Cloud (WordCamp Austin 2014 Speaker)Grant Norwood

Zero mq logsTomas Doran

Perconalive feb-2011-sharemdcallag

Hybrid concurrency patternsKyle Drake

STAQ Development Manual (Redacted)Mike Subelsky

Get Off My Thread! - keep your UI super-responsiveDroidConTLV

Making Symfony Services async with RabbitMq (and more Symfony)Gaetano Giunta

Big Data! Great! Now What? #SymfonyCon 2014Ricard Clau

My month with Rubyalextomovski

Stackato v3Jonas Brømsø

Ratpack for RealTomAkehurst

Riak at Posterouscapotej

Dark Fairytales from a PhishermanMichele Orru

Why internal pen tests are still funpyschedelicsupernova

Some dope on Zope (Jan 2002, Bangalore LUG)Kiran Jonnalagadda

What's new in Symfony3Yuki MAEJIMA

Designing a Docker Stack for Symfony apps: lessons learnedGaetano Giunta

Woo: Writing a fast web serverfukamachi

What's hot (20)

Rabbits, indians and... Symfony meets queueing brokers

Ruby Concurrency Realities

Cluster Fudge: Recipes for WordPress in the Cloud (WordCamp Austin 2014 Speaker)

Zero mq logs

Perconalive feb-2011-share

Hybrid concurrency patterns

STAQ Development Manual (Redacted)

Get Off My Thread! - keep your UI super-responsive

Making Symfony Services async with RabbitMq (and more Symfony)

Big Data! Great! Now What? #SymfonyCon 2014

My month with Ruby

Stackato v3

Ratpack for Real

Riak at Posterous

Dark Fairytales from a Phisherman

Why internal pen tests are still fun

Some dope on Zope (Jan 2002, Bangalore LUG)

What's new in Symfony3

Designing a Docker Stack for Symfony apps: lessons learned

Woo: Writing a fast web server

Similar to Realtime web2012

Html5 web sockets - Brad Drysdale - London Web 2011-10-20Nathan O'Hanlon

Messaging, interoperability and log aggregation - a new frameworkTomas Doran

Scaling with Symfony - PHP UKRicard Clau

Life Beyond Rails: Creating Cross Platform Ruby AppsTristan Gomez

The State of WebSockets in DjangoRami Sayar

JavaQUAID-E-AWAM UNIVERSITY OF ENGINEERING, SCIENCE & TECHNOLOGY, NAWABSHAH, SINDH, PAKISTAN

From a student to an apache committer practice of apache io tdbjixuan1989

High performance network programming on the jvm oscon 2012 Erik Onnen

Distributed "Web Scale" SystemsRicardo Vice Santos

Fast, concurrent ruby web applications with EventMachine and EM::SynchronyKyle Drake

NullMQ @ PDXJeff Lindsay

Modern software architectures - PHP UK Conference 2015Ricard Clau

EKON27-FrameworksTuning.pdfArnaud Bouchez

Be faster then rabbitsVladislav Bauer

Lares from LOW to PWNEDChris Gates

SPDYAndreas Bjärlestam

Three years of OFELIA - taking stockFIBRE Testbed

John adams talk cloudyJohn Adams

Vert.x introductionGR8Conf

Cloud Connected Devices on a Global Scale (CPN303) | AWS re:Invent 2013Amazon Web Services

Similar to Realtime web2012 (20)

Html5 web sockets - Brad Drysdale - London Web 2011-10-20

Messaging, interoperability and log aggregation - a new framework

Scaling with Symfony - PHP UK

Life Beyond Rails: Creating Cross Platform Ruby Apps

The State of WebSockets in Django

Java

From a student to an apache committer practice of apache io tdb

High performance network programming on the jvm oscon 2012

Distributed "Web Scale" Systems

Fast, concurrent ruby web applications with EventMachine and EM::Synchrony

NullMQ @ PDX

Modern software architectures - PHP UK Conference 2015

EKON27-FrameworksTuning.pdf

Be faster then rabbits

Lares from LOW to PWNED

SPDY

Three years of OFELIA - taking stock

John adams talk cloudy

Vert.x introduction

Cloud Connected Devices on a Global Scale (CPN303) | AWS re:Invent 2013

Recently uploaded

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Artificial Intelligence: Facts and MythsJoaquim Jorge

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

Automating Google Workspace (GWS) & more with Apps Scriptwesley chun

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Real Time Object Detection Using Open CVKhem

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

A Domino Admins Adventures (Engage 2024)Gabriella Davis

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Recently uploaded (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Finology Group – Insurtech Innovation Award 2024

Axa Assurance Maroc - Insurer Innovation Award 2024

Boost PC performance: How more available memory can improve productivity

Artificial Intelligence: Facts and Myths

08448380779 Call Girls In Friends Colony Women Seeking Men

Automating Google Workspace (GWS) & more with Apps Script

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

Handwritten Text Recognition for manuscripts and early printed texts

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

CNv6 Instructor Chapter 6 Quality of Service

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Powerful Google developer tools for immediate impact! (2023-24 C)

Real Time Object Detection Using Open CV

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

A Domino Admins Adventures (Engage 2024)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

GenCyber Cyber Security Day Presentation

Realtime web2012

1. Building Real-Time Web http://tinyurl.com/realtime2012 http:// Timothy Fitz .com CTO Canvas

2. What is “Realtime web”

3. What does “Realtime” look like?

4. What does “Realtime” look like?

5. What does “Realtime” look like?

6. “Push, not pull.” REALTIME WEB

7. Talking to the browser High concurrency Scaling up 3 HARD PROBLEMS

8. Talking to the browser • Short Polling • Long Polling • WebSocket • Flash Socket

9. Short Polling

10. Long Polling

11. Flash Socket

12. WebSocket

13. High Concurrency • Blocking I/O – Thread per process – Tops out at 200 to 1k connections • Non-blocking I/O – One process, one thread – 10k to 100k connections

14. Django

15. Django Apache

16. There is no apache for realtime

17. Non-blocking I/O Servers • Python – Twisted – Tornado – gevent • Not python – Node.js – Erlang something

18. Twisted • Pro – Can talk every protocol ever – Oldest and most widely used in production • Con – Overkill for web-only tasks – Not simple

19. Tornado • Pro – Simple – Does HTTP stuff simply • Con – Might not interface with what you need • Confusing – You can run Tornado (HTTP layer) on top of Twisted (networking layer)

20. gevent • Pro – Coroutines are a better model than callbacks – As such, very easy to write complicated logic • Con – Least well documented – Least consensus on best practices – New, uncertain about production readiness

21. Node.js • Pro – Best documentation by far – Socket.IO abstracts away browser communication • Con – Can’t share logic between Django app – New, but has fairly large install base

22. Erlang • Pro – Hands down best for complex realtime tasks – Forces you to think about concurrency/scale – Abstracts away the network – Old and reliable • Con – Forces you to think about concurrency/scale – Can’t share logic between Django app – High spin-up cost (functional, concurrency driven)

23. Just one Frontend nodes x Backend nodes More architecture decisions! SCALING UP!

24. Just one • Everything in memory • Django nodes talk directly to box • Spare for availability • Failover = realtime data loss – Make realtime 100% redundant

25. Probably good enough! – WARNING: NAPKIN MATH – 10k daily visits * 10.0min avg visit = 70 average concurrent users – One box can easily be built out to handle 3-5k = Roughly 450k-700k daily visits

26. Frontend nodes x Backend nodes • Frontend handle users / connections • Backend handles channels

27. More architecture decisions! • In memory backend – Redis Pub/Sub – ZeroMQ – Roll your own • Persisted to Disk: – ActiveMQ – RabbitMQ – Amazon SQS

28. Redis Pub/Sub • Simplest to setup • Simplest model • SUBSCRIBE channel_name • PUBLISH channel_name “Hello World!”

29. ZeroMQ • Publish/Subscribe semantics • Request/Response • Push/Pull (round robin) • Extremely fast

30. Roll your own • Same language as your frontend – (Twisted/Node/Whatever) • Only do this if you have per-channel business logic – You probably don’t. • Erlang maps really really well to this domain.

31. Full Stack Services • REST APIs to push to the browser • http://pusher.com • http://beaconpush.com

32. Canvas Amazon ELB Nginx + Twisted Redis

33. Final Recommendations • Need python? Twisted • Don’t? Node.js/SocketIO • Need scale/reliability? Redis backend. • Complex? Going big? Erlang all the way.

34. Questions?

35. Further Reading • IMVU IMQ talk http://www.slideshare.net/JonWatte/message-queuing- on-a-large-scale-imvus-stateful-realtime-message-queue • Twilio talk on gevent + zeromq (given by Jeff Lindsay, highly recomended): http://www.twilio.com/conference/video/distributed-systems-with- gevent-and-zeromq • Last.fm scaling Eralng/Mochiweb to 1 million concurrent connections on one machine: http://www.metabrew.com/article/a-million-user-comet- application-with-mochiweb-part-1 • The original Comet blog post: http://infrequently.org/2006/03/comet-low- latency-data-for-the-browser/ • Django + Socket.IO + gevent: http://codysoyland.com/2011/feb/6/evented-django-part-one-socketio- and-gevent/

Editor's Notes

Also known as Comet (in response to AJAX)And before that, under the umbrella of “DHTML” (throwback to the late 90s!)
Latency often doesn’t matter at all (3-5s wouldn’t be noticed, for popular hashtags 1 minute wouldn’t make a difference)
Chat (which is pubsub on steroids)Presence (the fact that you’re connected is important)Latency matters some, but you wouldn’t notice 1s of lag.
Gaming, networked simulated physics / simulated spaces. Latency is critical in both directions (~200ms matters)
Also a dozen other methods, and aggregate methods that have built-in fall back semantics.
Supported absolutely everywhereIncredibly efficientIncredibly easy to implement, hard to get wrongRight for infrequent realtime, or tied to existing expensive operation (most common example: short poll Paypal/payment gateway for success confirmation)
Works everywhere (desktop and mobile)Supports most use cases (twitter, etc)
Requires flash support (user has it, no flashblock, desktop only for the most part)Bidirectional and binary.Bidirectional really only matters for realtime interactive apps (games, virtual spaces, motion is one of the few places where 200ms latency matters)Flash is dying, but if your app already requires (or if your UI is already in flash, hello vidya game) then this might be the best solution.
Works on Chrome, FF, Safari, iOS mobile, IE10 previews. Coming to Android Mobile soon.Bidirectional, but UTF-8 (probably doesn’t matter)Very new (RFC hit “Proposed Standard” in Dec 2011, which means the spec is solidified. “Internet Standard” is then next step, and reserved for two independent interoperable implementations, very close)Great but you’ll probably have to support fallback for a while 
Super simplifying, lots of options exist including hybrids.Often run one non-blocking process per core (if you have to scale to multiple machines, using the same strategy for multiple processes is trivial)
Okay this is kind of a lie, there are hacky ways but you lose most of what makes Django, Django: sessions, users, auth, ORM, and most 3rd party libraries
There is no consensus. There are some good python options. There are a LOT of options I’m not even mentioning, almost every language has two or three non-blocking I/O webservers. Python might be important, especially if you have logic you want to reuse between your Django application and your non-blocking I/O app
Can have two for redundancy

Realtime web2012

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Realtime web2012

Similar to Realtime web2012 (20)

More from Timothy Fitz

More from Timothy Fitz (12)

Recently uploaded

Recently uploaded (20)

Realtime web2012

Editor's Notes