SlideShare a Scribd company logo
1 of 17
Download to read offline
1




      Powering the Britain’s Got Talent
      buzzer*
       *And Big Data



                        Big Data Meetup, London 25/5/2011


Thursday, 26 May 2011                                           1
2




      What we do




Thursday, 26 May 2011       2
3




      Me




            Malcolm Box, Co-founder & CTO

            boxm@livetalkback.com

            @malcolmbox




Thursday, 26 May 2011                           3
4




      The Buzzer




                        BIG DATA




Thursday, 26 May 2011                  4
5




      The challenge



            10 Million+ viewers

            Design goal of 50,000 requests/s, 10,000 buzzes/second

                  Equivalent to 130 Billion requests/month

            But just on Saturday night

            And four weeks to build




Thursday, 26 May 2011                                                    5
6




      The challenge




                                      Where does 130
                                     Billion requests fit?




                 Source: http://www.google.com/adplanner/static/top1000/#

Thursday, 26 May 2011                                                           6
7




      Where we started....

                                          app.livetalkback.com          cdn.livetalkback.com


                        Control plane
                                                    ELB                      CloudFront




                           Zabbix
                                        Webserver           Webserver
                                         Django              Django
                                         Ubuntu              Ubuntu




                                                    MySQL                        S3




Thursday, 26 May 2011                                                                              7
8




      Step 1: Testing




            Started with a platform with a previous peak of 100 requests/s

            No idea where it would break

            Tsung! http://tsung.erlang-projects.org/




Thursday, 26 May 2011                                                            8
9




      Step 2: ELB



            Amazon Elastic Load Balancer

            “Infinite capacity”

            BUT very long impulse response and NO controls :(


            HAProxy to the rescue

                  5K requests/s per node




Thursday, 26 May 2011                                               9
10




      Step 3: Avoid the DB



            MySQL was never going to be able to handle 10,000 writes/s, nor 50,000
            reads

            “Hey, Django does memcached. Problem solved”

            Help, our memcached server I/O is maxed out :(

            Two-layer cache: https://gist.github.com/953524

            Write-behind data




Thursday, 26 May 2011                                                                     10
11




      But we want analytics!




            Now 10K things to write to disk every second

            Logging? Database?

            This is starting to look like BIG DATA




Thursday, 26 May 2011                                           11
12




      Step 4: Baby




Thursday, 26 May 2011        12
13




      Step 5: Cassandra




            Deployed Cassandra cluster on EC2 to handle buzz records

            Tested to > 10K writes/s

            All good!

            “So how many users did we have last night?”




Thursday, 26 May 2011                                                       13
14




      Where we ended...

                                                     app.livetalkback.com                   cdn.livetalkback.com
                                                                                                                    10
                        Control plane
                                               HAProxy               HAProxy                     CloudFront        nodes


                            Chef

                                              Webserver             Webserver                                      100+
                                                                                                                   nodes
                                               Django                Django
                                               Ubuntu                Ubuntu
                           Zabbix




                                        Memcached          Cassandra
                                         Memcached          Cassandra          RDS Master            S3




Thursday, 26 May 2011                                                                                                           14
15




      Scaling up - and down


            Configuring 100+ servers by
            hand each week would have
            been a pain

            Used to Chef to automate

            Also builds the test swarm

            http://wiki.opscode.com/display/
            chef/Home




Thursday, 26 May 2011                               15
16




      Now what?




            Still challenges with analytics & ad-hoc queries

                  Looking at Brisk and Hadoop

            We’re sucking the Twitter firehose for Tellybug

                  MySQL is coping so far, but only just




Thursday, 26 May 2011                                               16
17




      Questions?
       boxm@livetalkback.com

       @malcolmbox




Thursday, 26 May 2011               17

More Related Content

Similar to Scaling the Britain's Got Talent Buzzer

Conquistando el Servidor con Node.JS
Conquistando el Servidor con Node.JSConquistando el Servidor con Node.JS
Conquistando el Servidor con Node.JS
Caridy Patino
 
Java EE Servlet JSP Tutorial- Cookbook 1
Java EE Servlet JSP Tutorial- Cookbook 1Java EE Servlet JSP Tutorial- Cookbook 1
Java EE Servlet JSP Tutorial- Cookbook 1
billdigman
 
What python can learn from java
What python can learn from javaWhat python can learn from java
What python can learn from java
jbellis
 
WTF Is Messaging And Why You Should Use It?
WTF Is Messaging And Why You Should Use It?WTF Is Messaging And Why You Should Use It?
WTF Is Messaging And Why You Should Use It?
James Russell
 
20110903 candycane
20110903 candycane20110903 candycane
20110903 candycane
Yusuke Ando
 
Shake Hooves With BeEF - OWASP AppSec APAC 2012
Shake Hooves With BeEF - OWASP AppSec APAC 2012Shake Hooves With BeEF - OWASP AppSec APAC 2012
Shake Hooves With BeEF - OWASP AppSec APAC 2012
Christian Frichot
 

Similar to Scaling the Britain's Got Talent Buzzer (20)

Retrospective from a startup built in the cloud: top three big lessons learne...
Retrospective from a startup built in the cloud: top three big lessons learne...Retrospective from a startup built in the cloud: top three big lessons learne...
Retrospective from a startup built in the cloud: top three big lessons learne...
 
Caridy patino - node-js
Caridy patino - node-jsCaridy patino - node-js
Caridy patino - node-js
 
Conquistando el Servidor con Node.JS
Conquistando el Servidor con Node.JSConquistando el Servidor con Node.JS
Conquistando el Servidor con Node.JS
 
Java EE Servlet JSP Tutorial- Cookbook 1
Java EE Servlet JSP Tutorial- Cookbook 1Java EE Servlet JSP Tutorial- Cookbook 1
Java EE Servlet JSP Tutorial- Cookbook 1
 
Scaling Selenium
Scaling SeleniumScaling Selenium
Scaling Selenium
 
AppScale Talk at SBonRails
AppScale Talk at SBonRailsAppScale Talk at SBonRails
AppScale Talk at SBonRails
 
Corwin on Containers
Corwin on ContainersCorwin on Containers
Corwin on Containers
 
What python can learn from java
What python can learn from javaWhat python can learn from java
What python can learn from java
 
presentation-chaos-monkey
presentation-chaos-monkeypresentation-chaos-monkey
presentation-chaos-monkey
 
2011 02-14-libre
2011 02-14-libre2011 02-14-libre
2011 02-14-libre
 
WTF Is Messaging And Why You Should Use It?
WTF Is Messaging And Why You Should Use It?WTF Is Messaging And Why You Should Use It?
WTF Is Messaging And Why You Should Use It?
 
20110903 candycane
20110903 candycane20110903 candycane
20110903 candycane
 
Shake Hooves With BeEF - OWASP AppSec APAC 2012
Shake Hooves With BeEF - OWASP AppSec APAC 2012Shake Hooves With BeEF - OWASP AppSec APAC 2012
Shake Hooves With BeEF - OWASP AppSec APAC 2012
 
Big Data Israel Meetup : Couchbase and Big Data
Big Data Israel Meetup : Couchbase and Big DataBig Data Israel Meetup : Couchbase and Big Data
Big Data Israel Meetup : Couchbase and Big Data
 
We all need friends and Akka just found Kubernetes
We all need friends and Akka just found KubernetesWe all need friends and Akka just found Kubernetes
We all need friends and Akka just found Kubernetes
 
Amazon Elastic Beanstalk - Indicthreads.com cloud computing conference 2011
Amazon Elastic Beanstalk  - Indicthreads.com cloud computing conference 2011Amazon Elastic Beanstalk  - Indicthreads.com cloud computing conference 2011
Amazon Elastic Beanstalk - Indicthreads.com cloud computing conference 2011
 
HTTP cache @ PUG Rome 03-29-2011
HTTP cache @ PUG Rome 03-29-2011HTTP cache @ PUG Rome 03-29-2011
HTTP cache @ PUG Rome 03-29-2011
 
Architecting for the Cloud: demo and best practices, by Simone Brunozzi (2011...
Architecting for the Cloud: demo and best practices, by Simone Brunozzi (2011...Architecting for the Cloud: demo and best practices, by Simone Brunozzi (2011...
Architecting for the Cloud: demo and best practices, by Simone Brunozzi (2011...
 
Effective Platform Building with Kubernetes. Is K8S new Linux?
Effective Platform Building with Kubernetes. Is K8S new Linux?Effective Platform Building with Kubernetes. Is K8S new Linux?
Effective Platform Building with Kubernetes. Is K8S new Linux?
 
2 my sql overview technology etc[1]. etc
2 my sql overview   technology etc[1]. etc2 my sql overview   technology etc[1]. etc
2 my sql overview technology etc[1]. etc
 

Recently uploaded

Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 

Recently uploaded (20)

Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Designing for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastDesigning for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at Comcast
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
The UX of Automation by AJ King, Senior UX Researcher, Ocado
The UX of Automation by AJ King, Senior UX Researcher, OcadoThe UX of Automation by AJ King, Senior UX Researcher, Ocado
The UX of Automation by AJ King, Senior UX Researcher, Ocado
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 

Scaling the Britain's Got Talent Buzzer

  • 1. 1 Powering the Britain’s Got Talent buzzer* *And Big Data Big Data Meetup, London 25/5/2011 Thursday, 26 May 2011 1
  • 2. 2 What we do Thursday, 26 May 2011 2
  • 3. 3 Me Malcolm Box, Co-founder & CTO boxm@livetalkback.com @malcolmbox Thursday, 26 May 2011 3
  • 4. 4 The Buzzer BIG DATA Thursday, 26 May 2011 4
  • 5. 5 The challenge 10 Million+ viewers Design goal of 50,000 requests/s, 10,000 buzzes/second Equivalent to 130 Billion requests/month But just on Saturday night And four weeks to build Thursday, 26 May 2011 5
  • 6. 6 The challenge Where does 130 Billion requests fit? Source: http://www.google.com/adplanner/static/top1000/# Thursday, 26 May 2011 6
  • 7. 7 Where we started.... app.livetalkback.com cdn.livetalkback.com Control plane ELB CloudFront Zabbix Webserver Webserver Django Django Ubuntu Ubuntu MySQL S3 Thursday, 26 May 2011 7
  • 8. 8 Step 1: Testing Started with a platform with a previous peak of 100 requests/s No idea where it would break Tsung! http://tsung.erlang-projects.org/ Thursday, 26 May 2011 8
  • 9. 9 Step 2: ELB Amazon Elastic Load Balancer “Infinite capacity” BUT very long impulse response and NO controls :( HAProxy to the rescue 5K requests/s per node Thursday, 26 May 2011 9
  • 10. 10 Step 3: Avoid the DB MySQL was never going to be able to handle 10,000 writes/s, nor 50,000 reads “Hey, Django does memcached. Problem solved” Help, our memcached server I/O is maxed out :( Two-layer cache: https://gist.github.com/953524 Write-behind data Thursday, 26 May 2011 10
  • 11. 11 But we want analytics! Now 10K things to write to disk every second Logging? Database? This is starting to look like BIG DATA Thursday, 26 May 2011 11
  • 12. 12 Step 4: Baby Thursday, 26 May 2011 12
  • 13. 13 Step 5: Cassandra Deployed Cassandra cluster on EC2 to handle buzz records Tested to > 10K writes/s All good! “So how many users did we have last night?” Thursday, 26 May 2011 13
  • 14. 14 Where we ended... app.livetalkback.com cdn.livetalkback.com 10 Control plane HAProxy HAProxy CloudFront nodes Chef Webserver Webserver 100+ nodes Django Django Ubuntu Ubuntu Zabbix Memcached Cassandra Memcached Cassandra RDS Master S3 Thursday, 26 May 2011 14
  • 15. 15 Scaling up - and down Configuring 100+ servers by hand each week would have been a pain Used to Chef to automate Also builds the test swarm http://wiki.opscode.com/display/ chef/Home Thursday, 26 May 2011 15
  • 16. 16 Now what? Still challenges with analytics & ad-hoc queries Looking at Brisk and Hadoop We’re sucking the Twitter firehose for Tellybug MySQL is coping so far, but only just Thursday, 26 May 2011 16
  • 17. 17 Questions? boxm@livetalkback.com @malcolmbox Thursday, 26 May 2011 17