Continuous Deployment at IMVU

Brett Durrett
Brett DurrettCEO at IMVU, Inc.
Scaling with Continuous Deployment

          Social Developer Summit
      San Francisco, CA, June 29, 2010

             Brett G. Durrett (@bdurrett)
 Vice President Engineering & Operations, IMVU, Inc.




                                                       0
An online community where members use 3D avatars
           to meet new people, chat, create
            and have fun with their friends
Survey Says




Continuous Deployment... who is with me?




                                           2
In a Nutshell




What is Continuous Deployment?

• Engineer commits code
• 20 minutes later it is live in production
• Repeat about 50 times per day


                                                 3
Does This Really Work?


“Maybe this is just viable for a single
  developer … your site will be down. A lot.”

“It seems like the author either has no
   customers or very understanding
   customers”

    Responses to February 2009 posting by Timothy Fitz about Continuous Deployment at IMVU
                                                 (at the time IMVU had a $12 million run rate)



                                                                                                 4
Benefits




• Regressions easy to find, correct

• Releases have zero overhead

• Rapid iteration using real customer metrics


                                                5
Finding and Fixing Problems


                        • Each release has few
                          changes, 1-3 commits

                        • Production issues
                          correlate with check-
                          in timestamp

                        • No overhead to
Identifying cause
  takes minutes
                          producing a new
                          release to correct
                          issue
CD at IMVU: Simple Overview

  Local tests                                                   Rollback
pass, engineer                                                  (Blocks)
commits code
                                No


Lots and lots of            Metrics        Yes    Code deployed
   tests run                good?                  to all servers




   All tests       Yes   Code deployed               Metrics
                                                                    No
    pass?                to % of servers              still
                                                     good?

         No                                               Yes

Revert commit
  (Blocks)                                 Win!

                                                                           7
CD at IMVU: Detailed Overview




                            8
Getting Started – Extreme Basics




1. Continuous integration system
2. Production monitoring and alerting
   – System performance
   – Business metrics
   – Trending is nice too 
3. Simple deploy / roll-back system


                                             9
Commit to Making Forward Progress




• Require coverage for all new code

• Add coverage for bugs / regressions

• Understand and fix root cause of failures


                                              10
Expect Some Hurdles


• Production outages
• New overhead
   – Tests
   – Build systems
• Production outages
• Frustration
• Production outages

   (but well worth it)
Dealing with SQL


Problems
• Difficult to roll-back schema
• Alter statements lock / impact customers

Solutions
• New schema has formal review process
• No alter on large tables, create new table
   – Copy on read
   – Complete migration with background job
                                               12
Big Features



• Developed on trunk, not branch
   – “hidden” from customers by A/B experiment
   – 100% control, add QA to experiment


• Deployed daily during development

• Slow roll-out by increasing experiment %
   – Experiment closed = fully launched
                                                 13
Test Speed


Slow tests burden to scaling
• Can’t run all tests in sandbox
• Faster to debug on build cluster

If possible…
• Keep tests fast
• Keep tests specific


                                              14
The cost of failing tests



As the team grows…

• More likely to have test failures
• More people blocked as a result

        Intermittent failures very bad
          Eliminate the root cause

                                                15
Other Issues


• Won’t catch issues that fail slowly
   – SELECT * FROM growing_table WHERE 1


• Some critical areas cause hard lock-ups
   – MySQL
   – Memcached


• Lack of test coverage of older code
   – Not an issue if you start with test coverage
                                                    16
Does Continuous Deployment Scale?

• Technical staff ~50 people

• 10 million monthly unique visitors

• Peak ~115K concurrent IM client logins

• It’s a real business!
   – $40 million run rate
   – Profitable and doubled revenue in 2009
                                              17
Newer Scaling Challenges




Biggest challenges come with growth of the
          engineering organization




                                             18
SLA for Build Systems




Build systems are a critical service




                                        19
SLA for Build Systems




Build systems are a critical service
        Run them that way




                                        20
Build and Push Times




                   21
Overall Availability




                   22
Build Throughput


• Initial implementation sequential builds
   – Scaled okay to ~20 engineers
   – Like trains running every 20 minutes
   – One “red” blocks all following builds


• Solution: build isolation
   – Enable testing single build without deploy
   – “Red” build pulled, allow other builds to pass


                                                      23
Current Systems



• > 15,000 tests

• 72 web build servers
   – 51 Linux, 21 Windows

• > 6 hours of tests on average hardware

• Deploy to cluster of ~700 servers

                                            24
Web Build Software



•   Custom test-file runner with JS GUI
•   PHP SimpleTest
•   Python's built-in unittest
•   Selenium Core with in-house API wrapper
•   YUITest for browser JS unit tests
•   Erlang Eunit


                                              25
Conclusion



• Continuous Deployment is good

• Try it – starting earlier is easier

• It’s a key part of a nutritious development
  process


                                                 26
Questions?




             27
More on Continuous Deployment

• SD Times Leaders of Agile: Kent Beck's
  Principles of Agility: http://bit.ly/9wsAYv
  (this webinar tomorrow, June 30)

• Eric Ries (Startup Lessons Learned) on
  Continuous Deployment: http://bit.ly/5l6X1

• Timothy Fitz (IMVU) Doing the impossible
  50 times a day: http://bit.ly/OxJv
                                                28
Thank You!



           Brett G. Durrett
          bdurrett@imvu.com
           Twitter: @bdurrett

      IMVU was recognized as one of
the “Best Places to Work” (and we’re hiring)
         http://www.imvu.com/jobs/



                                                29
1 of 30

Recommended

Avoiding integration hell by
Avoiding integration hellAvoiding integration hell
Avoiding integration hellaaronbassett
1.2K views46 slides
2016 04-25 continuous integration at google scale by
2016 04-25 continuous integration at google scale2016 04-25 continuous integration at google scale
2016 04-25 continuous integration at google scaleJohn Micco
3.9K views30 slides
Continuous Delivery Un caso de estudio by
Continuous Delivery Un caso de estudioContinuous Delivery Un caso de estudio
Continuous Delivery Un caso de estudioOsvaldo
1.2K views40 slides
Dev ops e infraestructura – acompañando nuestro software a producción by
Dev ops e infraestructura – acompañando nuestro software a producciónDev ops e infraestructura – acompañando nuestro software a producción
Dev ops e infraestructura – acompañando nuestro software a producciónKleer Agile Coaching & Training
735 views28 slides
Secrets of Avatar Design for VR - IMVU AWE, June 2016 by
Secrets of Avatar Design for VR - IMVU AWE, June 2016Secrets of Avatar Design for VR - IMVU AWE, June 2016
Secrets of Avatar Design for VR - IMVU AWE, June 2016Brett Durrett
1.1K views21 slides
Social VR and Gaming by
Social VR and GamingSocial VR and Gaming
Social VR and GamingBrett Durrett
859 views23 slides

More Related Content

Recently uploaded

Qualifying SaaS, IaaS.pptx by
Qualifying SaaS, IaaS.pptxQualifying SaaS, IaaS.pptx
Qualifying SaaS, IaaS.pptxSachin Bhandari
1K views8 slides
Initiating and Advancing Your Strategic GIS Governance Strategy by
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategySafe Software
176 views68 slides
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineShapeBlue
221 views19 slides
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...Jasper Oosterveld
35 views49 slides
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...ShapeBlue
173 views15 slides
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT by
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITUpdates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITShapeBlue
206 views8 slides

Recently uploaded(20)

Initiating and Advancing Your Strategic GIS Governance Strategy by Safe Software
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
Safe Software176 views
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue221 views
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by Jasper Oosterveld
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue173 views
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT by ShapeBlue
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITUpdates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
ShapeBlue206 views
LLMs in Production: Tooling, Process, and Team Structure by Aggregage
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team Structure
Aggregage42 views
Why and How CloudStack at weSystems - Stephan Bienek - weSystems by ShapeBlue
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystems
ShapeBlue238 views
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue by ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlueVNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
ShapeBlue203 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc170 views
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda... by ShapeBlue
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
ShapeBlue161 views
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And... by ShapeBlue
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
ShapeBlue106 views
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty64 views
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue by ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlueCloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
ShapeBlue135 views
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue119 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue198 views
Digital Personal Data Protection (DPDP) Practical Approach For CISOs by Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash158 views

Featured

ChatGPT and the Future of Work - Clark Boyd by
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
28K views69 slides
Getting into the tech field. what next by
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
6.6K views22 slides
Google's Just Not That Into You: Understanding Core Updates & Search Intent by
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
6.9K views99 slides
How to have difficult conversations by
How to have difficult conversations How to have difficult conversations
How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC
5.6K views19 slides
Introduction to Data Science by
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceChristy Abraham Joy
82.6K views51 slides
Time Management & Productivity - Best Practices by
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
169.8K views42 slides

Featured(20)

ChatGPT and the Future of Work - Clark Boyd by Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd28K views
Getting into the tech field. what next by Tessa Mero
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero6.6K views
Google's Just Not That Into You: Understanding Core Updates & Search Intent by Lily Ray
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray6.9K views
Time Management & Productivity - Best Practices by Vit Horky
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky169.8K views
The six step guide to practical project management by MindGenius
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
MindGenius36.7K views
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright... by RachelPearson36
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
RachelPearson3612.7K views
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present... by Applitools
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Applitools55.5K views
12 Ways to Increase Your Influence at Work by GetSmarter
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
GetSmarter401.7K views
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G... by DevGAMM Conference
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
DevGAMM Conference3.6K views
Barbie - Brand Strategy Presentation by Erica Santiago
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
Erica Santiago25.1K views
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well by Saba Software
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Saba Software25.3K views
Introduction to C Programming Language by Simplilearn
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming Language
Simplilearn8.5K views
The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr... by Palo Alto Software
The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr...The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr...
The Pixar Way: 37 Quotes on Developing and Maintaining a Creative Company (fr...
Palo Alto Software88.4K views
9 Tips for a Work-free Vacation by Weekdone.com
9 Tips for a Work-free Vacation9 Tips for a Work-free Vacation
9 Tips for a Work-free Vacation
Weekdone.com7.2K views
How to Map Your Future by SlideShop.com
How to Map Your FutureHow to Map Your Future
How to Map Your Future
SlideShop.com275.1K views

Continuous Deployment at IMVU

  • 1. Scaling with Continuous Deployment Social Developer Summit San Francisco, CA, June 29, 2010 Brett G. Durrett (@bdurrett) Vice President Engineering & Operations, IMVU, Inc. 0
  • 2. An online community where members use 3D avatars to meet new people, chat, create and have fun with their friends
  • 4. In a Nutshell What is Continuous Deployment? • Engineer commits code • 20 minutes later it is live in production • Repeat about 50 times per day 3
  • 5. Does This Really Work? “Maybe this is just viable for a single developer … your site will be down. A lot.” “It seems like the author either has no customers or very understanding customers” Responses to February 2009 posting by Timothy Fitz about Continuous Deployment at IMVU (at the time IMVU had a $12 million run rate) 4
  • 6. Benefits • Regressions easy to find, correct • Releases have zero overhead • Rapid iteration using real customer metrics 5
  • 7. Finding and Fixing Problems • Each release has few changes, 1-3 commits • Production issues correlate with check- in timestamp • No overhead to Identifying cause takes minutes producing a new release to correct issue
  • 8. CD at IMVU: Simple Overview Local tests Rollback pass, engineer (Blocks) commits code No Lots and lots of Metrics Yes Code deployed tests run good? to all servers All tests Yes Code deployed Metrics No pass? to % of servers still good? No Yes Revert commit (Blocks) Win! 7
  • 9. CD at IMVU: Detailed Overview 8
  • 10. Getting Started – Extreme Basics 1. Continuous integration system 2. Production monitoring and alerting – System performance – Business metrics – Trending is nice too  3. Simple deploy / roll-back system 9
  • 11. Commit to Making Forward Progress • Require coverage for all new code • Add coverage for bugs / regressions • Understand and fix root cause of failures 10
  • 12. Expect Some Hurdles • Production outages • New overhead – Tests – Build systems • Production outages • Frustration • Production outages (but well worth it)
  • 13. Dealing with SQL Problems • Difficult to roll-back schema • Alter statements lock / impact customers Solutions • New schema has formal review process • No alter on large tables, create new table – Copy on read – Complete migration with background job 12
  • 14. Big Features • Developed on trunk, not branch – “hidden” from customers by A/B experiment – 100% control, add QA to experiment • Deployed daily during development • Slow roll-out by increasing experiment % – Experiment closed = fully launched 13
  • 15. Test Speed Slow tests burden to scaling • Can’t run all tests in sandbox • Faster to debug on build cluster If possible… • Keep tests fast • Keep tests specific 14
  • 16. The cost of failing tests As the team grows… • More likely to have test failures • More people blocked as a result Intermittent failures very bad Eliminate the root cause 15
  • 17. Other Issues • Won’t catch issues that fail slowly – SELECT * FROM growing_table WHERE 1 • Some critical areas cause hard lock-ups – MySQL – Memcached • Lack of test coverage of older code – Not an issue if you start with test coverage 16
  • 18. Does Continuous Deployment Scale? • Technical staff ~50 people • 10 million monthly unique visitors • Peak ~115K concurrent IM client logins • It’s a real business! – $40 million run rate – Profitable and doubled revenue in 2009 17
  • 19. Newer Scaling Challenges Biggest challenges come with growth of the engineering organization 18
  • 20. SLA for Build Systems Build systems are a critical service 19
  • 21. SLA for Build Systems Build systems are a critical service Run them that way 20
  • 22. Build and Push Times 21
  • 24. Build Throughput • Initial implementation sequential builds – Scaled okay to ~20 engineers – Like trains running every 20 minutes – One “red” blocks all following builds • Solution: build isolation – Enable testing single build without deploy – “Red” build pulled, allow other builds to pass 23
  • 25. Current Systems • > 15,000 tests • 72 web build servers – 51 Linux, 21 Windows • > 6 hours of tests on average hardware • Deploy to cluster of ~700 servers 24
  • 26. Web Build Software • Custom test-file runner with JS GUI • PHP SimpleTest • Python's built-in unittest • Selenium Core with in-house API wrapper • YUITest for browser JS unit tests • Erlang Eunit 25
  • 27. Conclusion • Continuous Deployment is good • Try it – starting earlier is easier • It’s a key part of a nutritious development process 26
  • 29. More on Continuous Deployment • SD Times Leaders of Agile: Kent Beck's Principles of Agility: http://bit.ly/9wsAYv (this webinar tomorrow, June 30) • Eric Ries (Startup Lessons Learned) on Continuous Deployment: http://bit.ly/5l6X1 • Timothy Fitz (IMVU) Doing the impossible 50 times a day: http://bit.ly/OxJv 28
  • 30. Thank You! Brett G. Durrett bdurrett@imvu.com Twitter: @bdurrett IMVU was recognized as one of the “Best Places to Work” (and we’re hiring) http://www.imvu.com/jobs/ 29