Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Improving DevOps through better monitoring

2,648 views

Published on

Some developers believe that monitoring is a function of operations team. Some operations teams firmly believe that monitoring the systems they maintain is sufficient to run the business successfully. Most of them are wrong. The complexity of today’s applications have gone far and beyond the capabilities of “traditional” system-level monitoring tools and approaches and requires much broader knowledge of business and application as a whole. The goal of DevOps is to connect all aspects of application development and operations, and monitoring provides visibility and troubleshooting tools to accomplish that goal.

This talk is intended to provide real-world examples of common gaps in monitoring approach and explain why holistic instrumentation of business and functionality monitors should be a part of any project scope.

Published in: Technology
  • Be the first to comment

Improving DevOps through better monitoring

  1. 1. Improving DevOps through better monitoring Leon Fayer @papa_fire
  2. 2. Who am I ? • 20+ years of development and operations of large systems • currently Vice President at OmniTI • can be found online: • @papa_fire • http://fayerplay.com • github:lfayer
  3. 3. So … what is DevOps?
  4. 4. What is DevOps? philosophy of collaboration
  5. 5. … and more to enable business goals
  6. 6. Not DevOps dev ops
  7. 7. Not DevOps either devops
  8. 8. DevOps devops
  9. 9. General consensus Damon Edwards (http://dev2ops.org)
  10. 10. Missing link Damon Edwards (http://dev2ops.org)
  11. 11. Finally, monitoring enter monitoring
  12. 12. What to monitor? “in God we trust all others we monitor”
  13. 13. What to monitor specifically? • systems • databases • application • integration points • performance • user behavior • business processes
  14. 14. Perfect quote “ I don’t give a **** if the datacenter is on fire as long as I am still making money ” - CEO
  15. 15. Example: Twitter serves over 20 million unique visitors a day … legendary for downtime . servers are up and running . HTTP checks return 200 . tweets lost
  16. 16. Why monitor? • software is never perfect • systems are more and more complex • proactive is better than reactive • external dependency worry • …
  17. 17. Why really monitor? things change
  18. 18. … and when things change changes effect business
  19. 19. And now for real example :case study:
  20. 20. Setting the stage • online marketing company • major e-commerce component • 90+ million users • 1 billion emails/months • 300,000+ lines of code • ~ 50 physical devices • 5600+ metrics collected
  21. 21. It all starts with …
  22. 22. Let the hunt begin revenue
  23. 23. Direct cause check revenue + traffic
  24. 24. Going down the stack revenue + traffic + load time
  25. 25. Still descending revenue + traffic + load time + db
  26. 26. Got ya! revenue + traffic + load time + db + email
  27. 27. Keys to monitoring 1. understand business 2. approach top-down 3. correlate data
  28. 28. For more tips & examples: http://omniti.com/explains/monitoring-the-big-picture Questions?

×