Your SlideShare is downloading. ×
Metrics and Monitoring in the cloudDavid Lutz@dlutzy
The objective of metrics is tomake pretty graphs…
The objective of metrics is tomake pretty graphs...in order to understand the performanceand capacityof your systems and h...
The objective of monitoring is to…make the Operations-guy-on-call’s life hell.
The objective of monitoring is tocheck that the system is working as expectedand take action if some component isnt.
“Those who cannot remember the past arecondemned to repeat it” - George Santayana   So here’s a case study…
A long time ago in a data centre far,far away….
Complete system includes humans to run it!Human Factors Engineering.http://en.wikipedia.org/wiki/Human_factors2 x Linux En...
No Monitoring or Metrics.  Black Box. Completely blind.
MRTGNet Saint
Large Development team                              External Consultants                              ITIL Process people5...
WTF happened? It grew…       Virtualization / Cloud       Cloud / Virtualization
Approximately 400 serversStill using Nagios and Cacti15 minutes to add server manually.1 hour or more to add a new check.
And Ganglia.And External SAAS tools:New Relic. Gomez. Omniture.
#monitoringsucks
Getting it right
Getting it wrong
What’s different about the cloud?• Servers come and go• Sometimes automatically with auto-scaling• Topologies and Architec...
The problems with Nagios•   Clunky UI.•   Monolithic design.•   Hard to scale.•   Hard to add nodes    dynamically.
#doingitright
#doingitwrong
Sensu… Is it the Nagios killer?
sensu-serversensu-clientsensu-apisensu-dashboard
• JSON everywhere• Can re-use Nagios checks• Messaging oriented architecture• Designed to be driven from  Config Managemen...
?
?
David Lutz    99designs    @dlutzymeetup.com/Infrastructure-Coders/
Monitoring and metrics in the cloud
Monitoring and metrics in the cloud
Upcoming SlideShare
Loading in...5
×

Monitoring and metrics in the cloud

697

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
697
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "Monitoring and metrics in the cloud"

  1. 1. Metrics and Monitoring in the cloudDavid Lutz@dlutzy
  2. 2. The objective of metrics is tomake pretty graphs…
  3. 3. The objective of metrics is tomake pretty graphs...in order to understand the performanceand capacityof your systems and how they vary over time.
  4. 4. The objective of monitoring is to…make the Operations-guy-on-call’s life hell.
  5. 5. The objective of monitoring is tocheck that the system is working as expectedand take action if some component isnt.
  6. 6. “Those who cannot remember the past arecondemned to repeat it” - George Santayana So here’s a case study…
  7. 7. A long time ago in a data centre far,far away….
  8. 8. Complete system includes humans to run it!Human Factors Engineering.http://en.wikipedia.org/wiki/Human_factors2 x Linux Engineers1 x Network Engineer1 x Do Anything Guy1 x Developer
  9. 9. No Monitoring or Metrics. Black Box. Completely blind.
  10. 10. MRTGNet Saint
  11. 11. Large Development team External Consultants ITIL Process people5 x Linux Engineers1 x Network Engineer2 x Database AdministratorsandPart of an Infrastructure team that includedVirtualization specialistsStorage specialistsHardware specialists
  12. 12. WTF happened? It grew… Virtualization / Cloud Cloud / Virtualization
  13. 13. Approximately 400 serversStill using Nagios and Cacti15 minutes to add server manually.1 hour or more to add a new check.
  14. 14. And Ganglia.And External SAAS tools:New Relic. Gomez. Omniture.
  15. 15. #monitoringsucks
  16. 16. Getting it right
  17. 17. Getting it wrong
  18. 18. What’s different about the cloud?• Servers come and go• Sometimes automatically with auto-scaling• Topologies and Architectures change rapidly• Driven from Configuration Management Systems
  19. 19. The problems with Nagios• Clunky UI.• Monolithic design.• Hard to scale.• Hard to add nodes dynamically.
  20. 20. #doingitright
  21. 21. #doingitwrong
  22. 22. Sensu… Is it the Nagios killer?
  23. 23. sensu-serversensu-clientsensu-apisensu-dashboard
  24. 24. • JSON everywhere• Can re-use Nagios checks• Messaging oriented architecture• Designed to be driven from Config Management tools• Supports dynamic topologies
  25. 25. ?
  26. 26. ?
  27. 27. David Lutz 99designs @dlutzymeetup.com/Infrastructure-Coders/

×