• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Monitoring and metrics in the cloud
 

Monitoring and metrics in the cloud

on

  • 723 views

 

Statistics

Views

Total Views
723
Views on SlideShare
721
Embed Views
2

Actions

Likes
0
Downloads
0
Comments
0

1 Embed 2

http://coderwall.com 2

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Monitoring and metrics in the cloud Monitoring and metrics in the cloud Presentation Transcript

    • Metrics and Monitoring in the cloudDavid Lutz@dlutzy
    • The objective of metrics is tomake pretty graphs…
    • The objective of metrics is tomake pretty graphs...in order to understand the performanceand capacityof your systems and how they vary over time.
    • The objective of monitoring is to…make the Operations-guy-on-call’s life hell.
    • The objective of monitoring is tocheck that the system is working as expectedand take action if some component isnt.
    • “Those who cannot remember the past arecondemned to repeat it” - George Santayana So here’s a case study…
    • A long time ago in a data centre far,far away….
    • Complete system includes humans to run it!Human Factors Engineering.http://en.wikipedia.org/wiki/Human_factors2 x Linux Engineers1 x Network Engineer1 x Do Anything Guy1 x Developer
    • No Monitoring or Metrics. Black Box. Completely blind.
    • MRTGNet Saint
    • Large Development team External Consultants ITIL Process people5 x Linux Engineers1 x Network Engineer2 x Database AdministratorsandPart of an Infrastructure team that includedVirtualization specialistsStorage specialistsHardware specialists
    • WTF happened? It grew… Virtualization / Cloud Cloud / Virtualization
    • Approximately 400 serversStill using Nagios and Cacti15 minutes to add server manually.1 hour or more to add a new check.
    • And Ganglia.And External SAAS tools:New Relic. Gomez. Omniture.
    • #monitoringsucks
    • Getting it right
    • Getting it wrong
    • What’s different about the cloud?• Servers come and go• Sometimes automatically with auto-scaling• Topologies and Architectures change rapidly• Driven from Configuration Management Systems
    • The problems with Nagios• Clunky UI.• Monolithic design.• Hard to scale.• Hard to add nodes dynamically.
    • #doingitright
    • #doingitwrong
    • Sensu… Is it the Nagios killer?
    • sensu-serversensu-clientsensu-apisensu-dashboard
    • • JSON everywhere• Can re-use Nagios checks• Messaging oriented architecture• Designed to be driven from Config Management tools• Supports dynamic topologies
    • ?
    • ?
    • David Lutz 99designs @dlutzymeetup.com/Infrastructure-Coders/