Velocity NY 2014: Signal through the noise

hi.
dave@librato.com
@davejosephsen
github: djosephsen

Signal Through the Noise
dave@librato.com
@davejosephsen
github: djosephsen

Business Projects
IT Projects
Changes
Unplanned Work

Unplanned Work
(eeew Comic Sans)

THE CONTENT OF YOUR ALERTS
MATTERS

What did he just say?
•Notifications are expensive, they hurt people and productivity
•Make people work harder to send them by requiring run books
•Run books add context to alerts. Other types of context are awesome too
•Like graphs

Command Signal
Telemetry Data

1. Identify Operational Limitations
Y<160bpm
X<7m km/h

1. Identify Operational Limitations
2. Monitor those limitations
X<7m km/h
Y<160bpm

Balancer
>66% Host Availability

(Hint: one of these things measures balancing)
%hosts
alive
% IO
VS per instance

Does not measure balancing Measures balancing
66X %hosts
alive
VS .2
% IO
per instance

WE CAN REDUCE ALERTS BY
IMPROVING OUR TELEMETRY
SIGNAL

•Monitoring isn't a thing. It’s just part of the engineering process
•We’re treating it like a thing that only some types of engineers might want to
do, and that’s giving us broken feedback
•Aerospace engineers are rad, they don’t do that.
•Fix your monitoring and your alerts will follow

• Choose metrics that tell you about the things you care about.
•Alert when the things you care about hit limits you understand
•All alerts < critical go to chatrooms, ticket systems or dashboards
•Critical alers use an automated escalation service that enforces on call policy
•Escalated alerts require acknowledgement
•Escalated alerts require run book url’s and/or links to graphs of the metric

EVERYONE OWNS ALERTS
(and dashboards)

The Ultimate Recap
• Enforce a notification policy that requires context
• Make monitoring an engineering process
• Use the same signal for all metrics introspection and notification
• Encourage everyone to rely on telemetry data (graphs or it didn’t happen!)
• Everyone who collects a metric, gets keys to dashboard and alert design

Questions?
Office Hours: 1:15pm

Velocity NY 2014: Signal through the noise

Recommended

Recommended

More Related Content

Recently uploaded

Recently uploaded (20)

Featured

Featured (20)

Velocity NY 2014: Signal through the noise