Discover trends and patterns
What time of the day do we get the most users?
When x happened, what was the effect on y?
How many hits am I getting per hour?
How does this compare to last week? last month?
Predict future events
When will we need to add more servers? Databases?
Did the release into production fix problem x?
A few reasons:
formulas, no graph introspection, cannot push metrics, cannot feed out of sequence
metrics, ugly graphs, no API, expose system/os metrics on host via snmp, no graph
composer, no custom graphs, predefine metrics, predefine graphs, static polling interval,
unscalable, tons of work to create one graph, no 3rd party ecosystem, etc.
A Common Graphite Stack
Agent for system/hardware level metrics
Growing repository of plugins for a wide variety
disk i/o, disk space, cpu, memory, mysql,
JMX, java, Redis, file sizes, load, etc.
Write your custom plugin in python
You can write Nagios plugins that can alert off
of metrics values
Nagios can also feed graphite
performance data, events (ie: update
counter each time email is sent), etc.