Monitor your CentOS stack with Prometheus
Julien Pivotto (@roidelapluie)
February 20th, 2019
I like Open Source
I like monitoring
I like automation
... and all of that is my daily job at inuits
Disk is full
No zombie processes
Not 100 users connected
swap is empty
Questions that come after:
It's up, but is it performant?
It's down, but for everyone?
It is degraded, but are the users impacted?
Is it even relevant?
e.g. traditionally graphite
Gather fine-grained data at frequent intervals
Make them useful by labelling them; store
Analyze them to understand what is going on
Metrics ARE PART OF
Do not maintain a metrics + a "traditional
Alert from metrics directly!
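To illustrate what alerting from metrics can look like, here is a minimal sketch in Python. It is not how Prometheus implements alerting; it only mimics the idea behind the `for` clause of an alerting rule: fire when a value has stayed above a threshold for a given duration. The function name, the sample data and the thresholds are all hypothetical.

```python
def alert_firing(samples, threshold, hold_seconds):
    """Decide whether an alert should fire.

    samples: list of (timestamp_seconds, value) tuples, oldest first.
    The alert fires only if the value has been above the threshold
    without interruption for at least hold_seconds, measured up to
    the most recent sample.
    """
    breach_start = None
    for ts, value in samples:
        if value > threshold:
            # Remember when the current uninterrupted breach began.
            if breach_start is None:
                breach_start = ts
        else:
            # Any sample back under the threshold resets the breach.
            breach_start = None
    if breach_start is None:
        return False
    return samples[-1][0] - breach_start >= hold_seconds


# Hypothetical disk usage ratio, scraped every 60 seconds.
disk_usage = [(0, 0.50), (60, 0.95), (120, 0.96), (180, 0.97)]
print(alert_firing(disk_usage, threshold=0.9, hold_seconds=120))
```

Requiring the condition to hold for a while avoids paging on a single noisy sample, which is the usual reason to alert on a metric's recent history rather than on one check result.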
then Spans (opentracing)
We are in the cloud era.
Here are some buzzwords for you
cloud, API, openstack, devops, docker, bimodal,
stateless, kubernetes, orchestration, automation,
serverless, docker, humanops, ansible, continuous
deployment, cri-o, jenkins, agile, docker, red hat,
containers, virtualization, provisioning, monitoring,
Easy to configure, deploy, maintain
Designed in multiple services
Orchestration ready (dynamic config)
A Metric in Prometheus has metadata:
And lots of functions to filter, change, remove...
that metadata while fetching it.
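As a rough illustration of that metadata, the sketch below renders a sample with its labels in the Prometheus text exposition format, then drops one label before rendering it again. The helper names are hypothetical and the label values are made up. In Prometheus itself this kind of label manipulation is done with relabelling rules in the scrape configuration, not with Python code.

```python
def format_sample(name, labels, value):
    """Render one sample as a line of the text exposition format.

    Label values are not escaped here, so this sketch assumes they
    contain no quotes, backslashes or newlines.
    """
    if labels:
        pairs = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
        return f"{name}{{{pairs}}} {value}"
    return f"{name} {value}"


def drop_labels(labels, unwanted):
    """Return a copy of the label set without the unwanted label names."""
    return {k: v for k, v in labels.items() if k not in unwanted}


# Hypothetical label set for a node_exporter filesystem metric.
labels = {"instance": "web1:9100", "job": "node", "mountpoint": "/"}
line = format_sample("node_filesystem_avail_bytes", labels, 1024)
print(line)

# The same sample with the "instance" label removed.
trimmed = drop_labels(labels, {"instance"})
print(format_sample("node_filesystem_avail_bytes", trimmed, 1024))
```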
Support for multiple OS
1 service = 1 thing
Takes care of its db (time-based retention
and/or disk-space-based retention)
prometheus.spec (I have not tested it)
Creating Grafana Dashboards
Requires deep knowledge of the tools
Improved over time
Easy to share (json + online library)
Lots of data that can be explored in many ways
(subqueries are coming)
Trends and deviations are easy to calculate
Can monitor both business and technical
Very convenient to monitor any kind of stack
Multiple small pieces
For Cloud and On Premise
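The claim that trends and deviations are easy to calculate can be sketched in plain Python. PromQL offers functions such as `predict_linear` and `stddev_over_time` for this; the code below does not reproduce their exact semantics. It shows the underlying idea under simple assumptions: a least-squares line fitted through recent samples, extrapolated from the last sample, and a z-score for deviation. All function names and data are hypothetical.

```python
from statistics import mean, pstdev


def linear_fit(points):
    """Least-squares slope and intercept for (timestamp, value) points.

    Needs at least two points with distinct timestamps.
    """
    ts = [t for t, _ in points]
    vs = [v for _, v in points]
    t_mean, v_mean = mean(ts), mean(vs)
    denom = sum((t - t_mean) ** 2 for t in ts)
    slope = sum((t - t_mean) * (v - v_mean) for t, v in points) / denom
    return slope, v_mean - slope * t_mean


def predict(points, seconds_ahead):
    """Extrapolate the fitted line seconds_ahead past the last sample."""
    slope, intercept = linear_fit(points)
    return slope * (points[-1][0] + seconds_ahead) + intercept


def zscore(values, latest):
    """How many standard deviations latest is away from the mean of values."""
    sd = pstdev(values)
    return 0.0 if sd == 0 else (latest - mean(values)) / sd


# Hypothetical free disk space in GiB, sampled once per hour.
free_gib = [(0, 100.0), (3600, 90.0), (7200, 80.0)]
print(predict(free_gib, 4 * 3600))   # estimated free space four hours from now
print(zscore([8, 12], 14))           # deviation of a new value from recent ones
```

A prediction like this turns "disk is full" into "disk will be full in a few hours", which leaves time to act before users are impacted.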
I will give a
Apr. 11, 2019
talk given at CERN, February 2019, about the prometheus monitoring solution and how it can be used in CentOS stacks