2. confidential
Dirk Wallerstorfer
Cloud Technology Strategist @ Dynatrace
Tech enthusiast
Husband
Father
Son
Austrian (no kangaroos)
Never seen “Sound of music”
Yes, I own a lederhosn
No, I don’t know how to yodel
@wall_dirk
dirk.wallerstorfer@dynatrace.com
12. confidential
Nagios
Monitoring IT infrastructure – and more ...
Monitoring static entities
Possibility to monitor dynamically created VMs
Modify code of nova-api to reconfigure nagios after creation of new VM
Otherwise shutdown instance will be displayed as outage
Image that contains Nagios plugins || CAPS scripts to install plugins
Alternative: Nagios XI – Enterprise
Nagios Log Server
Great talk from Nagios world 2014: Monitoring OpenStack
https://www.youtube.com/watch?v=1U5fo6aPS-k
26. confidential
Monasca Agent
The Monasca Agent supports collecting metrics from a variety of sources as follows:
System metrics
Nagios plugins
Statsd
Host alive (icmp/ssh)
Process checks (# instances, memory, io, threads)
Http Endpoint checks
Service checks (mysql, rabbitmq)
OpenStack process metrics
The Agent is extensible through configuration of additional plugins, written in Python.
41. confidential
Resource capacity and utilization
OpenStack service availability/performance
Supporting services
Log analytics
Applications running on top
Dependencies
Correlation of metrics/events/data
Real user monitoring, UX affects $
PaaS
OPTIMIZING
MAINTENANCE
AVAILABILITY
COST OPTIMIZATION
BILLING
ENSURE IT MATCHES EXPECTATIONS
monitoring IT infrastructure and more
NRPE config @ controller
Nagios config @ Nagios
Beats are lightweight data shippers that you install as agents on your servers
Beats have a small footprint and use fewer system resources than Logstash.
Logstash provides a broad array of input, filter, and output plugins for collecting, enriching, and transforming data from a variety of sources.
X-Pack: security, alerting (watcher), monitoring, reporting, graph, machine learning
Cloud: hosted on AWS/GCP – scaling is easy, activate additional features on demand
modules: apache2, nginx, mysql, syslog, ...
configuration @ controller
processes @ elastic
Agents send data to APIs
Read data through CLI or Grafana
agent configuration in container for system metrics
integration in Horizon, dashboards, and ‘graph metrics’
not only correlation, also causal relation
相関関係は、原因を意味しません。
Update of the Payment Service of one of the rookie developer that are convinced that you have to write everything on your own and reinvent the wheel on a daily basis.
LET’S ASSUME YOU ARE MONITORING EVERYTHING ... in the most professional way, so you immediately notice any error or change in performance
This is one of the things that can go wrong in OpenStack ... now I don’t to talk you out of doing OpenStack, on the contrary, I want to encourage you to think maintenance and operations from the beginning, and don’t just go with
Putting it all together now.
With large environments, manual introspection and correlation and log browsing won’t cut it anymore ... people don’t scale as well ...
What do you need OpenStack for, if you 5 VMs with 8 services, and 2 applications.
you need something that is easy to deploy, adaptable, and able to handle environments of that size