Monitoring Mesos, Docker, Containers with Zabbix
By Erik Skytthe, DBC A/S, Denmark
https://dk.linkedin.com/in/erik-skytthe-444a7720
(Zabbix IRC/Forum: eskytthe)
Email: ers ADD dbc.dk
1
DBC
Central libraries
Research libraries
Education libraries
Local libraries:
...
Central material search:
(books, films, articles, research, facts …)
Central databases, web services etc.
Open Source:
https://opensource.dbc.dk/
https://github.com/DBCDK/
- Templates etc. will be published here after the conference
2
DBC
DBC
- > 600 devices (Linux, Windows, SAN, Network, Xen, VMware …)
- Software Development
- Many in-house developed applications (some very old)
- Have used Zabbix for ~6 years.
- Version 2.4; will upgrade to 3.x shortly
Erik Skytthe
- ~ DevOps … but mostly Ops
- Continuous Integration
- Continuous Deployment
- Tools and platforms to support this
3
Who is running?
Some sort of container cluster environment:
- Mesos (Apache)
- Swarm (Docker)
- Kubernetes (Google)
- Fleet (CoreOS)
- Or Other ...
● Swarm ?
● Kubernetes?
● Mesos?
4
Who is using Mesos?
It is all around you ...
● Twitter
● Apple - Siri (voice recognition)
● Netflix
● PayPal
● Cisco
● Microsoft Azure
● eBay, Airbnb, Uber, CERN …...
and
● DBC :-) … small scale (Cons: limited production experience)
http://mesos.apache.org/documentation/latest/powered-by-mesos
5
What is Mesos?
A distributed system kernel
A low-level distributed system
A cluster of resources
Offer resources from
- Cpu, memory, storage, network …
Resources consumed by Frameworks
Runs on bare metal, virtual hosts and cloud
Apache Mesos abstracts CPU, memory, storage, and other compute resources
away from machines … into a shared pool.
6
Mesos architecture
http://blog.couchbase.com/2016/may/docker-apache-mesos-marathon
Swarm
Jenkins
Elastic
Kafka ...
Docker Docker Docker Docker
Mesos:
- CPU
- Mem
- Disk
- Network
- ...
(Process)
(Process)
(Process)
7
Frameworks - “Infrastructure as code”
Orchestration of containers ~ applications / tasks
● Start, Stop, Scaling, Destroy ...
● Health checks
● Grouping
● Dependencies (Mysql container -> Zabbix container)
● Load balance / Proxy
Examples:
Marathon: Long running tasks
Chronos: “Cron” jobs
Jenkins: Up and down scaling of builds
8
Frameworks - Marathon
{
  "id": "/stagning/scrum-team/webserver",
  "cpus": 0.1,
  "mem": 64.0,
  "instances": 1,
  "container": {
    "type": "DOCKER",
    "docker": {
      "image": "docker-repo/nginx",
      "forcePullImage": true,
      "network": "BRIDGE",
      "portMappings": [
        { "containerPort": 80, "hostPort": 0 }
      ]
    }
  },
  "healthChecks": [{
    "path": "/",
    "gracePeriodSeconds": 5,
    "intervalSeconds": 5,
    "timeoutSeconds": 5,
    "maxConsecutiveFailures": 3
  }]
}
curl -X POST -H "Content-type: application/json" \
  http://mesosmasterX:8080/v2/apps -d @webserver.json
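The curl call above can also be expressed in Python. This is a minimal sketch using only the standard library; the helper name build_marathon_request and the shortened app definition are illustrative assumptions, and the request is only built, not sent.

```python
import json
import urllib.request


def build_marathon_request(marathon_url, app_definition):
    """Build (but do not send) a POST request that creates a Marathon app."""
    body = json.dumps(app_definition).encode("utf-8")
    return urllib.request.Request(
        url=marathon_url.rstrip("/") + "/v2/apps",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Shortened version of the app definition from the slide.
app = {
    "id": "/stagning/scrum-team/webserver",
    "cpus": 0.1,
    "mem": 64.0,
    "instances": 1,
}
request = build_marathon_request("http://mesosmasterX:8080", app)
# To actually submit it: urllib.request.urlopen(request, timeout=10)
```

Keeping the build step separate from the send step makes the payload easy to inspect or log before it reaches the cluster.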
9
Full stack Mesos - Mesosphere DC/OS
https://dcos.io https://mesosphere.com/
(Data Center OS)
Enterprise:
Open Source:
10
Full stack Mesos - Mantl
http://mantl.io - Cloud infrastructure Cisco
- We use it as a reference model
(Collectd) ...
11
MesosCon Amsterdam 2016
http://events.linuxfoundation.org/events/mesoscon-europe
- Fault tolerance in Mesos: http://sched.co/7n7x
- Mesos 1.0: http://sched.co/7n7s
- ...
12
MesosCon America 2016
Video sessions:
https://www.linux.com/news/mesoscon-north-america-2016-video-sessions
DevOps / container config and deploy:
- “Lessons Learned from Running Heterogeneous Workload on Apache Mesos”
- “All Marathons Need a Runner. Introducing Pheidippides”
- …
In-depth container troubleshooting: (sysdig: cli / runtime tool)
- “Monitoring Microservices: Docker, Mesos and DCOS Visibility at Scale”
- https://sysdig.com/blog/monitoring-mesos/
13
Hardware - Server monitoring
Standard zabbix agent - plain vanilla
- “Standard” Linux templates
- Autodiscovery of Disk and Networks
- Syslog to ELK
Trigger / alert note:
- Filesystem space and inodes! - because of Docker (images etc.)
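The two values behind that trigger note, space and inode usage, can be read with the standard library. The sketch below is only an illustration and not part of the Zabbix templates from the talk; the function name filesystem_usage is hypothetical, and "/" stands in for whichever filesystem holds the Docker data.

```python
import os


def filesystem_usage(path="/"):
    """Return percent used for blocks and inodes on the filesystem holding path."""
    st = os.statvfs(path)

    def pct_used(total, available):
        # Some filesystems report 0 total inodes; treat that as "not applicable".
        if total == 0:
            return None
        return round(100.0 * (total - available) / total, 1)

    return {
        # Based on space available to unprivileged users, so reserved
        # blocks count as used.
        "space_pct_used": pct_used(st.f_blocks, st.f_bavail),
        "inode_pct_used": pct_used(st.f_files, st.f_favail),
    }


usage = filesystem_usage("/")
```

Inodes matter here because many small image layers can exhaust them long before the disk is full.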
14
Getting Data Metrics
curl -s 'http://mesosmasterx:5050/metrics/snapshot' | jq '.'
{
...
"master/tasks_starting": 0,
"master/tasks_staging": 0,
"master/tasks_running": 38,
"master/tasks_lost": 0,
"master/tasks_killing": 0,
"master/tasks_killed": 770,
"master/tasks_finished": 0,
"master/tasks_failed": 129,
"master/tasks_error": 0,
"master/task_killed/source_slave/reason_executor_unregistered": 1,
"master/slaves_inactive": 0,
"master/slaves_disconnected": 0,
"master/slaves_connected": 8,
"master/slaves_active": 8,
....
}
15
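The snapshot above is a flat JSON object, so pulling out the task counters takes only a few lines. A minimal sketch, using a few of the values shown on the slide as sample data; the function name task_counters is an assumption for illustration.

```python
import json


def task_counters(snapshot):
    """Return the master/tasks_* entries, keyed by task state."""
    prefix = "master/tasks_"
    return {
        key[len(prefix):]: value
        for key, value in snapshot.items()
        if key.startswith(prefix)
    }


# A few of the entries shown in the snapshot output.
sample = json.loads("""
{
  "master/tasks_running": 38,
  "master/tasks_killed": 770,
  "master/tasks_failed": 129,
  "master/slaves_active": 8
}
""")
counters = task_counters(sample)
```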
Collectd
We chose collectd ... so far, because:
- Mantl (ready made ansible roles etc.)
- Recommendations in mesos mail list
- We are already running Graphite
- (Nice analysis possibilities)
Cons:
- No zabbix write plugin so far
- Easy to make e.g. a Python program to get data from URLs.
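As a rough idea of what such a Python program could look like, the sketch below turns a metrics snapshot into lines in the zabbix_sender input-file format (host, key, value). It is not the approach used in this talk, which goes through collectd and Graphite; the key template mesos[...] and the function name are hypothetical, and matching trapper items would have to exist in Zabbix.

```python
def to_zabbix_sender_lines(host, snapshot, key_template="mesos[{name}]"):
    """Format numeric metrics as '<host> <key> <value>' lines for zabbix_sender."""
    lines = []
    for name in sorted(snapshot):
        value = snapshot[name]
        # Skip anything that is not a plain number.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            continue
        lines.append(f"{host} {key_template.format(name=name)} {value}")
    return lines


lines = to_zabbix_sender_lines(
    "mesos-masterx",
    {"master/tasks_running": 38, "master/slaves_active": 8},
)
```

The resulting lines could be written to a file and passed to zabbix_sender with its input-file option.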
Ansible role from Mantl project - to install:
~/work/mantl/roles/collectd (git clone …)
Remark: Debian requires libpython2.7
16
Collectd data flow
(Diagram) Collectd on Mesos Master (Mesos, Marathon, zookeeper) and Mesos Agent (Mesos, ContainerX)
Graphite -> g2zproxy -> Zabbix
Grafana (GUI)
17
Collectd - write to graphite plugin
/etc/collectd/
├── collectd.conf
├── collectd.conf.d
│ ├── carbon.conf
│ ├── filters.conf
│ ├── mesos-master.conf
│ └── thresholds.conf
└── collection.conf
collectd.conf:
# Sampling interval sec.
Interval 20
<Include "/etc/collectd/collectd.conf.d">
  Filter "*.conf"
</Include>
carbon.conf:
# For version 5.1 and later using the Write Graphite plugin
FQDNLookup false
Timeout 2
ReadThreads 5
LoadPlugin write_graphite
<Plugin "write_graphite">
  <Carbon>
    Host "graphitehost name"
    Port "{{ GRAPHITE_PORT | default("2003") }}"
    Protocol "tcp"
    Prefix "{{ GRAPHITE_PREFIX | default("collectd.") }}"
    EscapeCharacter "."
    StoreRates true
    AlwaysAppendDS false
    SeparateInstances true
  </Carbon>
</Plugin>
18
Collectd mesos-master.conf
/etc/collectd/
├── collectd.conf
├── collectd.conf.d
│ ├── carbon.conf
│ ├── filters.conf
│ ├── mesos-master.conf
│ └── thresholds.conf
└── collection.conf
/usr/share/collectd/plugins/mesos-master.py
mesos-master.conf:
LoadPlugin python
<Plugin "python">
  ModulePath "/usr/share/collectd/plugins/"
  Import "mesos-master"
  <Module "mesos-master">
    Host "localhost"
    Port 5050
    Verbose false
    Version "0.22.1"
  </Module>
</Plugin>
19
Collectd mesos-agent.conf
/etc/collectd/
├── collectd.conf
├── collectd.conf.d
│ ├── carbon.conf
│ ├── filters.conf
│ ├── mesos-agent.conf
│ └── thresholds.conf
└── collection.conf
/usr/share/collectd/plugins/mesos-agent.py
Note: May also be named “mesos-slave”
Same config model for marathon, zookeeper etc.
mesos-agent.conf:
LoadPlugin python
<Plugin "python">
  ModulePath "/usr/share/collectd/plugins/"
  Import "mesos-agent"
  <Module "mesos-agent">
    Host "localhost"
    Port 5051
    Verbose false
    Version "0.22.1"
  </Module>
</Plugin>
20
Graphite - Zabbix integration
Graphite to Zabbix proxy: Blacked graphite-to-zabbix
https://github.com/blacked/graphite-to-zabbix
Crontab:
*/1 * * * * g2zproxy -z https://zabbixhost -zu {zabbixUser} -zp {zabbixPass} -g http://graphitehost ...
Graphite key: mesos-masterx.mesos-master.gauge.master_elected
Zabbix Host: mesos-masterx
Zabbix key: graphite[mesos-master.gauge.master_elected]
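The naming convention in this example (first path segment is the Zabbix host, the rest goes inside graphite[...]) can be written down as two small helpers. This is a sketch of the convention as shown on the slide, not code taken from g2zproxy; both function names are assumptions.

```python
def graphite_to_zabbix(graphite_key):
    """Split 'host.metric.path' into (zabbix_host, zabbix_item_key)."""
    host, _, metric = graphite_key.partition(".")
    if not host or not metric:
        raise ValueError(f"expected '<host>.<metric>', got {graphite_key!r}")
    return host, f"graphite[{metric}]"


def zabbix_to_graphite(zabbix_host, zabbix_key):
    """Rebuild the Graphite key from a Zabbix host and a graphite[...] item key."""
    if not (zabbix_key.startswith("graphite[") and zabbix_key.endswith("]")):
        raise ValueError(f"not a graphite[...] key: {zabbix_key!r}")
    metric = zabbix_key[len("graphite["):-1]
    return f"{zabbix_host}.{metric}"


host, key = graphite_to_zabbix("mesos-masterx.mesos-master.gauge.master_elected")
```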
Pros: Possible to use Graphite functions in zabbix requests (zabbix key)
21
Mesos Master Monitoring
Mesos metrics:
http://mesos.apache.org/documentation/latest/monitoring/
Monitoring and trigger recommendations:
https://docs.mesosphere.com/1.7/administration/monitoring/performance-monitoring/
Cons:
- Data only from elected master
- Bug? 2.8 lost tasks? (counter)
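Because cluster-wide data only comes from the elected master, it helps to know which master that is before trusting its counters. A minimal sketch, assuming one metrics snapshot per master has already been fetched and that each contains the Mesos gauge master/elected; the host names and the function name are made up for illustration.

```python
def elected_masters(snapshots):
    """Return the hosts whose snapshot reports master/elected == 1."""
    return sorted(
        host
        for host, snapshot in snapshots.items()
        if snapshot.get("master/elected") == 1
    )


# Hypothetical snapshots from three masters.
snapshots = {
    "mesos-master1": {"master/elected": 0.0, "master/tasks_running": 0},
    "mesos-master2": {"master/elected": 1.0, "master/tasks_running": 38},
    "mesos-master3": {"master/elected": 0.0, "master/tasks_running": 0},
}
leaders = elected_masters(snapshots)
```

A result with zero or more than one host is itself worth alerting on.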
etcd:
https://github.com/shamil/zabbix-etcd
zookeeper:
https://github.com/zhangqin/zookeeper-zabbix-template
Recommendations included in item description:
22
Mesos Master template
23
Mesos Master Monitoring
24
Mesos Agent (Slave) Monitoring
- Plus process monitoring e.g. mesos, docker, etcd
25
Marathon Framework Monitoring
http://mesosmasterx:8080/metrics
Collectd marathon plugin: Not working with authentication and SSL!
https://github.com/klynch/collectd-marathon
Marathon metrics flags
--reporter_graphite : tcp://graphitehost:2003?prefix=marathon-test&interval=10
Report metrics to Graphite as defined by the URL.
Cons:
- Bug in tasks metric
- Some metric names not updated in documentation
https://docs.mesosphere.com/1.7/administration/monitoring/performance-monitoring
26
Container Monitoring
Construction work ahead !
Note: Huge topic - Need separate presentation
27
Container Monitoring
How to represent containers ~ applications / tasks in Zabbix ?
How to show them …? In a structural way ? …
How to find them?
How to aggregate and analyse them?
28
Container Monitoring
Evolution of the standard application stack. (Off-the-shelf = e.g. Java EE)
Source: Datadog - https://www.datadoghq.com/blog/the-docker-monitoring-problem/
29
Container Monitoring
“we need a new approach, one that does not treat everything as a host.”
- Containers can be on any agent / node in your cluster
- Containers can be in 1 ~ many instances
- Containers can start and get destroyed within milliseconds or seconds
“Treat containers as hosts that come and go every few minutes.
In this case your life is miserable because the monitoring system always thinks half of
your infrastructure is on fire”
“Monitor all layers of your stack together, so that you can see what is happening
everywhere, at the same time, with no gaps”
“Tag your containers so that you can monitor them as queryable sets rather than as
individuals”
30
Container Monitoring
“Treat containers as hosts that come and go every few minutes”.
“In this case your life is miserable because the monitoring system always thinks
half of your infrastructure is on fire”
Not necessarily true
- Long running containers
- Depend on triggers (if any)
- Depend on your use
Our solution: Treat containers as hosts in zabbix
31
Container Monitoring
Test of two mesos collectd solutions:
https://github.com/bobrik/collectd-mesos-tasks
https://github.com/rayrod2030/collectd-mesos
Cons:
- Open Mesos issue: incorrect CPU metrics
- Could not get them to work (out of the box) with newer Mesos versions
32
Container Monitoring
Solution 1: Monitoringartist (Jan): zabbix-docker-monitoring
https://github.com/monitoringartist/zabbix-docker-monitoring
Book: "Monitoring Docker"
By Russ McKendrick
Chapter 4 - Zabbix install etc.
Cons:
- Limited version: “provides only docker metrics, TLS features and Zabbix
agent server IP check are disabled”
Container name
CPU system time
CPU user time
Used cache memory
Used RSS memory
Used swap
Note: See also monitoring analytics and other:
https://hub.docker.com/r/monitoringartist/monitoring-analytics
33
Container Monitoring
Solution 2: Bobrik: Collectd-docker (run as docker container)
https://github.com/bobrik/collectd-docker
Cons:
- Only docker
- Need to modify graphite to zabbix proxy tool (g2zproxy)
Pros:
- Possible to make a “zabbix container discover” solution
- Reliable cpu metrics
- Use of application and task tagging
- Only containers with specific labels will be monitored
- Nice Grafana dashboard
34
Container Monitoring
Group?
Application
Task
35
Container Monitoring
Host, App, Task
36
Container Monitoring - Grouping
Marathon: Group name hierarchy, e.g.:
/ Environment / team / application / [ sub application ] / ...
/prod/dataio/dbc-glassfish-harvester
/stagning/dataio/dbc-glassfish-harvester
Task = Running container / instance of application:
dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407
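The two naming schemes above are regular enough to parse mechanically. The following sketch splits a Marathon app ID into environment, team and application, and derives the application name from a task ID. The class and function names are illustrative assumptions, and any extra path levels (sub applications) are simply kept as part of the application name.

```python
from typing import NamedTuple


class AppId(NamedTuple):
    environment: str
    team: str
    application: str


def parse_app_id(app_id):
    """Split '/environment/team/application[/sub...]' into its parts."""
    parts = [part for part in app_id.split("/") if part]
    if len(parts) < 3:
        raise ValueError(f"expected /environment/team/application, got {app_id!r}")
    return AppId(parts[0], parts[1], "/".join(parts[2:]))


def application_from_task(task_id):
    """Return the application name, i.e. the text before the first dot."""
    return task_id.split(".", 1)[0]


app = parse_app_id("/prod/dataio/dbc-glassfish-harvester")
name = application_from_task(
    "dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407"
)
```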
Application / task relationship:
- One or more different tasks per application
- Many instances of one task
- Depends on how you want to group them ...
37
Container Monitoring - tagging / labels
Bobrik: Collectd-docker: Only containers with specific labels will be monitored
Application / Task: Logical grouping of containers
- Your own choice of grouping!
Labels: collectd_docker_app = Application (= zabbixhost), collectd_docker_task = Task
docker run \
  -d \
  --name zabbix-db \
  --env="MARIADB_USER=zabbix" \
  --env="MARIADB_PASS=my_password" \
  --label collectd_docker_app="zabbix" \
  --label collectd_docker_task="zabbixdb" \
  monitoringartist/zabbix-db-mariadb
38
Container Monitoring
How to represent containers ~ applications / tasks in Zabbix ?
As hosts
How to show them …? In a structural way ? …
As hosts. But could be better - use of tags etc.
How to find them?
As host search. Needed: Use of tags / labels
How to aggregate and analyse them?
Needed: ~ top10 etc. tools in Zabbix
(Could be nice: image usage, container usage etc. …)
Old service-host discussion?
Container autodiscovery?
Ref: sysdig cloud
39
Log Monitoring
The Twelve-Factor App: https://12factor.net/
https://12factor.net/logs:
“A twelve-factor app never concerns itself
with routing or storage of its output stream.
It should not attempt to write to or manage
logfiles. Instead, each running process writes
its event stream, unbuffered, to stdout”
40
Log Monitoring - Log flow
(Diagram) Log sources: Mesos Master (Mesos, Marathon, zookeeper), Mesos Agent (Mesos, Docker, ContainerX), Syslog
Log senders: Beaver / Logstash
Other components shown: Kafka cluster, Logstash, Elastic cluster, Kibana (GUI), Zabbix
41
Log Monitoring - Logstash
Logstash - as log sender tool
https://www.elastic.co/guide/en/logstash/2.3/introduction.html
Mantl project
logstash config for mesos agent (ansible):
~/mantl/roles/logstash/templates/logstash.conf.j2:
file {
  path => [ "/logs/slaves/*/frameworks/*/executors/*/runs/*/stdout",
            "/logs/slaves/*/frameworks/*/executors/*/runs/*/stderr" ]
  type => "mesos-framework-logs"
}
42
Log Monitoring - Beaver
Beaver - python log sender tool
https://github.com/python-beaver/python-beaver
Pros:
- Does not require Java
- Easy to configure
Installation with ansible:
https://github.com/azavea/ansible-beaver
Note:
Docker log router: Logspout
https://github.com/gliderlabs/logspout
43
Log Monitoring - Beaver config
/etc/beaver/
├── beaver.ini
└── conf.d
└── mesos.conf
[beaver]
kafka_topic: elkprod
logstash_version: 1
kafka_hosts: kafkaX:9092,kafkaY:9092,kafkaZ:9092,kafkaXX:9092
; Only
queue_timeout: 43200
transport: kafka
44
Log Monitoring - Beaver config - mesos.conf
[/var/log/mesos/mesos-*.WARNING]
type: mesos
tags: mesos-cluster
[/var/log/mesos/mesos-*.INFO]
type: mesos
tags: mesos-cluster
[/var/log/mesos/mesos-*.ERROR]
type: mesos
tags: mesos-cluster
[/var/log/mesos/mesos-*.FATAL]
type: mesos
tags: mesos-cluster
[/data/mesos/slaves/*/frameworks/*/executors/*/runs/*/std*]
exclude: (latest)
type: mesos
tags: mesos-container
45
Log Monitoring - mesos-cluster
46
Log Monitoring - Error logs
47
Log Monitoring - mesos-container
48
Log Monitoring - Logstash
Logstash as log management tool
# "file" from beaver:
# /data/mesos/slaves/a7d7fc82-c8de-4aff-84b1-f1d5c578efc7-S5/frameworks/a141ab38-8082-4c50-b04f-ff762b850aa2-0000/executors/prod_dataio_dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407...
grok {
  match => { "file" =>
    '/data/mesos/slaves/%{DATA}/frameworks/%{DATA}/executors/%{DATA:tags}_%{DATA:group}_%{DATA:task}/runs/%{GREEDYDATA}' }
}
# task: dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407
grok {
  # Anchored, with the dot escaped, so "application" is the text before the first dot
  match => { "task" => '^%{DATA:application}\.%{GREEDYDATA}$' }
}
(Cons: grok filter needs revision)
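For comparison, the same field extraction can be sketched with a plain Python regular expression, which is easy to test outside Logstash. This is not the filter used in production; the placeholders SLAVE_ID, FRAMEWORK_ID and RUN_ID in the sample path are assumptions, and the pattern breaks if an environment or team name contains an underscore.

```python
import re

# Mirrors the grok pattern: <tags>_<group>_<task> is the executor directory name.
EXECUTOR_PATH = re.compile(
    r"^/data/mesos/slaves/[^/]+/frameworks/[^/]+/executors/"
    r"(?P<tags>[^_/]+)_(?P<group>[^_/]+)_(?P<task>[^/]+)/runs/.+$"
)


def parse_log_path(path):
    """Return tags, group, task and application for a container log path, or None."""
    match = EXECUTOR_PATH.match(path)
    if match is None:
        return None
    fields = match.groupdict()
    # The application is the task name without the instance suffix after the first dot.
    fields["application"] = fields["task"].split(".", 1)[0]
    return fields


fields = parse_log_path(
    "/data/mesos/slaves/SLAVE_ID/frameworks/FRAMEWORK_ID/executors/"
    "prod_dataio_dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407"
    "/runs/RUN_ID/stdout"
)
```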
49
Log Monitoring - Logstash - zabbix
50
Log Monitoring - Logstash - zabbix
# Look for java Exceptions
filter {
  if [message] =~ "Exception" {
    mutate {
      add_field => {
        "[itemkey]" => "applog"
        "[alertmsg]" => "%{task}: %{message}"
      }
    }
  }
}
output {
  if [alertmsg] {
    zabbix {
      zabbix_server_host => "zabbixhost"
      zabbix_host => "[application]"
      # Single value also possible
      multi_value => [
        "[itemkey]", "[alertmsg]"
      ]
    }
  }
}
Zabbix output plugin for Logstash: Zabbix conference 2015 - by untergreek:
http://www.slideshare.net/Zabbix/aaron-mildenstein-using-logstash-with-zabbix
Note: Similar config used for syslog to zabbix
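The decision logic of that filter/output pair is small: an event whose message contains "Exception" becomes one Zabbix value on the host named after the application. The sketch below restates it in Python for clarity only; it is not how Logstash runs it, and the function name, the dict layout and the sample message are assumptions.

```python
def to_zabbix_alert(event):
    """Map a log event to a Zabbix item value, or None if it is not an exception."""
    message = event.get("message", "")
    if "Exception" not in message:
        return None
    return {
        "zabbix_host": event["application"],  # containers are hosts in Zabbix
        "key": "applog",
        "value": f"{event['task']}: {message}",
    }


alert = to_zabbix_alert({
    "application": "dbc-glassfish-harvester",
    "task": "dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407",
    "message": "java.lang.NullPointerException at Example.run",
})
```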
51
Other solutions?
Data to kafka?
- https://github.com/hengyunabc/kafka-zabbix
Data to Elastic?
Data via StatsD?
Collectd zabbix write plugin?
Zabbix container module?
52

 
Zabbix Conference LatAm 2016 - Douglas Esteves - Zabbix at UNICAMP
Zabbix Conference LatAm 2016 - Douglas Esteves - Zabbix at UNICAMPZabbix Conference LatAm 2016 - Douglas Esteves - Zabbix at UNICAMP
Zabbix Conference LatAm 2016 - Douglas Esteves - Zabbix at UNICAMP
 
Zabbix Conference LatAm 2016 - Filipe Paternot - Zbx@Globo Automation+Integra...
Zabbix Conference LatAm 2016 - Filipe Paternot - Zbx@Globo Automation+Integra...Zabbix Conference LatAm 2016 - Filipe Paternot - Zbx@Globo Automation+Integra...
Zabbix Conference LatAm 2016 - Filipe Paternot - Zbx@Globo Automation+Integra...
 
Zabbix Conference LatAm 2016 - Andre Deo - Zabbix Brazil Community
Zabbix Conference LatAm 2016 - Andre Deo - Zabbix Brazil CommunityZabbix Conference LatAm 2016 - Andre Deo - Zabbix Brazil Community
Zabbix Conference LatAm 2016 - Andre Deo - Zabbix Brazil Community
 
Realtime statistics using Java, Kafka and Graphite
Realtime statistics using Java, Kafka and GraphiteRealtime statistics using Java, Kafka and Graphite
Realtime statistics using Java, Kafka and Graphite
 

Erik Skytthe - Monitoring Mesos, Docker, Containers with Zabbix | ZabConf2016

  • 1. Monitoring Mesos, Docker, Containers with Zabbix By Erik Skytthe, DBC A/S, Denmark https://dk.linkedin.com/in/erik-skytthe-444a7720 (Zabbix IRC/Forum: eskytthe) Email: ers ADD dbc.dk 1
  • 2. DBC Central libraries Research libraries Education libraries Local libraries: ... Central material search: (books, films, articles, research, facts …) Central databases, web services etc. Open Source: https://opensource.dbc.dk/ https://github.com/DBCDK/ - Templates etc. here after conference 2
  • 3. DBC DBC - > 600 devices (Linux, Windows, SAN, Network, Xen, VMware …) - Software Development - Many in-house developed applications (some very old) - Have used Zabbix for ~6 years. - Version 2.4 ~ will update to 3.X shortly Erik Skytthe - ~ DevOps … but mostly Ops - Continuous Integration - Continuous Deployment - Tools and platforms to support this 3
  • 4. Who is running? Some sort of container cluster environment: - Mesos (Apache) - Swarm (Docker) - Kubernetes (Google) - Fleet (CoreOS) - Or Other ... ● Swarm ? ● Kubernetes? ● Mesos? 4
  • 5. Who is using Mesos? It is all around you ... ● Twitter ● Apple - Siri (voice recognition) ● Netflix ● PayPal ● Cisco ● Microsoft Azure ● eBay, Airbnb, Uber, CERN …... and ● DBC :-) … small scale Cons: not long prod experience http://mesos.apache.org/documentation/latest/powered-by-mesos 5
  • 6. What is Mesos? A distributed system kernel A low level distributed system A cluster of resources Offer resources from - Cpu, memory, storage, network … Resources consumed by Frameworks Runs on bare metal, virtual hosts and cloud Apache Mesos abstracts CPU, memory, storage, and other compute resources away from machines … into a shared pool. 6
  • 7. Mesos architecture http://blog.couchbase.com/2016/may/docker-apache-mesos-marathon Swarm Jenkins Elastic Kafka ... Docker Docker Docker Docker Mesos: - CPU - Mem - Disk - Network - ... (Process) (Process) (Process) 7
  • 8. Frameworks - “Infrastructure as code” Orchestration of containers ~ applications / tasks ● Start, Stop, Scaling, Destroy ... ● Health checks ● Grouping ● Dependencies (Mysql container -> Zabbix container) ● Load balance / Proxy Examples: Marathon: Long running tasks Chronos: “Cron” jobs Jenkins: Up and down scaling of builds 8
  • 9. Frameworks - Marathon { "id": "/stagning/scrum-team/webserver", "cpus": 0.1, "mem": 64.0, "instances": 1, "container": { "type": "DOCKER", "docker": { "image": "docker-repo/nginx", "forcePullImage": true, "network": "BRIDGE", "portMappings": [{ "containerPort": 80, "hostPort": 0 }] } }, "healthChecks": [{ "path": "/", "gracePeriodSeconds": 5, "intervalSeconds": 5, "timeoutSeconds": 5, "maxConsecutiveFailures": 3 }] } curl -X POST -H "Content-type: application/json" http://mesosmasterX:8080/v2/apps -d @webserver.json 9
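The Marathon app definition and the `curl` call on slide 9 can also be sketched in Python, which may be handy when deployments are scripted. This is a minimal sketch, not the author's tooling: the helper names `build_app`, `build_request` and `deploy` are hypothetical, and it only assumes the `/v2/apps` endpoint and the JSON fields shown on the slide.

```python
import json
import urllib.request


def build_app(app_id, image, cpus=0.1, mem=64.0, instances=1):
    """Return a Marathon app definition shaped like the one on slide 9."""
    return {
        "id": app_id,
        "cpus": cpus,
        "mem": mem,
        "instances": instances,
        "container": {
            "type": "DOCKER",
            "docker": {
                "image": image,
                "forcePullImage": True,
                "network": "BRIDGE",
                # hostPort 0 lets Marathon assign a free port on the agent.
                "portMappings": [{"containerPort": 80, "hostPort": 0}],
            },
        },
        "healthChecks": [{
            "path": "/",
            "gracePeriodSeconds": 5,
            "intervalSeconds": 5,
            "timeoutSeconds": 5,
            "maxConsecutiveFailures": 3,
        }],
    }


def build_request(marathon_url, app):
    """Build the POST request that the curl command on the slide sends."""
    body = json.dumps(app).encode("utf-8")
    return urllib.request.Request(
        marathon_url.rstrip("/") + "/v2/apps",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def deploy(marathon_url, app, timeout=10):
    """Send the app definition to Marathon and return its JSON reply."""
    request = build_request(marathon_url, app)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `deploy("http://mesosmasterX:8080", build_app("/stagning/scrum-team/webserver", "docker-repo/nginx"))` would correspond to the `curl -X POST` line on the slide; the host name is the placeholder used there.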
  • 10. Full stack Mesos - Mesosphere DC/OS https://dcos.io https://mesosphere.com/ (Data Center OS) Enterprise: Open Source: 10
  • 11. Full stack Mesos - Mantl http://mantl.io - Cloud infrastructure from Cisco - We use it as a reference model (collectd) ...
  • 12. MesosCon Amsterdam 2016 http://events.linuxfoundation.org/events/mesoscon-europe - Fault tolerance in Mesos: http://sched.co/7n7x - Mesos 1.0: http://sched.co/7n7s - ...
  • 13. MesosCon America 2016 Video sessions: https://www.linux.com/news/mesoscon-north-america-2016-video-sessions DevOps / container config and deploy: - “Lessons Learned from Running Heterogeneous Workload on Apache Mesos” - “All Marathons Need a Runner. Introducing Pheidippides” - … In-depth container troubleshooting (sysdig: CLI / runtime tool): - “Monitoring Microservices: Docker, Mesos and DCOS Visibility at Scale” - https://sysdig.com/blog/monitoring-mesos/
  • 14. Hardware - Server monitoring Standard Zabbix agent - plain vanilla - “Standard” Linux templates - Autodiscovery of disks and networks - Syslog to ELK. Trigger / alert note: - Watch filesystem space and inodes, because of Docker (images etc.)
  • 15. Getting Data Metrics
    curl -s 'http://mesosmasterx:5050/metrics/snapshot' | jq '.'
    {
      ...
      "master/tasks_starting": 0,
      "master/tasks_staging": 0,
      "master/tasks_running": 38,
      "master/tasks_lost": 0,
      "master/tasks_killing": 0,
      "master/tasks_killed": 770,
      "master/tasks_finished": 0,
      "master/tasks_failed": 129,
      "master/tasks_error": 0,
      "master/task_killed/source_slave/reason_executor_unregistered": 1,
      "master/slaves_inactive": 0,
      "master/slaves_disconnected": 0,
      "master/slaves_connected": 8,
      "master/slaves_active": 8,
      ....
    }
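As a rough illustration of how such a snapshot could be consumed from a script, the sketch below fetches the same endpoint and keeps only the `master/tasks_*` counters. The function names are hypothetical and the host name is the placeholder from the slide.

```python
import json
import urllib.request


def fetch_snapshot(base_url, timeout=5):
    """Fetch /metrics/snapshot from a Mesos master and decode the JSON body."""
    url = base_url.rstrip("/") + "/metrics/snapshot"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def task_counters(snapshot):
    """Return the master/tasks_* metrics keyed by task state."""
    prefix = "master/tasks_"
    return {
        key[len(prefix):]: value
        for key, value in snapshot.items()
        if key.startswith(prefix)
    }
```

For example, `task_counters(fetch_snapshot("http://mesosmasterx:5050"))` would return a dictionary such as `{"running": 38, "failed": 129, ...}` for the values shown above.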
  • 16. Collectd We chose collectd, so far, because: - Mantl (ready-made Ansible roles etc.) - Recommendations on the Mesos mailing list - We are already running Graphite - (Nice analysis possibilities) Cons: - No Zabbix write plugin so far - It is easy to write e.g. a Python program that gets the data from the URLs. Ansible role from the Mantl project, to install: ~/work/mantl/roles/collectd (git clone …) Remark: Debian requires libpython2.7
  • 17. Collectd data flow [Diagram: Mesos Master, Marathon (ZooKeeper), Mesos Agent and ContainerX send metrics to Graphite; g2zproxy connects Graphite and Zabbix; Grafana (GUI) sits on top of Graphite]
  • 18. Collectd - write to graphite plugin
    /etc/collectd/
    ├── collectd.conf
    ├── collectd.conf.d
    │   ├── carbon.conf
    │   ├── filters.conf
    │   ├── mesos-master.conf
    │   └── thresholds.conf
    └── collection.conf
    collectd.conf:
    # Sampling interval sec.
    Interval 20
    <Include "/etc/collectd/collectd.conf.d">
      Filter "*.conf"
    </Include>
    carbon.conf:
    # For version 5.1 and later using the Write Graphite plugin
    FQDNLookup false
    Timeout 2
    ReadThreads 5
    LoadPlugin write_graphite
    <Plugin "write_graphite">
      <Carbon>
        Host "graphitehost name"
        Port "{{ GRAPHITE_PORT | default("2003") }}"
        Protocol "tcp"
        Prefix "{{ GRAPHITE_PREFIX | default("collectd.") }}"
        EscapeCharacter "."
        StoreRates true
        AlwaysAppendDS false
        SeparateInstances true
      </Carbon>
    </Plugin>
  • 19. Collectd mesos-master.conf
    /etc/collectd/
    ├── collectd.conf
    ├── collectd.conf.d
    │   ├── carbon.conf
    │   ├── filters.conf
    │   ├── mesos-master.conf
    │   └── thresholds.conf
    └── collection.conf
    /usr/share/collectd/plugins/mesos-master.py
    mesos-master.conf:
    LoadPlugin python
    <Plugin "python">
      ModulePath "/usr/share/collectd/plugins/"
      Import "mesos-master"
      <Module "mesos-master">
        Host "localhost"
        Port 5050
        Verbose false
        Version "0.22.1"
      </Module>
    </Plugin>
  • 20. Collectd mesos-agent.conf
    /etc/collectd/
    ├── collectd.conf
    ├── collectd.conf.d
    │   ├── carbon.conf
    │   ├── filters.conf
    │   ├── mesos-agent.conf
    │   └── thresholds.conf
    └── collection.conf
    /usr/share/collectd/plugins/mesos-agent.py
    Note: May also be named “mesos-slave”. Same config model for Marathon, ZooKeeper etc.
    mesos-agent.conf:
    LoadPlugin python
    <Plugin "python">
      ModulePath "/usr/share/collectd/plugins/"
      Import "mesos-agent"
      <Module "mesos-agent">
        Host "localhost"
        Port 5051
        Verbose false
        Version "0.22.1"
      </Module>
    </Plugin>
  • 21. Graphite - Zabbix integration
    Graphite to Zabbix proxy: blacked/graphite-to-zabbix https://github.com/blacked/graphite-to-zabbix
    Crontab:
    */1 * * * * g2zproxy -z https://zabbixhost -zu {zabbixUser} -zp {zabbixPass} -g http://graphitehost ...
    Graphite key: mesos-masterx.mesos-master.gauge.master_elected
    Zabbix host: mesos-masterx
    Zabbix key: graphite[mesos-master.gauge.master_elected]
    Pros: Possible to use Graphite functions in Zabbix requests (Zabbix key)
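The naming convention in this example (Graphite key = Zabbix host + "." + the text inside `graphite[...]`) can be written down as two small helpers. This is only a sketch of the convention as it appears on the slide; it is not taken from the g2zproxy source, and the function names are hypothetical.

```python
def graphite_to_zabbix(graphite_key):
    """Split 'host.metric.path' into (zabbix_host, zabbix_item_key)."""
    host, _, metric = graphite_key.partition(".")
    if not host or not metric:
        raise ValueError("expected '<host>.<metric>', got %r" % graphite_key)
    return host, "graphite[%s]" % metric


def zabbix_to_graphite(zabbix_host, zabbix_key):
    """Rebuild the Graphite key from a Zabbix host and a graphite[...] item key."""
    if not (zabbix_key.startswith("graphite[") and zabbix_key.endswith("]")):
        raise ValueError("not a graphite[...] item key: %r" % zabbix_key)
    metric = zabbix_key[len("graphite["):-1]
    return "%s.%s" % (zabbix_host, metric)
```

Keeping the host as the first path element in Graphite is what makes this mapping unambiguous in both directions.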
  • 22. Mesos Master Monitoring Mesos metrics: http://mesos.apache.org/documentation/latest/monitoring/ Monitoring and trigger recommendations: https://docs.mesosphere.com/1.7/administration/monitoring/performance-monitoring/ Cons: - Data only from the elected master - Bug? 2.8 lost tasks? (counter) etcd: https://github.com/shamil/zabbix-etcd ZooKeeper: https://github.com/zhangqin/zookeeper-zabbix-template Recommendations included in item description:
  • 25. Mesos Agent (Slave) Monitoring - Plus process monitoring, e.g. mesos, docker, etcd
  • 26. Marathon Framework Monitoring http://mesosmasterx:5050/metrics Collectd Marathon plugin: not working with authentication and SSL! https://github.com/klynch/collectd-marathon Marathon metrics flag: --reporter_graphite tcp://graphitehost:2003?prefix=marathon-test&interval=10 (report metrics to Graphite as defined by the URL). Cons: - Bug in tasks metric - Some metric names not updated in documentation https://docs.mesosphere.com/1.7/administration/monitoring/performance-monitoring
  • 27. Container Monitoring Construction work ahead! Note: Huge topic - needs a separate presentation
  • 28. Container Monitoring How to represent containers ~ applications / tasks in Zabbix? How to show them in a structured way? How to find them? How to aggregate and analyse them?
  • 29. Container Monitoring Evolution of the standard application stack. (Off-the-shelf = e.g. Java EE) Source: Datadog - https://www.datadoghq.com/blog/the-docker-monitoring-problem/
  • 30. Container Monitoring “we need a new approach, one that does not treat everything as a host.” - Containers can be on any agent / node in your cluster - Containers can run in one to many instances - Containers can start and get destroyed within milliseconds / seconds. “Treat containers as hosts that come and go every few minutes. In this case your life is miserable because the monitoring system always thinks half of your infrastructure is on fire” “Monitor all layers of your stack together, so that you can see what is happening everywhere, at the same time, with no gaps” “Tag your containers so that you can monitor them as queryable sets rather than as individuals”
  • 31. Container Monitoring “Treat containers as hosts that come and go every few minutes. In this case your life is miserable because the monitoring system always thinks half of your infrastructure is on fire” Not necessarily true: - Long-running containers - Depends on triggers (if any) - Depends on your use. Our solution: treat containers as hosts in Zabbix
  • 32. Container Monitoring Test of two Mesos collectd solutions: https://github.com/bobrik/collectd-mesos-tasks https://github.com/rayrod2030/collectd-mesos Cons: - Open Mesos issue: incorrect CPU metrics - Could not get them to work (out of the box) with newer Mesos versions
  • 33. Container Monitoring Solution 1: Monitoringartist (Jan): zabbix-docker-monitoring https://github.com/monitoringartist/zabbix-docker-monitoring Book: "Monitoring Docker" by Russ McKendrick, Chapter 4 - Zabbix install etc. Cons: - Limited version: “provides only docker metrics, TLS features and Zabbix agent server IP check are disabled” Metrics: container name, CPU system time, CPU user time, used cache memory, used RSS memory, used swap. Note: See also monitoring analytics and other: https://hub.docker.com/r/monitoringartist/monitoring-analytics
  • 34. Container Monitoring Solution 2: Bobrik: collectd-docker (runs as a Docker container) https://github.com/bobrik/collectd-docker Cons: - Only Docker - Need to modify the Graphite-to-Zabbix proxy tool (g2zproxy) Pros: - Possible to make a “Zabbix container discovery” solution - Reliable CPU metrics - Use of application and task tagging - Only containers with specific labels will be monitored - Nice Grafana dashboard
  • 37. Container Monitoring - Grouping Marathon: group naming hierarchy, e.g.: / environment / team / application / [ sub application ] / ... /prod/dataio/dbc-glassfish-harvester /stagning/dataio/dbc-glassfish-harvester Task = running container / instance of application: dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407 Application / task relationship: - One or more different tasks per application - Many instances of one task - Depends on how you want to group them ...
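To make the naming scheme concrete, here is a small sketch that splits a Marathon app id into the levels named on the slide and separates a task name into application and instance id. The function names and dictionary keys are hypothetical; the layout "/environment/team/application/..." is the convention from the slide, not something Marathon enforces.

```python
def parse_app_id(app_id):
    """Split '/environment/team/application[/sub...]' into named levels."""
    parts = [p for p in app_id.split("/") if p]
    if len(parts) < 3:
        raise ValueError("expected at least /environment/team/application")
    return {
        "environment": parts[0],
        "team": parts[1],
        "application": parts[2],
        "sub_applications": parts[3:],
    }


def parse_task_name(task_name):
    """Split '<application>.<instance id>' at the first dot."""
    application, _, instance_id = task_name.partition(".")
    return application, instance_id
```

With this split, the application part could serve as the Zabbix host name and the environment and team parts as host groups, which is one possible way to group them.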
  • 38. Container Monitoring - tagging / labels Bobrik: collectd-docker: only containers with specific labels will be monitored. Application / Task: logical grouping of containers - your own choice of grouping!
    docker run -d --name zabbix-db --env="MARIADB_USER=zabbix" --env="MARIADB_PASS=my_password" --label collectd_docker_app="zabbix" --label collectd_docker_task="zabbixdb" monitoringartist/zabbix-db-mariadb
    collectd_docker_app = Application (= Zabbix host); collectd_docker_task = Task
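One way the labels could feed a Zabbix container discovery is sketched below: it turns a list of container label dictionaries into low-level discovery JSON and skips containers without both labels. The macro names `{#APP}` and `{#TASK}` and the function name are assumptions made for illustration; how the label dictionaries are collected (for example from the Docker API) is left out.

```python
import json

APP_LABEL = "collectd_docker_app"
TASK_LABEL = "collectd_docker_task"


def discovery_json(containers):
    """Build Zabbix low-level discovery JSON from container label dicts.

    Containers missing either label are ignored, mirroring the rule that
    only labelled containers are monitored.
    """
    rows = []
    for labels in containers:
        if APP_LABEL in labels and TASK_LABEL in labels:
            rows.append({
                "{#APP}": labels[APP_LABEL],    # hypothetical macro name
                "{#TASK}": labels[TASK_LABEL],  # hypothetical macro name
            })
    return json.dumps({"data": rows})
```

The `{"data": [...]}` wrapper is the discovery format used by the Zabbix 2.x and 3.x series mentioned earlier in the talk.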
  • 39. Container Monitoring How to represent containers ~ applications / tasks in Zabbix? As hosts. How to show them in a structured way? As hosts, but it could be better: use of tags etc. How to find them? By host search. Needed: use of tags / labels. How to aggregate and analyse them? Needed: ~ top 10 etc. tools in Zabbix (could be nice: image usage, container usage etc. …) Old service-host discussion? Container autodiscovery? Ref: Sysdig Cloud
  • 40. Log Monitoring The Twelve-Factor App: https://12factor.net/ https://12factor.net/logs: “A twelve-factor app never concerns itself with routing or storage of its output stream. It should not attempt to write to or manage logfiles. Instead, each running process writes its event stream, unbuffered, to stdout”
  • 41. Log Monitoring - Log flow [Diagram components: Mesos Master, Marathon (ZooKeeper), Mesos Agent, Docker ContainerX, Syslog, Beaver / Logstash, Kafka cluster, Logstash, Elastic cluster, Kibana (GUI), Zabbix]
  • 42. Log Monitoring - Logstash Logstash as log sender tool https://www.elastic.co/guide/en/logstash/2.3/introduction.html Mantl project Logstash config for Mesos agent (Ansible): ~/mantl/roles/logstash/templates/logstash.conf.j2:
    file {
      path => [
        "/logs/slaves/*/frameworks/*/executors/*/runs/*/stdout",
        "/logs/slaves/*/frameworks/*/executors/*/runs/*/stderr"
      ]
      type => "mesos-framework-logs"
    }
  • 43. Log Monitoring - Beaver Beaver - Python log sender tool https://github.com/python-beaver/python-beaver Pros: - Does not require Java - Easy to configure. Installation with Ansible: https://github.com/azavea/ansible-beaver Note: Docker log router: Logspout https://github.com/gliderlabs/logspout
  • 44. Log Monitoring - Beaver config
    /etc/beaver/
    ├── beaver.ini
    └── conf.d
        └── mesos.conf
    beaver.ini:
    [beaver]
    kafka_topic: elkprod
    logstash_version: 1
    kafka_hosts: kafkaX:9092,kafkaY:9092,kafkaZ:9092,kafkaXX:9092
    ; Only
    queue_timeout: 43200
    transport: kafka
  • 45. Log Monitoring - Beaver config - mesos.conf
    [/var/log/mesos/mesos-*.WARNING]
    type: mesos
    tags: mesos-cluster
    [/var/log/mesos/mesos-*.INFO]
    type: mesos
    tags: mesos-cluster
    [/var/log/mesos/mesos-*.ERROR]
    type: mesos
    tags: mesos-cluster
    [/var/log/mesos/mesos-*.FATAL]
    type: mesos
    tags: mesos-cluster
    [/data/mesos/slaves/*/frameworks/*/executors/*/runs/*/std*]
    exclude: (latest)
    type: mesos
    tags: mesos-container
  • 46. Log Monitoring - mesos-cluster
  • 47. Log Monitoring - Error logs
  • 48. Log Monitoring - mesos-container
  • 49. Log Monitoring - Logstash Logstash as log management tool
    # "file" from Beaver:
    # /data/mesos/slaves/a7d7fc82-c8de-4aff-84b1-f1d5c578efc7-S5/frameworks/a141ab38-8082-4c50-b04f-ff762b850aa2-0000/executors/prod_dataio_dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407...
    grok {
      match => { "file" => '/data/mesos/slaves/%{DATA}/frameworks/%{DATA}/executors/%{DATA:tags}_%{DATA:group}_%{DATA:task}/runs/%{GREEDYDATA}' }
    }
    # task: dbc-glassfish-harvester.760fedd6-684b-11e6-bfc6-0242c91e8407
    grok {
      match => { "task" => '%{DATA:application}.%{DATA}' }
    }
    (Cons: the grok filter needs revision)
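For comparison, the same extraction can be expressed as a plain regular expression. The sketch below is a Python analogue of the two grok steps, not a replacement for the Logstash config; it assumes that the environment and team parts contain no underscore, and the run id used in the usage note is made up because the path on the slide is truncated.

```python
import re

# Mirrors the grok pattern: .../executors/<tags>_<group>_<task>/runs/...
EXECUTOR_PATH = re.compile(
    r"^/data/mesos/slaves/[^/]+/frameworks/[^/]+/executors/"
    r"(?P<tags>[^_/]+)_(?P<group>[^_/]+)_(?P<task>[^/]+)/runs/"
)


def parse_log_path(path):
    """Return tags, group, task and application for a Mesos sandbox log path.

    Returns None when the path does not match the expected layout.
    """
    match = EXECUTOR_PATH.match(path)
    if match is None:
        return None
    fields = match.groupdict()
    # The application is the task name up to the first literal dot.
    fields["application"] = fields["task"].split(".", 1)[0]
    return fields
```

Applied to the example path with a hypothetical `/runs/RUN_ID/stdout` suffix, this yields `tags="prod"`, `group="dataio"` and `application="dbc-glassfish-harvester"`. Splitting on a literal dot avoids the ambiguity of an unescaped `.` in a pattern.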
  • 50. Log Monitoring - Logstash - Zabbix
  • 51. Log Monitoring - Logstash - Zabbix
    # Look for Java exceptions
    filter {
      if [message] =~ "Exception" {
        mutate {
          add_field => {
            "[itemkey]" => "applog"
            "[alertmsg]" => "%{task}: %{message}"
          }
        }
      }
    }
    output {
      if [alertmsg] {
        zabbix {
          zabbix_server_host => "zabbixhost"
          zabbix_host => "[application]"
          # Single value also possible
          multi_value => [ "[itemkey]", "[alertmsg]" ]
        }
      }
    }
    Zabbix output plugin for Logstash: Zabbix conference 2015 - by untergeek: http://www.slideshare.net/Zabbix/aaron-mildenstein-using-logstash-with-zabbix
    Note: Similar config used for syslog to Zabbix
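The logic of this filter and output pair is small enough to restate as a function, which may help when testing the matching rule outside Logstash. This is only an illustrative Python analogue; the function name and the returned dictionary layout are hypothetical, while the field names follow the config above.

```python
def zabbix_fields(event):
    """Return the Zabbix host, item key and alert text for an event.

    Returns None when the message does not contain 'Exception', in which
    case nothing would be sent to Zabbix.
    """
    message = event.get("message", "")
    if "Exception" not in message:
        return None
    return {
        "zabbix_host": event.get("application"),
        "itemkey": "applog",
        "alertmsg": "%s: %s" % (event.get("task"), message),
    }
```

Because the match is a plain substring test, any log line that merely mentions the word "Exception" would also raise an alert, so the condition may need tightening in practice.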
  • 52. Other solutions? Data to Kafka? - https://github.com/hengyunabc/kafka-zabbix Data to Elastic? Data via StatsD? Collectd Zabbix write plugin? Zabbix container module?