Open Source Monitoring in 2019

Kris Buytaert
Kris BuytaertDevops, Linux and Open Source Expert at Inuits
From #MonitoringSucks toFrom #MonitoringSucks to
#MonitoringLove#MonitoringLove
Open Source Monitoring in 2018-2019Open Source Monitoring in 2018-2019
@KrisBuytaert
Devops Meetup, Brno
Kris BuytaertKris Buytaert
● I used to be a Dev,I used to be a Dev,
● Then Became an OpThen Became an Op
● Chief Twitter Ofcer and Open SourceChief Twitter Ofcer and Open Source
Consultant @inuits.euConsultant @inuits.eu
● Everything is an efng DNS ProblemEverything is an efng DNS Problem
● Building Clouds since before the bookstoreBuilding Clouds since before the bookstore
● Organising ConferencesOrganising Conferences
● Evangelizing devopsEvangelizing devops
An opinionated talk about the Open SourceAn opinionated talk about the Open Source
Monitoring tooling landscapeMonitoring tooling landscape
In which I hope to learn from YOUIn which I hope to learn from YOU
#devops=~C(L)AMS#devops=~C(L)AMS
● CultureCulture
● (Lean)(Lean)
● AutomationAutomation
● Monitoring and MeasurementMonitoring and Measurement
● SharingSharing
Damon Edwards and John WillisDamon Edwards and John Willis
Gene KimGene Kim
Monitoring is usually anMonitoring is usually an
aftertoughtaftertought
ENOBUDGET, ENOTIMEENOBUDGET, ENOTIME
AnAn 20082008 OLS PaperOLS Paper
● We have bloated Java toolsWe have bloated Java tools
● Some open Core stufSome open Core stuf
● DYI folks want traditional NagiosDYI folks want traditional Nagios
● DBA RequiredDBA Required
#monitoringsucks#monitoringsucks
● John Vincent (@lusis), june 2011John Vincent (@lusis), june 2011
● A sub #devops movementA sub #devops movement
● https://github.com/monitoringsucks/https://github.com/monitoringsucks/
Why #monitoringsucksWhy #monitoringsucks
● Manual confg (gui)Manual confg (gui)
● Not in sync with realityNot in sync with reality
● Hosts onlyHosts only
● Services sometimesServices sometimes
● Application neverApplication never
● Chaos or out of sync with realityChaos or out of sync with reality
● Alert FatigueAlert Fatigue
#monitoringlove#monitoringlove
•
•
Ulf Mansson #devopsdays Rome 2011Ulf Mansson #devopsdays Rome 2011
•
A new era of toolingA new era of tooling
• #monitoringlove hacksessions @inuits#monitoringlove hacksessions @inuits
•
#monitorama#monitorama
What we wantWhat we want
● Small , well suited componentsSmall , well suited components
•
CollectCollect
•
Transport / MangleTransport / Mangle
•
StoreStore
•
AnalyseAnalyse
•
Act / AlertAct / Alert
•
VisualizeVisualize
Open Source Monitoring in 2019
Open Source Monitoring in 2019
The love was : SensuThe love was : Sensu
● Awesome for non staticAwesome for non static
environmentsenvironments
● Scaling a clustered RabbitMQ ?Scaling a clustered RabbitMQ ?
● Looking more and more likeLooking more and more like
PrometheusPrometheus
● This is Europe, U no do cloudThis is Europe, U no do cloud
Automation ofAutomation of
#monitoring#monitoring
brought backbrought back
thethe #love#love
There is no such thingThere is no such thing
as “Service” discoveryas “Service” discovery
Monitoring aMonitoring a serviceservice
vsvs
Monitoring aMonitoring a ServiceService
AutomationAutomation
defnition of done:defnition of done:
monitored and in productionmonitored and in production
A software project is not doneA software project is not done
until your last end user is deaduntil your last end user is dead
Culture,Culture,
Automation,Automation,
Measurement :Measurement :
measure all the thingsmeasure all the things
SharingSharing
Collection :Collection :
● Collectd,Collectd,
● DiamondDiamond
● From applicationFrom application
● Custom ExportersCustom Exporters
● LogsLogs
● LogsLogs
NetDataNetData
● GranularityGranularity
● Debug vsDebug vs
constant prod ?constant prod ?
●
Transport / Ship / Mangle:Transport / Ship / Mangle:
● Collectd / Diamond / TelegrafCollectd / Diamond / Telegraf
● (r)syslog, Beats, logstash(r)syslog, Beats, logstash
● Q , Nats, ActiveMQ, RabbitMQQ , Nats, ActiveMQ, RabbitMQ
● Collect fromCollect from
anywhereanywhere
● FilterFilter
● Send anywhereSend anywhere
Store :Store :
● TSDB : Time Series DBTSDB : Time Series DB
● Optimized DB for Time SeriesOptimized DB for Time Series
● Graphite/ Infux / OpenTSDB / ....Graphite/ Infux / OpenTSDB / ....
● ElasticElastic
● Long Term vs Short Term StorageLong Term vs Short Term Storage
Oldschool graphiteOldschool graphite
PrometheusPrometheus
● Started 2012Started 2012
● SoundCloudSoundCloud
● Metrics BasedMetrics Based
● ScrapesScrapes
EndpointsEndpoints
•
ExistingExisting
endpoints forendpoints for
limited toolslimited tools
● GraphiteGraphite
ExporterExporter
● Push GatewayPush Gateway
● Great AlertingGreat Alerting
PrometheusPrometheus
● Mostly for Short TermMostly for Short Term
● Still Ship longterm metrics to otherStill Ship longterm metrics to other
TSDBTSDB
● Nginx gw’s all over the placeNginx gw’s all over the place
•
(ssl fun)(ssl fun)
Infnite Diskspace ?Infnite Diskspace ?
● Logstash outputLogstash output
•
Statsd => GraphiteStatsd => Graphite
•
Keep patterns around,Keep patterns around,
•
Selectively purge dataSelectively purge data
● Prometheus for Short TermPrometheus for Short Term
•
Graphite for Long termGraphite for Long term
Log AlternativesLog Alternatives
● Graylog2Graylog2
● ELSA (Enterprise Log Search andELSA (Enterprise Log Search and
Archive)Archive)
● ELK StackELK Stack
● FluentdFluentd
Prometheus ?Prometheus ?
● Only For Containers ?Only For Containers ?
● Also for other setups !Also for other setups !
● Is this sufcient ?Is this sufcient ?
Act / Alert:Act / Alert:
Checking for FailureChecking for Failure
● IcingaIcinga
•
Automated confg generationAutomated confg generation
● SensuSensu
•
CloudstyleCloudstyle
● PrometheusPrometheus
•
AlertManagerAlertManager
Waking you up at nightWaking you up at night
● FlapjackFlapjack
fapjack.iofapjack.io
monitoring notifcation routing +monitoring notifcation routing +
event processing systemevent processing system
● OpenDutyOpenDuty
github.com/szechuen/OpenDutygithub.com/szechuen/OpenDuty
Duty managementDuty management
Waking you up at nightWaking you up at night
● AnagAnag
● Custom written stufCustom written stuf
Analyse:Analyse:
Basic SearchBasic Search
Graphs to KnowledgeGraphs to Knowledge
SkylineSkyline
•
OculusOculus
•
Creating Information out of this dataCreating Information out of this data
•
Big dataBig data
•
Machine LearningMachine Learning
•
Hastic.ioHastic.io
Hastic.ioHastic.io
● Open Source Pattern DetectionOpen Source Pattern Detection
● Label patterns → Wait for learning toLabel patterns → Wait for learning to
complete → Get detectionscomplete → Get detections
● Hastic Server + Grafana AppHastic Server + Grafana App
LogIslandsLogIslands
● Complex event processing &Complex event processing &
patterns mining at scalepatterns mining at scale
● Kafka, Nif, Spark, HadoopKafka, Nif, Spark, Hadoop
Visualize:Visualize:
KibanaKibana
GrafanaGrafana
ChallengeChallenge
● *ana as code*ana as code
● Template your ...Template your ...
● e.g grafonnet-libe.g grafonnet-lib
•
A jsonnet lib to generate GrafanaA jsonnet lib to generate Grafana
dashboards ...dashboards ...
AggregatingAggregating
● ThrukThruk
● GrafanaGrafana
● DashingDashing
Lack of change ?Lack of change ?
● Limited # new toolsLimited # new tools
● Feature Complete ?Feature Complete ?
Is prometheus the newIs prometheus the new
Docker ?Docker ?
APMAPM
Application Performance MonitoringApplication Performance Monitoring
But what about my apps ?But what about my apps ?
● agent required that ties to codeagent required that ties to code
● Code modifcationsCode modifcations
Old PacketBeatOld PacketBeat
Open Source “APM”Open Source “APM”
● Scouter
● Jaeger
● Kamon
● Zipkin
● Beats ...
● Performance Co Pilot
● Kamon
● Pinpoint
● Micrometer
● StageMonitor
● SkyWalking
● Kieker
=> Huge focus on the Java Ecosystem , little
options for PHP/ Python / Ruby shops.
OpenAPM.ioOpenAPM.io
OpenTracing 101OpenTracing 101
● The problem : It was not reasonable to ask all OSS services and all OSSThe problem : It was not reasonable to ask all OSS services and all OSS
packages and all application-specifc code to use a single tracingpackages and all application-specifc code to use a single tracing
vendor => Open Ttracingvendor => Open Ttracing
● Distributed Tracing StandardDistributed Tracing Standard
● CNCFCNCF
● Dapper inside GoogleDapper inside Google
● ““OpenTracing is not a download or a program. Distributed tracingOpenTracing is not a download or a program. Distributed tracing
requires that software developers add instrumentation to the code ofrequires that software developers add instrumentation to the code of
an application, or to the frameworks used in the application”an application, or to the frameworks used in the application”
Complexity is the EnemyComplexity is the Enemy
of Reliabilityof Reliability
I love where Monitoring is headingI love where Monitoring is heading
““Wait , was I oncall last week ?”Wait , was I oncall last week ?”
True words said by one of our oncall engineersTrue words said by one of our oncall engineers
OpservabilityOpservability
ContactContact
Kris Buytaert kris.buytaert@inuits.euKris Buytaert kris.buytaert@inuits.eu
Further ReadingFurther Reading
@krisbuytaert@krisbuytaert
http://www.krisbuytaert.be/blog/http://www.krisbuytaert.be/blog/
http://www.inuits.eu/http://www.inuits.eu/
Find Inuits inFind Inuits in
Brasschaat,Ghent,Brasschaat,Ghent,
Rotterdam,Prague,Rotterdam,Prague,
Kiev,BrnoKiev,Brno
1 of 56

Recommended

Migrating to Puppet 5 by
Migrating to Puppet 5Migrating to Puppet 5
Migrating to Puppet 5Kris Buytaert
1K views34 slides
Repositories as Code by
Repositories as CodeRepositories as Code
Repositories as CodeKris Buytaert
642 views34 slides
GitOps , done Right by
GitOps , done RightGitOps , done Right
GitOps , done RightKris Buytaert
209 views38 slides
Continuous Infrastructure First by
Continuous Infrastructure FirstContinuous Infrastructure First
Continuous Infrastructure FirstKris Buytaert
568 views52 slides
Continuous Infrastructure First by
Continuous Infrastructure FirstContinuous Infrastructure First
Continuous Infrastructure FirstKris Buytaert
521 views42 slides
Deploying your SaaS stack OnPrem by
Deploying your SaaS stack OnPremDeploying your SaaS stack OnPrem
Deploying your SaaS stack OnPremKris Buytaert
681 views38 slides

More Related Content

What's hot

Can we fix dev-oops ? by
Can we fix dev-oops ?Can we fix dev-oops ?
Can we fix dev-oops ?Kris Buytaert
675 views53 slides
From MonitoringSucks to Monitoring Love , 2016 Edition by
From MonitoringSucks to Monitoring Love , 2016 EditionFrom MonitoringSucks to Monitoring Love , 2016 Edition
From MonitoringSucks to Monitoring Love , 2016 EditionKris Buytaert
29.4K views51 slides
Pipeline as Code by
Pipeline as CodePipeline as Code
Pipeline as CodeKris Buytaert
982 views20 slides
Is there a future for devops ? by
Is there a future for devops ?Is there a future for devops ?
Is there a future for devops ?Kris Buytaert
3.5K views49 slides
Devops is Dead, Long live Devops by
Devops is Dead, Long live DevopsDevops is Dead, Long live Devops
Devops is Dead, Long live DevopsKris Buytaert
289 views33 slides
Devops is a Security Requirement by
Devops is a Security RequirementDevops is a Security Requirement
Devops is a Security RequirementKris Buytaert
699 views64 slides

What's hot(20)

From MonitoringSucks to Monitoring Love , 2016 Edition by Kris Buytaert
From MonitoringSucks to Monitoring Love , 2016 EditionFrom MonitoringSucks to Monitoring Love , 2016 Edition
From MonitoringSucks to Monitoring Love , 2016 Edition
Kris Buytaert29.4K views
Is there a future for devops ? by Kris Buytaert
Is there a future for devops ?Is there a future for devops ?
Is there a future for devops ?
Kris Buytaert3.5K views
Devops is Dead, Long live Devops by Kris Buytaert
Devops is Dead, Long live DevopsDevops is Dead, Long live Devops
Devops is Dead, Long live Devops
Kris Buytaert289 views
Devops is a Security Requirement by Kris Buytaert
Devops is a Security RequirementDevops is a Security Requirement
Devops is a Security Requirement
Kris Buytaert699 views
10 years of #devopsdays, but what have we really learned ? by Kris Buytaert
10 years of #devopsdays, but what have we really learned ? 10 years of #devopsdays, but what have we really learned ?
10 years of #devopsdays, but what have we really learned ?
Kris Buytaert594 views
Pipeline as code for your infrastructure as Code by Kris Buytaert
Pipeline as code for your infrastructure as CodePipeline as code for your infrastructure as Code
Pipeline as code for your infrastructure as Code
Kris Buytaert1.9K views
Moby is killing your devops efforts by Kris Buytaert
Moby is killing your devops effortsMoby is killing your devops efforts
Moby is killing your devops efforts
Kris Buytaert2.7K views
Dev secops opsec, devsec, devops ? by Kris Buytaert
Dev secops opsec, devsec, devops ?Dev secops opsec, devsec, devops ?
Dev secops opsec, devsec, devops ?
Kris Buytaert17.4K views
Devops is dead, Long Live Devops by Kris Buytaert
Devops is dead, Long Live DevopsDevops is dead, Long Live Devops
Devops is dead, Long Live Devops
Kris Buytaert1.7K views
Automating MySQL operations with Puppet by Kris Buytaert
Automating MySQL operations with PuppetAutomating MySQL operations with Puppet
Automating MySQL operations with Puppet
Kris Buytaert1.8K views
Groovy there's a docker in my application pipeline by Kris Buytaert
Groovy there's a docker in my application pipelineGroovy there's a docker in my application pipeline
Groovy there's a docker in my application pipeline
Kris Buytaert1.3K views
Devopsdays Amsterdam 2017 Keynote, looking back at 5 years of AMS by Kris Buytaert
Devopsdays Amsterdam 2017 Keynote, looking back at 5 years of AMSDevopsdays Amsterdam 2017 Keynote, looking back at 5 years of AMS
Devopsdays Amsterdam 2017 Keynote, looking back at 5 years of AMS
Kris Buytaert772 views
The Return of the Dull Stack Engineer by Kris Buytaert
The Return of the Dull Stack EngineerThe Return of the Dull Stack Engineer
The Return of the Dull Stack Engineer
Kris Buytaert2.4K views
ADDO 2019: Looking back at over 10 years of Devops by Kris Buytaert
ADDO 2019:    Looking back at over 10 years of DevopsADDO 2019:    Looking back at over 10 years of Devops
ADDO 2019: Looking back at over 10 years of Devops
Kris Buytaert578 views
No, we can't do continuous delivery by Kris Buytaert
No, we can't do continuous deliveryNo, we can't do continuous delivery
No, we can't do continuous delivery
Kris Buytaert1.8K views
On the Importance of Infrastructure as Code by Kris Buytaert
On the Importance of Infrastructure as CodeOn the Importance of Infrastructure as Code
On the Importance of Infrastructure as Code
Kris Buytaert1.8K views

Similar to Open Source Monitoring in 2019

Open Source Monitoring in 2015 by
Open Source Monitoring in 2015Open Source Monitoring in 2015
Open Source Monitoring in 2015Kris Buytaert
2.3K views53 slides
Another 7 tools for your #devops stack by
Another 7 tools for your #devops stackAnother 7 tools for your #devops stack
Another 7 tools for your #devops stackKris Buytaert
1.6K views46 slides
OSDC 2016 - Another 7 Tools for your #devops Stack by Kris Buytaert by
OSDC 2016 - Another 7 Tools for your #devops Stack by Kris BuytaertOSDC 2016 - Another 7 Tools for your #devops Stack by Kris Buytaert
OSDC 2016 - Another 7 Tools for your #devops Stack by Kris BuytaertNETWAYS
81 views46 slides
Monitoring in an Infrastructure as Code Age by
Monitoring in an Infrastructure as Code AgeMonitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code AgePuppet
7.2K views41 slides
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris Buytaert by
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris BuytaertOSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris Buytaert
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris BuytaertNETWAYS
545 views64 slides
Monitoring in an Infrastructure as Code Age by
Monitoring in an Infrastructure as Code AgeMonitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code AgeKris Buytaert
12K views40 slides

Similar to Open Source Monitoring in 2019 (20)

Open Source Monitoring in 2015 by Kris Buytaert
Open Source Monitoring in 2015Open Source Monitoring in 2015
Open Source Monitoring in 2015
Kris Buytaert2.3K views
Another 7 tools for your #devops stack by Kris Buytaert
Another 7 tools for your #devops stackAnother 7 tools for your #devops stack
Another 7 tools for your #devops stack
Kris Buytaert1.6K views
OSDC 2016 - Another 7 Tools for your #devops Stack by Kris Buytaert by NETWAYS
OSDC 2016 - Another 7 Tools for your #devops Stack by Kris BuytaertOSDC 2016 - Another 7 Tools for your #devops Stack by Kris Buytaert
OSDC 2016 - Another 7 Tools for your #devops Stack by Kris Buytaert
NETWAYS81 views
Monitoring in an Infrastructure as Code Age by Puppet
Monitoring in an Infrastructure as Code AgeMonitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code Age
Puppet7.2K views
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris Buytaert by NETWAYS
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris BuytaertOSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris Buytaert
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris Buytaert
NETWAYS545 views
Monitoring in an Infrastructure as Code Age by Kris Buytaert
Monitoring in an Infrastructure as Code AgeMonitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code Age
Kris Buytaert12K views
Continous Delivery of your Infrastructure by Kris Buytaert
Continous Delivery of your InfrastructureContinous Delivery of your Infrastructure
Continous Delivery of your Infrastructure
Kris Buytaert857 views
Monitoring Drupal In an Infrastructure as Code Age by Kris Buytaert
Monitoring Drupal In an Infrastructure as Code AgeMonitoring Drupal In an Infrastructure as Code Age
Monitoring Drupal In an Infrastructure as Code Age
Kris Buytaert10.6K views
Run stuff, Deploy Stuff, Jax London 2017 Edition by Kris Buytaert
Run stuff, Deploy Stuff, Jax London 2017 EditionRun stuff, Deploy Stuff, Jax London 2017 Edition
Run stuff, Deploy Stuff, Jax London 2017 Edition
Kris Buytaert2.2K views
OSMC 2014 | From monitoringsucks to monitoringlove, and back by Kris Buytaert by NETWAYS
OSMC 2014 | From monitoringsucks to monitoringlove, and back by Kris BuytaertOSMC 2014 | From monitoringsucks to monitoringlove, and back by Kris Buytaert
OSMC 2014 | From monitoringsucks to monitoringlove, and back by Kris Buytaert
NETWAYS44 views
When traditional configuration management is to slow for your needs by Kris Buytaert
When traditional configuration management is to slow for your needsWhen traditional configuration management is to slow for your needs
When traditional configuration management is to slow for your needs
Kris Buytaert1.4K views
Continuous Delivery of (y)our infrastructure. by Kris Buytaert
Continuous Delivery of (y)our infrastructure.Continuous Delivery of (y)our infrastructure.
Continuous Delivery of (y)our infrastructure.
Kris Buytaert3K views
From Config Management Sucks to #cfgmgmtlove by Kris Buytaert
From Config Management Sucks to #cfgmgmtlove From Config Management Sucks to #cfgmgmtlove
From Config Management Sucks to #cfgmgmtlove
Kris Buytaert2K views
OSDC 2015: Kris Buytaert | From ConfigManagementSucks to ConfigManagementLove by NETWAYS
OSDC 2015: Kris Buytaert | From ConfigManagementSucks to ConfigManagementLoveOSDC 2015: Kris Buytaert | From ConfigManagementSucks to ConfigManagementLove
OSDC 2015: Kris Buytaert | From ConfigManagementSucks to ConfigManagementLove
NETWAYS204 views
Icinga Camp Amsterdam - Infrastructure as Code by Icinga
Icinga Camp Amsterdam - Infrastructure as CodeIcinga Camp Amsterdam - Infrastructure as Code
Icinga Camp Amsterdam - Infrastructure as Code
Icinga3.3K views
OSMC 2017 | Groovy There is a Docker in my Dashing Pipeline by Kris Buytaert by NETWAYS
OSMC 2017 | Groovy There is a Docker in my Dashing Pipeline by Kris Buytaert OSMC 2017 | Groovy There is a Docker in my Dashing Pipeline by Kris Buytaert
OSMC 2017 | Groovy There is a Docker in my Dashing Pipeline by Kris Buytaert
NETWAYS152 views
The devops approach to monitoring, Open Source and Infrastructure as Code Style by Julien Pivotto
The devops approach to monitoring, Open Source and Infrastructure as Code StyleThe devops approach to monitoring, Open Source and Infrastructure as Code Style
The devops approach to monitoring, Open Source and Infrastructure as Code Style
Julien Pivotto14.6K views
OSDC 2012 | Devops and Open Source by Kris Buytaert by NETWAYS
OSDC 2012 | Devops and Open Source by Kris BuytaertOSDC 2012 | Devops and Open Source by Kris Buytaert
OSDC 2012 | Devops and Open Source by Kris Buytaert
NETWAYS23 views
OSDC 2012 | Devops and Open Source by Kris Buyaert by NETWAYS
OSDC 2012 | Devops and Open Source by Kris BuyaertOSDC 2012 | Devops and Open Source by Kris Buyaert
OSDC 2012 | Devops and Open Source by Kris Buyaert
NETWAYS18 views

More from Kris Buytaert

Years of (not) learning , from devops to devoops by
Years of (not) learning , from devops to devoopsYears of (not) learning , from devops to devoops
Years of (not) learning , from devops to devoopsKris Buytaert
65 views44 slides
Observability will not fix your Broken Monitoring ,Ignite by
Observability will not fix your Broken Monitoring ,IgniteObservability will not fix your Broken Monitoring ,Ignite
Observability will not fix your Broken Monitoring ,IgniteKris Buytaert
167 views20 slides
Infrastructure as Code Patterns by
Infrastructure as Code PatternsInfrastructure as Code Patterns
Infrastructure as Code PatternsKris Buytaert
117 views53 slides
From devoops to devops 13 years of (not) learning by
From devoops to devops 13 years of (not) learningFrom devoops to devops 13 years of (not) learning
From devoops to devops 13 years of (not) learningKris Buytaert
185 views40 slides
Pipeline all the Dashboards as Code by
Pipeline all the Dashboards as CodePipeline all the Dashboards as Code
Pipeline all the Dashboards as CodeKris Buytaert
644 views20 slides
Is there a Future for devops ? by
Is there a Future for devops   ? Is there a Future for devops   ?
Is there a Future for devops ? Kris Buytaert
478 views53 slides

More from Kris Buytaert(10)

Years of (not) learning , from devops to devoops by Kris Buytaert
Years of (not) learning , from devops to devoopsYears of (not) learning , from devops to devoops
Years of (not) learning , from devops to devoops
Kris Buytaert65 views
Observability will not fix your Broken Monitoring ,Ignite by Kris Buytaert
Observability will not fix your Broken Monitoring ,IgniteObservability will not fix your Broken Monitoring ,Ignite
Observability will not fix your Broken Monitoring ,Ignite
Kris Buytaert167 views
Infrastructure as Code Patterns by Kris Buytaert
Infrastructure as Code PatternsInfrastructure as Code Patterns
Infrastructure as Code Patterns
Kris Buytaert117 views
From devoops to devops 13 years of (not) learning by Kris Buytaert
From devoops to devops 13 years of (not) learningFrom devoops to devops 13 years of (not) learning
From devoops to devops 13 years of (not) learning
Kris Buytaert185 views
Pipeline all the Dashboards as Code by Kris Buytaert
Pipeline all the Dashboards as CodePipeline all the Dashboards as Code
Pipeline all the Dashboards as Code
Kris Buytaert644 views
Is there a Future for devops ? by Kris Buytaert
Is there a Future for devops   ? Is there a Future for devops   ?
Is there a Future for devops ?
Kris Buytaert478 views
10 Years of #devopsdays weirdness by Kris Buytaert
10 Years of #devopsdays weirdness10 Years of #devopsdays weirdness
10 Years of #devopsdays weirdness
Kris Buytaert400 views
Continuous Infrastructure First Ignite Edition by Kris Buytaert
Continuous Infrastructure First  Ignite EditionContinuous Infrastructure First  Ignite Edition
Continuous Infrastructure First Ignite Edition
Kris Buytaert476 views
Looking back at 5 years of #cfgmgmtcamp by Kris Buytaert
Looking back at 5 years of #cfgmgmtcampLooking back at 5 years of #cfgmgmtcamp
Looking back at 5 years of #cfgmgmtcamp
Kris Buytaert625 views
Looking back at 7.5 years of Devopsdays , DOd PDX by Kris Buytaert
Looking back at 7.5 years of Devopsdays , DOd PDXLooking back at 7.5 years of Devopsdays , DOd PDX
Looking back at 7.5 years of Devopsdays , DOd PDX
Kris Buytaert463 views

Recently uploaded

Business Analyst Series 2023 - Week 4 Session 7 by
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7DianaGray10
146 views31 slides
Transcript: Redefining the book supply chain: A glimpse into the future - Tec... by
Transcript: Redefining the book supply chain: A glimpse into the future - Tec...Transcript: Redefining the book supply chain: A glimpse into the future - Tec...
Transcript: Redefining the book supply chain: A glimpse into the future - Tec...BookNet Canada
41 views16 slides
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream by
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamEvaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamAlpen-Adria-Universität
38 views34 slides
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...ShapeBlue
183 views18 slides
"Surviving highload with Node.js", Andrii Shumada by
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada Fwdays
58 views29 slides
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...ShapeBlue
162 views25 slides

Recently uploaded(20)

Business Analyst Series 2023 - Week 4 Session 7 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7
DianaGray10146 views
Transcript: Redefining the book supply chain: A glimpse into the future - Tec... by BookNet Canada
Transcript: Redefining the book supply chain: A glimpse into the future - Tec...Transcript: Redefining the book supply chain: A glimpse into the future - Tec...
Transcript: Redefining the book supply chain: A glimpse into the future - Tec...
BookNet Canada41 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue183 views
"Surviving highload with Node.js", Andrii Shumada by Fwdays
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada
Fwdays58 views
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by ShapeBlue
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
ShapeBlue162 views
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue141 views
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And... by ShapeBlue
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
ShapeBlue108 views
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading... by The Digital Insurer
Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... by ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue129 views
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue by ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue152 views
Business Analyst Series 2023 - Week 4 Session 8 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 8Business Analyst Series 2023 -  Week 4 Session 8
Business Analyst Series 2023 - Week 4 Session 8
DianaGray10145 views
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue225 views
The Role of Patterns in the Era of Large Language Models by Yunyao Li
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language Models
Yunyao Li91 views
The Power of Heat Decarbonisation Plans in the Built Environment by IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE84 views
NTGapps NTG LowCode Platform by Mustafa Kuğu
NTGapps NTG LowCode Platform NTGapps NTG LowCode Platform
NTGapps NTG LowCode Platform
Mustafa Kuğu437 views
"Package management in monorepos", Zoltan Kochan by Fwdays
"Package management in monorepos", Zoltan Kochan"Package management in monorepos", Zoltan Kochan
"Package management in monorepos", Zoltan Kochan
Fwdays34 views
Optimizing Communication to Optimize Human Behavior - LCBM by Yaman Kumar
Optimizing Communication to Optimize Human Behavior - LCBMOptimizing Communication to Optimize Human Behavior - LCBM
Optimizing Communication to Optimize Human Behavior - LCBM
Yaman Kumar38 views

Open Source Monitoring in 2019

  • 1. From #MonitoringSucks toFrom #MonitoringSucks to #MonitoringLove#MonitoringLove Open Source Monitoring in 2018-2019Open Source Monitoring in 2018-2019 @KrisBuytaert Devops Meetup, Brno
  • 2. Kris BuytaertKris Buytaert ● I used to be a Dev,I used to be a Dev, ● Then Became an OpThen Became an Op ● Chief Twitter Ofcer and Open SourceChief Twitter Ofcer and Open Source Consultant @inuits.euConsultant @inuits.eu ● Everything is an efng DNS ProblemEverything is an efng DNS Problem ● Building Clouds since before the bookstoreBuilding Clouds since before the bookstore ● Organising ConferencesOrganising Conferences ● Evangelizing devopsEvangelizing devops
  • 3. An opinionated talk about the Open SourceAn opinionated talk about the Open Source Monitoring tooling landscapeMonitoring tooling landscape In which I hope to learn from YOUIn which I hope to learn from YOU
  • 4. #devops=~C(L)AMS#devops=~C(L)AMS ● CultureCulture ● (Lean)(Lean) ● AutomationAutomation ● Monitoring and MeasurementMonitoring and Measurement ● SharingSharing Damon Edwards and John WillisDamon Edwards and John Willis Gene KimGene Kim
  • 5. Monitoring is usually anMonitoring is usually an aftertoughtaftertought ENOBUDGET, ENOTIMEENOBUDGET, ENOTIME
  • 6. AnAn 20082008 OLS PaperOLS Paper ● We have bloated Java toolsWe have bloated Java tools ● Some open Core stufSome open Core stuf ● DYI folks want traditional NagiosDYI folks want traditional Nagios ● DBA RequiredDBA Required
  • 7. #monitoringsucks#monitoringsucks ● John Vincent (@lusis), june 2011John Vincent (@lusis), june 2011 ● A sub #devops movementA sub #devops movement ● https://github.com/monitoringsucks/https://github.com/monitoringsucks/
  • 8. Why #monitoringsucksWhy #monitoringsucks ● Manual confg (gui)Manual confg (gui) ● Not in sync with realityNot in sync with reality ● Hosts onlyHosts only ● Services sometimesServices sometimes ● Application neverApplication never ● Chaos or out of sync with realityChaos or out of sync with reality ● Alert FatigueAlert Fatigue
  • 9. #monitoringlove#monitoringlove • • Ulf Mansson #devopsdays Rome 2011Ulf Mansson #devopsdays Rome 2011 • A new era of toolingA new era of tooling • #monitoringlove hacksessions @inuits#monitoringlove hacksessions @inuits • #monitorama#monitorama
  • 10. What we wantWhat we want ● Small , well suited componentsSmall , well suited components • CollectCollect • Transport / MangleTransport / Mangle • StoreStore • AnalyseAnalyse • Act / AlertAct / Alert • VisualizeVisualize
  • 13. The love was : SensuThe love was : Sensu ● Awesome for non staticAwesome for non static environmentsenvironments ● Scaling a clustered RabbitMQ ?Scaling a clustered RabbitMQ ? ● Looking more and more likeLooking more and more like PrometheusPrometheus ● This is Europe, U no do cloudThis is Europe, U no do cloud
  • 14. Automation ofAutomation of #monitoring#monitoring brought backbrought back thethe #love#love
  • 15. There is no such thingThere is no such thing as “Service” discoveryas “Service” discovery
  • 16. Monitoring aMonitoring a serviceservice vsvs Monitoring aMonitoring a ServiceService
  • 18. defnition of done:defnition of done: monitored and in productionmonitored and in production
  • 19. A software project is not doneA software project is not done until your last end user is deaduntil your last end user is dead
  • 20. Culture,Culture, Automation,Automation, Measurement :Measurement : measure all the thingsmeasure all the things SharingSharing
  • 21. Collection :Collection : ● Collectd,Collectd, ● DiamondDiamond ● From applicationFrom application ● Custom ExportersCustom Exporters ● LogsLogs ● LogsLogs
  • 22. NetDataNetData ● GranularityGranularity ● Debug vsDebug vs constant prod ?constant prod ? ●
  • 23. Transport / Ship / Mangle:Transport / Ship / Mangle: ● Collectd / Diamond / TelegrafCollectd / Diamond / Telegraf ● (r)syslog, Beats, logstash(r)syslog, Beats, logstash ● Q , Nats, ActiveMQ, RabbitMQQ , Nats, ActiveMQ, RabbitMQ
  • 24. ● Collect fromCollect from anywhereanywhere ● FilterFilter ● Send anywhereSend anywhere
  • 25. Store :Store : ● TSDB : Time Series DBTSDB : Time Series DB ● Optimized DB for Time SeriesOptimized DB for Time Series ● Graphite/ Infux / OpenTSDB / ....Graphite/ Infux / OpenTSDB / .... ● ElasticElastic ● Long Term vs Short Term StorageLong Term vs Short Term Storage
  • 27. PrometheusPrometheus ● Started 2012Started 2012 ● SoundCloudSoundCloud ● Metrics BasedMetrics Based ● ScrapesScrapes EndpointsEndpoints • ExistingExisting endpoints forendpoints for limited toolslimited tools ● GraphiteGraphite ExporterExporter ● Push GatewayPush Gateway ● Great AlertingGreat Alerting
  • 28. PrometheusPrometheus ● Mostly for Short TermMostly for Short Term ● Still Ship longterm metrics to otherStill Ship longterm metrics to other TSDBTSDB ● Nginx gw’s all over the placeNginx gw’s all over the place • (ssl fun)(ssl fun)
  • 29. Infnite Diskspace ?Infnite Diskspace ? ● Logstash outputLogstash output • Statsd => GraphiteStatsd => Graphite • Keep patterns around,Keep patterns around, • Selectively purge dataSelectively purge data ● Prometheus for Short TermPrometheus for Short Term • Graphite for Long termGraphite for Long term
  • 30. Log AlternativesLog Alternatives ● Graylog2Graylog2 ● ELSA (Enterprise Log Search andELSA (Enterprise Log Search and Archive)Archive) ● ELK StackELK Stack ● FluentdFluentd
  • 31. Prometheus ?Prometheus ? ● Only For Containers ?Only For Containers ? ● Also for other setups !Also for other setups ! ● Is this sufcient ?Is this sufcient ?
  • 32. Act / Alert:Act / Alert:
  • 33. Checking for FailureChecking for Failure ● IcingaIcinga • Automated confg generationAutomated confg generation ● SensuSensu • CloudstyleCloudstyle ● PrometheusPrometheus • AlertManagerAlertManager
  • 34. Waking you up at nightWaking you up at night ● FlapjackFlapjack fapjack.iofapjack.io monitoring notifcation routing +monitoring notifcation routing + event processing systemevent processing system ● OpenDutyOpenDuty github.com/szechuen/OpenDutygithub.com/szechuen/OpenDuty Duty managementDuty management
  • 35. Waking you up at nightWaking you up at night ● AnagAnag ● Custom written stufCustom written stuf
  • 38. Graphs to KnowledgeGraphs to Knowledge SkylineSkyline • OculusOculus • Creating Information out of this dataCreating Information out of this data • Big dataBig data • Machine LearningMachine Learning • Hastic.ioHastic.io
  • 39. Hastic.ioHastic.io ● Open Source Pattern DetectionOpen Source Pattern Detection ● Label patterns → Wait for learning toLabel patterns → Wait for learning to complete → Get detectionscomplete → Get detections ● Hastic Server + Grafana AppHastic Server + Grafana App
  • 40. LogIslandsLogIslands ● Complex event processing &Complex event processing & patterns mining at scalepatterns mining at scale ● Kafka, Nif, Spark, HadoopKafka, Nif, Spark, Hadoop
  • 44. ChallengeChallenge ● *ana as code*ana as code ● Template your ...Template your ... ● e.g grafonnet-libe.g grafonnet-lib • A jsonnet lib to generate GrafanaA jsonnet lib to generate Grafana dashboards ...dashboards ...
  • 46. Lack of change ?Lack of change ? ● Limited # new toolsLimited # new tools ● Feature Complete ?Feature Complete ?
  • 47. Is prometheus the newIs prometheus the new Docker ?Docker ?
  • 48. APMAPM Application Performance MonitoringApplication Performance Monitoring But what about my apps ?But what about my apps ? ● agent required that ties to codeagent required that ties to code ● Code modifcationsCode modifcations
  • 50. Open Source “APM”Open Source “APM” ● Scouter ● Jaeger ● Kamon ● Zipkin ● Beats ... ● Performance Co Pilot ● Kamon ● Pinpoint ● Micrometer ● StageMonitor ● SkyWalking ● Kieker => Huge focus on the Java Ecosystem , little options for PHP/ Python / Ruby shops.
  • 52. OpenTracing 101OpenTracing 101 ● The problem : It was not reasonable to ask all OSS services and all OSSThe problem : It was not reasonable to ask all OSS services and all OSS packages and all application-specifc code to use a single tracingpackages and all application-specifc code to use a single tracing vendor => Open Ttracingvendor => Open Ttracing ● Distributed Tracing StandardDistributed Tracing Standard ● CNCFCNCF ● Dapper inside GoogleDapper inside Google ● ““OpenTracing is not a download or a program. Distributed tracingOpenTracing is not a download or a program. Distributed tracing requires that software developers add instrumentation to the code ofrequires that software developers add instrumentation to the code of an application, or to the frameworks used in the application”an application, or to the frameworks used in the application”
  • 53. Complexity is the EnemyComplexity is the Enemy of Reliabilityof Reliability
  • 54. I love where Monitoring is headingI love where Monitoring is heading ““Wait , was I oncall last week ?”Wait , was I oncall last week ?” True words said by one of our oncall engineersTrue words said by one of our oncall engineers
  • 56. ContactContact Kris Buytaert kris.buytaert@inuits.euKris Buytaert kris.buytaert@inuits.eu Further ReadingFurther Reading @krisbuytaert@krisbuytaert http://www.krisbuytaert.be/blog/http://www.krisbuytaert.be/blog/ http://www.inuits.eu/http://www.inuits.eu/ Find Inuits inFind Inuits in Brasschaat,Ghent,Brasschaat,Ghent, Rotterdam,Prague,Rotterdam,Prague, Kiev,BrnoKiev,Brno