
Monitoring in an Infrastructure as Code Age


Monitoring an infrastructure is still one of the most complex tasks at hand. Yet in this age of Infrastructure as Code, a lot of people are still stuck using tools that haven't adapted. This talk looks at monitoring your infrastructure from another angle, where you benefit from using Puppet to deploy and provision your monitoring platforms. It explains which monitoring tools are suitable for monitoring at scale and for Puppet-driven provisioning, and which ones will only make your life harder than it should be. Where do tools like Icinga, Sensu, Graphite or even CheckMK fit into the picture?

Kris Buytaert
Chief Travel Officer,
Kris Buytaert is a long-time Linux and Open Source consultant. He is one of the instigators of the devops movement and currently works for Inuits. He frequently speaks at or organizes international conferences and has written about the same subjects in books, papers and articles. He spends most of his time bridging the gap between developers and operations, with a strong focus on High Availability, Scalability, Virtualisation and Large Infrastructure Management projects, trying to build infrastructures that can survive the 10th floor test, better known today as the cloud, while actively promoting the devops idea! His blog titled "Everything is a Freaking DNS Problem" can be found at

Published in: Technology

  1. Monitoring in an IAC Age, PuppetConf 2013, Kris Buytaert
  2. Kris Buytaert ● I used to be a Dev ● Then became an Op ● Chief Trolling Officer and Open Source Consultant @inuits.eu ● Everything is an effing DNS Problem ● Building Clouds since before the bookstore ● Some books, some papers, some blogs ● Evangelizing devops
  3. devops = clams ● Culture ● (Lean) ● Automate all the things ... • Build Automation • Test Automation • IAC ● Monitoring, Metrics ... ● Sharing
  4. Monitoring is usually an afterthought: ENOBUDGET, ENOTIME
  5. #monitoringsucks ● John Vincent (@lusis) ● A sub movement
  6. #monitoringlove • #monitoringlove hacksessions • #monitorama
  7. Infrastructure as Code ● Model our infrastructure ● A fast reproducible platform ● Disaster discovery for “free”
  8. For years we've tolerated humans making structural manual changes to the infrastructure our critical applications are running on, whilst at the same time demanding that those critical applications go through rigid test scenarios. Who let this happen?
  9. Infrastructure as Code ● Code = Code ● Version Control ● Quality Checks ● Testing ● Continuous Integration ● Continuous Delivery
  10. Infrastructure as Code ● Core Infrastructure ● Middleware deployment and integration ● Automated continuous application deployment ● Integrated Security enforcement ● Host, Service and Application Monitoring configured
  11. Where to monitor? ● Dev ● Acceptance ● Prod
  12. Why #monitoringsucks ● Manual config (GUI) ● Not in sync with reality ● Hosts only ● Services sometimes ● Application never ● Chaos
  13. Let's forget about ● Tools with no (stable) API ● Tools with strong focus on GUI ● Unless you are an SME with < 100 nodes ● Zabbix, Zenoss, Hyperic, GroundWork, ...
  14. What we want ● Small, well-suited components • Collect • Transport / Mangle • Analyse / Act • Visualize
  15. Monitoring Baseline ● Deploy a host ● Add it to the monitoring ● Add collection tools ● Add check definitions ● Update the monitoring tool config
  16. Configuring a Service ● Package ● Config ● Service <- we pray status is correct
  17. Apache Example:
  18. Icinga? • Isn't Nagios dead? • Vibrant Community • Throw great parties in Nürnberg • Nobody can pronounce it anyhow
  19. Stored Configs
  20. Collection and Export. Export: @@resource { ... } Collect: Resource <<| query |>> Clean out nodes that disappear: puppet node clean
  21. Exporting and Collecting
  22. Monitoring a Vhost
  23. I love CheckMK ● Autodetection ● Multiplexing ● Trend Forecasting
  24. I hate CheckMK • Autodetection? • Service, • Functionalities • e.g. vhosts etc. • Single Source of Truth
  25. Monitoring a service vs Monitoring a service
  26. Definition of Done: monitored and in production
  27. A software project is not done until your last end user is dead
  28. Exit DOD: Measure Application Usage
  29. But, err, how do I?
  30. Culture, Automation, Measurement: measure all the things, Sharing
  31. Deploy Statistics ● Time To Deploy ● Deploy Frequency ● Lifecycle frequency ● Map to
  32. Application Metrics ● Number of current users ● Number of sign ups ● Response times ● Throughput ● XYZ Usage ● # restarts ● Insert your specific valuable stuff
  33. Graphite API
  34. Triggers on Graphs ● Export Java Metrics ● JMXTrans ● Export JMXConfigs ● Configure NRPE Check ● Export NagiosCheck ● Collect JMX Exports on JMXTransNode ● Graph Em ● Collect Nagios Configs on Nagios Server
  35. Triggers on Graphs
  36. Triggers on Graphs
  37. Self Service: Gdash based pipelines, Puppetized Templates (wip)
  38. Up Next: • Creating Information out of this data • Big data • Machine Learning
  39. Homework: Skyline, Oculus, Dusk, Riemann, Esper, Puppetdb external Naginator
  40. Contact, Further Reading: @krisbuytaert. Inuits, Duboistraat 50, 2060 Antwerpen, Belgium. 891.514.231. +32 475 961221
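
The monitoring baseline on slide 15 (deploy a host, add it to the monitoring, add check definitions, update the monitoring tool config) can be sketched outside Puppet as a small generator that renders Nagios/Icinga object definitions from an inventory. This is only an illustration, not the setup shown in the talk: the `Host` class, the example check commands, and the `generic-host` / `generic-service` template names are assumptions that would have to exist in your own configuration.

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    """Hypothetical inventory entry: one deployed host and its checks."""
    name: str
    address: str
    # Maps a service description to a check command name.
    checks: dict = field(default_factory=dict)


def render_host(host):
    """Render one host plus its services as Nagios-style object definitions."""
    lines = [
        "define host {",
        "    use        generic-host",  # assumed template name
        f"    host_name  {host.name}",
        f"    address    {host.address}",
        "}",
    ]
    for description, command in sorted(host.checks.items()):
        lines += [
            "define service {",
            "    use                  generic-service",  # assumed template name
            f"    host_name            {host.name}",
            f"    service_description  {description}",
            f"    check_command        {command}",
            "}",
        ]
    return "\n".join(lines) + "\n"


def render_inventory(hosts):
    """Render every host, sorted by name so the output is stable."""
    ordered = sorted(hosts, key=lambda h: h.name)
    return "\n".join(render_host(h) for h in ordered)
```

Regenerating the whole file from the inventory on every run is what keeps the monitoring config in sync with reality, which is the complaint raised on slide 12.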
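
The export, collect and clean cycle from slide 20 can be modelled with a toy in-memory store. This is not Puppet's API or its stored-config database; `ExportedResourceStore`, its method names and the example resource titles are all hypothetical, and the sketch only mirrors the idea that each node exports resources, the monitoring server collects those matching a query, and cleaning a node removes everything it exported.

```python
class ExportedResourceStore:
    """Toy stand-in for a stored-config database (not Puppet's API)."""

    def __init__(self):
        # node name -> {(resource type, title): parameters}
        self._by_node = {}

    def export(self, node, rtype, title, tag=None, **params):
        """Roughly what '@@resource { ... }' does on the exporting node."""
        resources = self._by_node.setdefault(node, {})
        resources[(rtype, title)] = {"tag": tag, **params}

    def collect(self, rtype, tag=None):
        """Roughly what 'Resource <<| query |>>' does on the collecting node."""
        found = []
        for node in sorted(self._by_node):
            for (t, title), params in sorted(self._by_node[node].items()):
                if t == rtype and (tag is None or params["tag"] == tag):
                    found.append((node, title, params))
        return found

    def clean(self, node):
        """Roughly what 'puppet node clean' achieves for a node that is gone."""
        self._by_node.pop(node, None)
```

A short usage example, with made-up node and check names:

```python
store = ExportedResourceStore()
store.export("web01", "nagios_service", "http_web01",
             tag="prod", check_command="check_http")
store.export("db01", "nagios_service", "mysql_db01",
             tag="prod", check_command="check_mysql")
store.clean("db01")  # db01 was decommissioned
remaining = store.collect("nagios_service", tag="prod")
```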
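
Application metrics such as those on slide 32 (sign ups, response times, restarts) can be pushed to Graphite with very little code. The sketch below uses Carbon's plaintext protocol, one `path value timestamp` line per metric, sent over TCP to port 2003 by default. The metric names, the host name and the function names are illustrative assumptions, not taken from the talk.

```python
import socket
import time


def format_metric(path, value, timestamp=None):
    """Build one line of Carbon's plaintext protocol."""
    ts = int(time.time()) if timestamp is None else int(timestamp)
    return f"{path} {value} {ts}\n"


def send_metrics(lines, host="localhost", port=2003, timeout=5.0):
    """Send already formatted lines to a Carbon plaintext listener."""
    payload = "".join(lines).encode("ascii")
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(payload)


def example():
    # Hypothetical metric paths; pick a naming scheme that suits your app.
    now = time.time()
    send_metrics([
        format_metric("myapp.signups", 3, now),
        format_metric("myapp.response_time_ms", 182.5, now),
    ], host="graphite.example.org")
```

Keeping the formatting separate from the socket code makes the metric lines easy to check without a running Graphite instance.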
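
The "Triggers on Graphs" idea from slide 34 boils down to a check that reads recent datapoints from Graphite and turns them into a Nagios-style state. A minimal sketch follows, assuming the Graphite render API with `format=json` (which returns series with `[value, timestamp]` datapoints, where the value may be null) and the usual plugin exit codes 0 to 3. The averaging rule, the thresholds and the function names are assumptions chosen for illustration.

```python
import json
import urllib.parse
import urllib.request

# Conventional Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def fetch_datapoints(base_url, target, window="-5min"):
    """Fetch [value, timestamp] pairs for one target from the render API."""
    query = urllib.parse.urlencode(
        {"target": target, "from": window, "format": "json"})
    url = f"{base_url}/render?{query}"
    with urllib.request.urlopen(url, timeout=10) as resp:
        series = json.load(resp)
    return series[0]["datapoints"] if series else []


def evaluate(datapoints, warn, crit):
    """Map the average of the non-null values to a (state, message) pair."""
    values = [v for v, _ts in datapoints if v is not None]
    if not values:
        return UNKNOWN, "UNKNOWN: no data"
    avg = sum(values) / len(values)
    if avg >= crit:
        return CRITICAL, f"CRITICAL: average {avg:.2f} >= {crit}"
    if avg >= warn:
        return WARNING, f"WARNING: average {avg:.2f} >= {warn}"
    return OK, f"OK: average {avg:.2f}"
```

A wrapper script would print the message and exit with the state, so that it can be run through NRPE like any other check.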