Boost Fertility New Invention Ups Success Rates.pdf
Monitoring in an Infrastructure as Code Age
1. Monitoring in an IAC AgeMonitoring in an IAC Age
PuppetConf 2013
Kris Buytaert
2. Kris BuytaertKris Buytaert
● I used to be a Dev,I used to be a Dev,
● Then Became an OpThen Became an Op
● Chief Trolling Officer and Open SourceChief Trolling Officer and Open Source
Consultant @inuits.euConsultant @inuits.eu
● Everything is an effing DNS ProblemEverything is an effing DNS Problem
● Building Clouds since before the bookstoreBuilding Clouds since before the bookstore
● Some books, some papers, some blogsSome books, some papers, some blogs
● Evangelizing devopsEvangelizing devops
3. devops = clamsdevops = clams
● CultureCulture
● (Lean)(Lean)
● Automate all the things ...Automate all the things ...
•
Build AutomationBuild Automation
•
Test AutomationTest Automation
•
IACIAC
● Monitoring , Metrics ...Monitoring , Metrics ...
● SharingSharing
4. Monitoring is usually anMonitoring is usually an
aftertoughtaftertought
ENOBUDGET, ENOTIMEENOBUDGET, ENOTIME
7. Infrastructure as CodeInfrastructure as Code
● Model our infrastructureModel our infrastructure
● A fast reproducable platformA fast reproducable platform
● Disaster discovery for “free”Disaster discovery for “free”
8. For years we've tolerated humans to to makeFor years we've tolerated humans to to make
structural manual changes to the infrastructurestructural manual changes to the infrastructure
our critical applications are running on.our critical applications are running on.
Whilst at the same time demanding those criticalWhilst at the same time demanding those critical
applications to go trough rigid test scenarios.applications to go trough rigid test scenarios.
Who let this happen ?Who let this happen ?
9. Infrastructure as CodeInfrastructure as Code
● Code = CodeCode = Code
● Version ControlVersion Control
● Quality ChecksQuality Checks
● TestingTesting
● Continuous IntegrationContinuous Integration
● Continous DeliveryContinous Delivery
10. Infrastructure as CodeInfrastructure as Code
● Core InfrastructureCore Infrastructure
● Middleware deployment andMiddleware deployment and
integrationintegration
● Automated continuous applicationAutomated continuous application
deploymentdeployment
● Integrated Security enforcementIntegrated Security enforcement
● Host, Service and ApplicationHost, Service and Application
Monitoring configuredMonitoring configured
11. Why #monitoringsucksWhy #monitoringsucks
● Manual config (gui)Manual config (gui)
● Not in sync with realityNot in sync with reality
● Hosts onlyHosts only
● Services sometimesServices sometimes
● Appliccation neverAppliccation never
● ChaosChaos
12. Let's forget aboutLet's forget about
● Tools with no (stable) APITools with no (stable) API
● Tools with strong focus on GUITools with strong focus on GUI
● Unless you are an SME with < 100 nodesUnless you are an SME with < 100 nodes
● Zabixx, Zenoss, Hyperic, GroundWork, ....Zabixx, Zenoss, Hyperic, GroundWork, ....
13. Where to monitor ?Where to monitor ?
● DevDev
● AcceptanceAcceptance
● ProdProd
14. What we wantWhat we want
● Small , wel suited componentsSmall , wel suited components
•
CollectCollect
•
Transport / MangleTransport / Mangle
•
Analyse / ActAnalyse / Act
•
VisualizeVisualize
15. Monitoring BaselineMonitoring Baseline
● Deploy a host,Deploy a host,
● Add it to the monitoringAdd it to the monitoring
● Add collection toolsAdd collection tools
● Add check definitionsAdd check definitions
● Update the monitoring tool configUpdate the monitoring tool config
18. Icinga ?Icinga ?
•
Isn't nagios dead ?Isn't nagios dead ?
•
Vibrant CommunityVibrant Community
•
Throw great parties in NurnbergThrow great parties in Nurnberg
•
Nobody can pronounce it anyhowNobody can pronounce it anyhow
•
https://github.com/Inuits/puppet-icinga/https://github.com/Inuits/puppet-icinga/
31. Deploy StatisticsDeploy Statistics
● Time To DeployTime To Deploy
● DeployDeploy
FrequencyFrequency
● LifecycleLifecycle
frequencyfrequency
● Map toMap to
32. Application MetricsApplication Metrics
● Number of current usersNumber of current users
● Number of sign upsNumber of sign ups
● Response timesResponse times
● TroughputTroughput
● XYZ UsageXYZ Usage
● # restarts# restarts
● Insert your specific valuable stuffInsert your specific valuable stuff
37. Self ServiceSelf Service
Gdash based pipelinesGdash based pipelines
Puppetized Templates (wip)Puppetized Templates (wip)
38. Up Next:Up Next:
•
Creating Information out of this dataCreating Information out of this data
•
Big dataBig data
•
Machine LearningMachine Learning