Things like Infrastructure as Code, Service Discovery and Config Management have helped us to quickly build and rebuild infrastructure, but we haven't spent nearly enough time training ourselves to review, monitor and respond to outages. Does our platform degrade gracefully? What does a high CPU load really mean? What can we learn from level 1 outages so that we can run our platforms more reliably?
We all love infrastructure as code, we automate everything ™, but how many of us can really say we could destroy and recreate our core infrastructure without human intervention? A container scheduler adds application life cycle management, scheduling and placement based on available resources, and connectivity features to your cloud system. It takes the responsibility for these tasks off your hands. During this talk we will investigate how we can set up and monitor a cloud native container platform that scales, using HashiCorp's Consul and Nomad for service discovery and container scheduling, and we will define some best practices that allow your platform to heal itself.