2. Agenda
• Some Items about Kubernetes
• Lift and Shift
• Evolving Applications for Kubernetes
• Changing People/Processes for Kubernetes
3. tl;dr
• Be ready for change
• As you evolve your processes/support, you have to provide backwards compatibility for both your infrastructure/applications and your processes
• Really focus on having a stable deployment mechanism
• Sort out your interaction contracts
• Start small and constrained - Realize that you won't do it "right"
• You will have to make application changes
• You will have to change your expectations
• Be ready for cutting edges
• Kubernetes has a very simple set of core primitives, and a lot of options to build on top
• CICD
• Authentication and Access Control
• Resource management - forcing the use of limits/requests
• Avoiding mixing goals
4. Perspective (Me)
• I'm a platform administrator. I make sure the clusters...
• are available,
• have resources,
• connect to the rest of the infrastructure.
• I run applications, but for the most part help app teams use the platform
5. Survey (You)
• Who's familiar with cfengine / puppet / chef / ansible / etc?
• Who's familiar with current container technology?
• Who's familiar with Kubernetes? The object / resource model?
• Who's running Kubernetes in Production (even for the smallest workload)?
7. Declarative State
• Tell me what you want
• Not how to do it
• How to do it can change in different contexts
• "LoadBalancer" is slightly different in AWS, GCP, Azure
• All state stored in the API Server
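As a sketch of declarative state (names are illustrative): a Service manifest declares *what* is wanted — an externally reachable load balancer for a set of pods — and the cloud's controller decides *how*, which is why "LoadBalancer" differs slightly across AWS, GCP, and Azure.

```yaml
# Declares *what* we want: an external load balancer for pods
# labeled app: myapp. *How* it's provisioned (ELB, GCLB, Azure LB)
# is decided by the environment's controller, not by this file.
apiVersion: v1
kind: Service
metadata:
  name: myapp            # illustrative name
spec:
  type: LoadBalancer
  selector:
    app: myapp
  ports:
    - port: 80
      targetPort: 8080
```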
8. Reconciliation Loop Driven
• Many specific independent actors
• Controllers
• Operators
• Actors reconcile declared state with current state
• An actor can change declarative state for another actor and trigger its actions
• Main ones
• scheduler
• controller-manager
• kubelet
10. Networking
• Everything in the cluster is reachable from everything else
• (Policy might restrict)
• Magic Mappings.
• L4 Load Balancer
• DNS Mapping
• Map from outside cluster to inside
11. Resources
• What's used to define declarative state
• Stored in cluster (API Server)
• aka Manifests
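Every manifest shares the same skeleton; a minimal sketch (names are illustrative), stored in the API Server once applied with `kubectl apply -f <file>`:

```yaml
# Common skeleton: apiVersion, kind, metadata, plus a
# type-specific body (spec for most types; ConfigMaps use data).
apiVersion: v1
kind: ConfigMap
metadata:
  name: example-config   # illustrative name
  namespace: default
data:
  greeting: hello
```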
13. Pod
• Collection of containers working tightly together
• Unit of Scheduling
• Share network stack
• Can share disk mounts
• Sidecar: a support container running with the application container
20. Goals
• Run an application inside of Kubernetes
• Change the code as little as possible
• Hook into the existing infrastructure as much as possible
• Keep it simple - avoid state, storage, etc
• Kick the tires
26. App Design for Kubernetes
• Application container with a Logstash sidecar container (one Pod)
• ConfigMap holding prerendered output from Chef. Mounted under conf dir
• Shared mount (emptydir) for log output
• Written by app
• Read by logstash
• Service definition to map from outside to inside
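The design above as a Pod sketch (names, images, and paths are illustrative): the app writes logs into a shared emptyDir, the Logstash sidecar reads them, and the Chef-prerendered config is mounted from a ConfigMap.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: myapp
spec:
  volumes:
    - name: conf
      configMap:
        name: myapp-conf   # holds prerendered output from Chef
    - name: logs
      emptyDir: {}         # shared mount for log output
  containers:
    - name: app
      image: myapp:1.0     # illustrative image
      volumeMounts:
        - name: conf
          mountPath: /app/conf   # mounted under conf dir
        - name: logs
          mountPath: /app/logs   # written by app
    - name: logstash
      image: logstash:7          # illustrative image
      volumeMounts:
        - name: logs
          mountPath: /logs       # read by logstash
```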
34. Lessons Learned
• Startup takes its time (Tomcat startup)
• Debugging (kubectl exec)
• ConfigMap/Deployments only worked for one environment
• Healthchecks didn't fit well in the model and worked counter to debugging steps
38. 1. Single Concern Principle
• Do one thing (and do it well)
• Separation of Concerns
• Target updates
• Minimize (vertical) image sprawl
39. 2. Image Immutability Principle
• An image is a delivery artifact, with all of the properties that implies
• "Build once, deploy everywhere"
• Don't layer configuration on as part of the image (unless you're putting *all* foreseeable configuration possibilities in there)
40. 3. Self-Containment Principle
• Extension of Image Immutability
• On deployment, layer in instance-unique items (config, data)
• This uniqueness layer should be specific to this instance
41. 4. Runtime Confinement Principle
• Get an understanding of your resource requirements
• And use them! (helps with scheduling)
• Without them ==>
• Unintentional, uninformed oversubscription
• Roving micro-oversubscription hotspots
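Stating requirements in the manifest is what lets the scheduler place pods sensibly. A container fragment (values are illustrative starting points):

```yaml
containers:
  - name: app
    image: myapp:1.0     # illustrative image
    resources:
      requests:          # what the scheduler reserves on a node
        cpu: 250m
        memory: 256Mi
      limits:            # hard ceiling enforced at runtime
        cpu: "1"
        memory: 512Mi
```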
42. 5. Process Disposability Principle
• Processes are ephemeral
• Be ready for them to not be there
• This will happen often (every change)
43. Containers (by themselves) are half suited for Kubernetes
• Kubernetes builds on containers
• If you have been following container modeling, that translates directly
46. 7. High Observability Principle
• Change in behaviors
• Biggest change in thinking
• Forced thinking of items like health checks, monit, et al.
• Add to Disposability Principle - have to be able to debug quickly, over the network, and with remaining forensics
48. Be Ready for Change
• Changed Deployment Strategy
• Single manifests -> Helm Charts
• Changed Helm Chart Structure 5 Times
• Changed Logging Infrastructure 3 Times
• BE VERY CAREFUL IN WHAT YOU STOP SUPPORTING
49. Contracts
• Describe what each side/component of processes will provide and accept
• Helps to define
• What can be changed without impacting others
• What needs to be talked about before changing
50. App Team Onboarding to Cluster (contract)
• App Team / User: defines the App Definition (name, resource count, users); Kubernetes Team: receives the App Definition
• Kubernetes Team: defines namespace, RBAC; App Team / User: receives namespace, RBAC
• App Team / User: logs in via central auth; Kubernetes Cluster: trusts central auth
• App Team / User: accesses namespace; Kubernetes Cluster: allows access to granted resources
51. Logging Contract
• App (Pod): logs to STDOUT; Kubernetes Cluster: receives from STDOUT
• Kubernetes Cluster: transmits to the Logging Bus; Monitoring: receives on the Logging Bus
• App (Pod): emits JSON structured log format; Monitoring: handles the JSON format
• Monitoring: indexes in the Search Tool; App Team / User: searches in the Search Tool
• Monitoring: adds infrastructure enrichments (pod, cluster, container host, environment, datacenter); App Team / User: searches by infrastructure information
52. Be Comfortable with Being Uncomfortable
• A lot of this technology is new/recent
• A lot of simple implementations (first pass)
• A lot of undiscovered bugs
• "Best practices" are highly localized
53. Simplified Primitives
• Deployments
• All at once (destroy + build)
• Rolling
• Load Balancing
• Only equal weight round robin (be it via L4 forwarding, or DNS)
• What's Layer7?
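The two built-in Deployment options above map directly onto the manifest's strategy field; a fragment as a sketch:

```yaml
spec:
  strategy:
    type: RollingUpdate     # gradual replacement of pods
    rollingUpdate:
      maxUnavailable: 1
      maxSurge: 1
  # type: Recreate is the "all at once (destroy + build)" option
```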
54. Common App Team Concerns
• How do I get onboarded?
• How does my application have to interact with the system?
• How do I run my application?
• How do I troubleshoot my application?
• How does it all work?!!?!?!
55. How do I run my application?
• Build an Application Template
• Dockerfile
• Helm Chart
• Jenkinsfile
• Extend with organization-specific functions
• Partial Deploy functions
• Incorporate environment values
60. New Application Model
• Jar App (faster start up)
• zmetrics port (separate from client interface)
• Prometheus scrapes metrics
• Readiness/liveness probes
• Logs to STDOUT
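A container fragment sketching the model above (ports and paths are illustrative): metrics exposed on a zmetrics port separate from the client interface, plus the probes.

```yaml
containers:
  - name: app
    image: myapp:2.0               # illustrative jar app image
    ports:
      - name: http
        containerPort: 8080        # client interface
      - name: zmetrics
        containerPort: 9090        # Prometheus scrapes this port
    readinessProbe:
      httpGet: { path: /ready, port: 8080 }
    livenessProbe:
      httpGet: { path: /alive, port: 8080 }
```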
61. New Deployment Model
• CICD Driven
• Standard format for repository
• Dockerfile, Chart --> artifacts
• Environment specific values
• Multiproject pipeline pushes to multiple environments with approval gates
• Automatic canary deploy, sanity check, then full deploy
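One way to sketch the automatic canary step (all names are illustrative, not the deck's actual pipeline): a second, small Deployment whose pods carry the Service's selector labels, so a slice of traffic hits the candidate build before the full deploy.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: myapp-canary         # illustrative name
spec:
  replicas: 1                # small slice next to the main replicas
  selector:
    matchLabels: { app: myapp, track: canary }
  template:
    metadata:
      # 'app: myapp' matches the Service selector; 'track' separates
      # canary pods from the stable Deployment's pods
      labels: { app: myapp, track: canary }
    spec:
      containers:
        - name: app
          image: myapp:2.1-rc    # the candidate build
```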
63. • If you change too quickly, you will be in for a world of hurt
• Different ways to deploy
• Different Kubernetes API versions (v1alpha1, v1beta1, v1)
• You can't please everyone
• Tradeoffs
• Training - bootcamps and walking people through...
• Examples examples examples - easy to copy (cargo culting)
64. Health Checks
• Liveness probe: If this fails, Kubernetes will restart the container.
• Readiness probe: If this fails, Kubernetes will take the pod out of the service pool.
• If an app is bad, I should stop sending traffic to it and recover it, right? ==> Ok to set these to the same thing.
https://cloud.google.com/kubernetes-engine/kubernetes-comic/
65. "We DDOSed Ourselves!!!"
• On startup, the application reports ready before it's warmed up
• Gets flooded with traffic
• Kube restarts because liveness failed as well
• Quick fix: Removed liveness
• Real fix:
• Run liveness and readiness on a different port/connection threadpool/etc
• Know they mean different things
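The "real fix" as a probe fragment (ports and paths are illustrative): liveness served from a separate management port with its own threadpool, so a traffic flood that makes the app unready cannot also fail liveness and trigger a restart loop.

```yaml
readinessProbe:
  httpGet: { path: /ready, port: 8080 }   # client port: flooded => unready
livenessProbe:
  httpGet: { path: /alive, port: 8081 }   # separate management port
  initialDelaySeconds: 30                 # allow for slow startup
```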
66. • Prometheusbeat has limited support
• Security scanning checkbox
• Type:LoadBalancer Services (and anything built off them) get a permit *
• ICMP Destination Unreachable (Type 3) runs afoul of security policies
• Provide helper tools to set up configuration
• Login ==> can also gather cluster information like certificates and endpoints