A Primer Towards Running Kafka on Top of Kubernetes.pdf

Lessons on running Kafka on K8S
Avinash Upadhyaya
Tech Socialite @ Platformatory
Ashwin Venkatesh
Principal Engineer @ Platformatory

Speaker Info
● Platform engineer @ platformatory.io
● Meetup organizer for Kong, Kafka, Grafana,
Docker and Bangalore Streams
● CCDAK among other cloud certs
● Principal Engineer @ platformatory.io
● Experienced Apache Kafka consultant
● CCDAK

Hold my beer while I rebalance stuff

You want to run Kafka on K8S?
- More gluttony for torture
- Surprisingly simpler than configuring server.properties by hand (or with Ansible)
- (if done well)

The Operator Pattern in summary
- A Kubernetes operator watches a custom resource (CR) type and takes application-specific actions to make the
current state match the desired state in that resource
- Implements domain-specific knowledge using Kubernetes
- Allows managing complex applications using the Kubernetes API and the kubectl interface
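
The watch-and-reconcile idea above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: `KafkaClusterSpec`, `ObservedState`, and `reconcile` are hypothetical names, and a real operator would call the Kubernetes API rather than return strings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KafkaClusterSpec:
    """Desired state, as a user might declare it in a CR (hypothetical shape)."""
    replicas: int
    version: str


@dataclass
class ObservedState:
    """What the operator currently sees running in the cluster."""
    broker_versions: dict[int, str]  # broker ID -> running Kafka version


def reconcile(spec: KafkaClusterSpec, observed: ObservedState) -> list[str]:
    """Return the actions needed to move the observed state toward the spec."""
    actions: list[str] = []
    running = sorted(observed.broker_versions)

    # Scale out: create brokers the spec asks for that do not exist yet.
    for broker_id in range(spec.replicas):
        if broker_id not in observed.broker_versions:
            actions.append(f"create broker {broker_id} at version {spec.version}")

    # Scale in: remove brokers beyond the desired count.
    for broker_id in running:
        if broker_id >= spec.replicas:
            actions.append(f"drain and delete broker {broker_id}")

    # Upgrade: restart remaining brokers whose version differs from the spec.
    for broker_id in running:
        if broker_id < spec.replicas and observed.broker_versions[broker_id] != spec.version:
            actions.append(f"rolling-restart broker {broker_id} to version {spec.version}")

    return actions


spec = KafkaClusterSpec(replicas=3, version="3.6.0")
observed = ObservedState(broker_versions={0: "3.6.0", 1: "3.5.1"})
for action in reconcile(spec, observed):
    print(action)
```

Running the loop again once the actions have been applied returns an empty list, which is the point of the pattern: the operator keeps comparing desired and current state until there is nothing left to do.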

Any complex stateful
workload that can’t be run
as a fully managed service
will be provided as a K8S
operator

Scope of coverage:
A mental model of Kubernetes operators for Kafka
- Operator Core
- Custom Resources
- Workload Type
- Networking
- Storage
- Security
- Authentication
- Authorization
- Operational Features
- Balancing
- Monitoring
- Disaster Recovery
- Scale up/out
- Deployments & Rollouts
- Extensibility

Security: What are typical requirements for Kafka?
● Auto-generate certificates for TLS and mTLS between brokers and other internal components
● Natively support authentication mechanisms such as SASL/PLAIN, SASL/SCRAM,
SASL/OAUTHBEARER, and SASL/GSSAPI
● Authorization with ACLs; provide user management capabilities using the k8s API
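
As one illustration of user management through the k8s API, the sketch below builds a user manifest with mTLS authentication and a read-only ACL. The field names follow Strimzi's `KafkaUser` resource as best understood here and should be checked against the documentation for the operator version in use; the user, cluster, and topic names are made up for the example.

```python
import json


def kafka_user_manifest(name: str, cluster: str, topic: str) -> dict:
    """Build a KafkaUser-style manifest as a plain dict (no cluster access needed)."""
    return {
        "apiVersion": "kafka.strimzi.io/v1beta2",
        "kind": "KafkaUser",
        "metadata": {
            "name": name,
            # Label that ties the user to a Kafka cluster managed by the operator.
            "labels": {"strimzi.io/cluster": cluster},
        },
        "spec": {
            # mTLS client authentication.
            "authentication": {"type": "tls"},
            "authorization": {
                "type": "simple",
                "acls": [
                    {
                        "resource": {"type": "topic", "name": topic, "patternType": "literal"},
                        "operations": ["Read", "Describe"],
                        "host": "*",
                    }
                ],
            },
        },
    }


# Hypothetical names, for illustration only.
manifest = kafka_user_manifest("orders-consumer", "my-cluster", "orders")
print(json.dumps(manifest, indent=2))
```

The printed JSON could be applied with kubectl like any other resource, so users and ACLs live in version control next to the rest of the platform configuration.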

Operations: What are typical requirements for Kafka?
● Rebalance partitions when the load on the brokers is uneven or when a broker is added or removed
● Monitoring cluster health with JMX metrics
● Rolling upgrades with no downtime
● Replicate data across clusters
● Rack awareness for durability
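
To make the rebalancing requirement concrete, here is a small sketch that flags brokers whose replica count is far from the cluster average. It is a deliberately naive heuristic (replica counts only, with a hypothetical 20% tolerance); real rebalancing tools also weigh disk, network, and leadership, and the function names are invented for this example.

```python
def replica_counts(assignment: dict[str, list[int]], brokers: list[int]) -> dict[int, int]:
    """Count how many partition replicas each broker hosts."""
    counts = {broker: 0 for broker in brokers}
    for replicas in assignment.values():
        for broker in replicas:
            counts[broker] = counts.get(broker, 0) + 1
    return counts


def find_imbalance(counts: dict[int, int], tolerance: float = 0.2) -> tuple[list[int], list[int]]:
    """Return (overloaded, underloaded) brokers relative to the mean replica count."""
    mean = sum(counts.values()) / len(counts)
    over = sorted(b for b, c in counts.items() if c > mean * (1 + tolerance))
    under = sorted(b for b, c in counts.items() if c < mean * (1 - tolerance))
    return over, under


# Six partitions with replication factor 2 on brokers 1-3; broker 4 was just added.
assignment = {
    "orders-0": [1, 2], "orders-1": [2, 3], "orders-2": [3, 1],
    "orders-3": [1, 2], "orders-4": [2, 3], "orders-5": [3, 1],
}
counts = replica_counts(assignment, brokers=[1, 2, 3, 4])
overloaded, underloaded = find_imbalance(counts)
print("replicas per broker:", counts)
print("overloaded:", overloaded, "underloaded:", underloaded)
```

A newly added broker starts empty, so it shows up as underloaded until partitions are reassigned to it. That reassignment is the step an operator is expected to automate.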

Confluent for Kubernetes (CFK)
● Confluent Platform on Kubernetes
● Based on experience of running Kafka on
Kubernetes for Confluent Cloud
● Uses StatefulSets for restoring a Kafka pod with
the same Kafka broker ID, configuration, and
persistent storage volumes if a failure occurs.
● Provides server properties, JVM, and Log4j
configuration overrides for customization of all
Confluent Platform components.
● Complete granular RBAC
● Support for credential management systems,
such as HashiCorp Vault, to inject sensitive
configurations in memory into Confluent
deployments
● Supports tiered storage
● Supports multi-region

Strimzi
● Open source, CNCF sandbox project
● Implement security in a Kubernetes-native
fashion
● Uses StrimziPodSets to overcome challenges of
StatefulSets
○ Add/remove broker arbitrarily
○ Stretch cluster across k8s clusters
○ Different configurations and volumes for different
brokers
● KafkaBridge for a RESTful HTTP interface

Koperator (Banzai
Cloud)
● Open-source core component of Banzai Cloud
Supertubes
○ most of the compelling features and integrations
are only available as part of the Supertubes Core
or Supertubes Pro product suites
● Envoy based load balancing for external access
● Uses pods instead of StatefulSets, in order to
○ modify the configuration of unique Brokers
○ remove specific Brokers from clusters
○ use multiple Persistent Volumes for each Broker
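
Both Strimzi (StrimziPodSets) and Koperator (plain pods) step away from StatefulSets for the same reason: a StatefulSet identifies pods by a contiguous range of ordinals, and scaling down always removes the highest one. The toy model below shows the difference; it does not talk to Kubernetes and the function names are hypothetical.

```python
def statefulset_pods(replicas: int) -> list[int]:
    """By default, a StatefulSet with N replicas runs ordinals 0..N-1."""
    return list(range(replicas))


def scale_down_statefulset(replicas: int) -> list[int]:
    """Scaling a StatefulSet down by one removes the highest ordinal."""
    return statefulset_pods(replicas - 1)


def remove_from_pod_set(broker_ids: set[int], broker_id: int) -> set[int]:
    """An operator that manages pods individually can drop any broker ID."""
    return broker_ids - {broker_id}


# Goal: decommission broker 1 from a four-broker cluster.
print("StatefulSet after scale-down:", scale_down_statefulset(4))
print("Pod set after removing broker 1:", sorted(remove_from_pod_set({0, 1, 2, 3}, 1)))
```

With the StatefulSet model, broker 3 goes away whether or not it was the one to be retired. With individually managed pods, broker 1 can be removed and brokers 0, 2, and 3 keep their IDs and volumes.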

Prescriptive Advice
- As with all things k8s, it is important to set up
resource constraints (CPU, memory limits)
- Generally advised to taint Kafka nodes with
NoSchedule and run brokers on a dedicated basis.
- = no bin-packing on those nodes
- For most real-life use cases, CRs are a starting
point. They will need to be packaged into “platform
recipes” with different components, orienting
some level of tenancy around the brokers as
well as the components
- Typically a higher-order Helm chart, preferably
with GitOps-style deployments
- Prospective users must also think about operator
tenancy itself. It could be a global operator or a
namespaced operator
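
The first two points translate into standard Kubernetes pod fields. The sketch below builds them as a Python dict and prints JSON. The `resources`, `tolerations`, and `nodeSelector` fields are core Kubernetes API; the `dedicated=kafka` taint and label, the sizing values, and the function name are assumptions for the example. Where these fragments go inside a given operator's CR differs per operator.

```python
import json

# Hypothetical taint and label on the dedicated Kafka nodes, e.g. applied with:
#   kubectl taint nodes <node-name> dedicated=kafka:NoSchedule
#   kubectl label nodes <node-name> dedicated=kafka
NODE_KEY = "dedicated"
NODE_VALUE = "kafka"


def broker_scheduling(cpu: str, memory: str) -> dict:
    """Return container- and pod-level fragments for a dedicated Kafka broker."""
    return {
        "container": {
            "resources": {
                # Equal requests and limits place the pod in the Guaranteed QoS class.
                "requests": {"cpu": cpu, "memory": memory},
                "limits": {"cpu": cpu, "memory": memory},
            }
        },
        "pod": {
            # Lets the broker pod onto the tainted nodes...
            "tolerations": [
                {"key": NODE_KEY, "operator": "Equal", "value": NODE_VALUE, "effect": "NoSchedule"}
            ],
            # ...and keeps it off every other node.
            "nodeSelector": {NODE_KEY: NODE_VALUE},
        },
    }


fragments = broker_scheduling(cpu="4", memory="16Gi")
print(json.dumps(fragments, indent=2))
```

The taint keeps other workloads off the Kafka nodes, while the toleration plus node selector pins brokers onto them. Both halves are needed for the dedicated setup the slide describes.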

Key Takeaways
- Running Kafka on K8S without an operator can
be a lot of toil. If you are running Kafka at
scale (and not on a managed service), consider
using an operator. It will save you time, money &
sanity
- You can make a choice based on your
environment, features (or the lack thereof),
licensing and other specialized purposes
- YMMV with Operator CRs. Each operator has its
own opinion based on the realities it was
designed for
- Kafka is ultimately not “k8s native”. The operator
only provides so much operational sugar
- As a result, there are several shoehorning
mechanisms (such as built-in config overrides to
inject component properties); full expressivity of
the workload doesn’t quite exist
- All operators provide comparable performance

Thank you
hello@platformatory.com
www.platformatory.io
