In a typical Kafka deployment with many topics and partitions, scaling consumers efficiently is one of the key tasks in keeping overall Kafka operations smooth. The traditional Kubernetes Horizontal Pod Autoscaler (HPA), which relies on basic CPU and/or memory metrics, is not well suited to scaling Kafka consumers. A more appropriate workload metric for a Kafka consumer is the number of messages queued on the Kafka broker; more specifically, the message production rate of the topic the consumer reads from is the right workload metric.
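As a minimal sketch of the idea, the production rate of a topic can be estimated from two snapshots of its partition end offsets taken a known interval apart. The function and sample offsets below are illustrative, not taken from any specific client library:

```python
# Estimate a topic's message production rate from two snapshots of its
# partition end offsets (as returned by, e.g., a Kafka admin client).
# The helper name and sample data are hypothetical.

def production_rate(offsets_t0: dict, offsets_t1: dict, interval_s: float) -> float:
    """Messages produced per second, summed across all partitions of a topic."""
    produced = sum(offsets_t1[p] - offsets_t0[p] for p in offsets_t0)
    return produced / interval_s

# Example: end offsets per partition sampled 30 seconds apart.
t0 = {0: 1_000, 1: 2_000}   # partition -> end offset at time t0
t1 = {0: 1_600, 1: 2_900}   # partition -> end offset at time t1
print(production_rate(t0, t1, interval_s=30.0))  # 50.0 messages/second
```

In practice the offsets would come from the broker (for example via an admin client's end-offset query), sampled on the autoscaler's polling interval.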
While the message production rate is a better basis for deciding the number of consumer replicas, scaling on the current rate is still reactive. With machine-learning-based forecasting, it is possible to predict upcoming increases or decreases in the message production rate. With the predicted workload, Kafka consumers can be scaled ahead of demand, resulting in better performance KPIs.
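To make the predictive step concrete, here is a deliberately simple stand-in for the ML-based forecasting described above: an ordinary least-squares trend fit over a sliding window of recent rate samples, extrapolated one step ahead. A real deployment would use a proper time-series model; this sketch only shows where a forecast plugs into the loop:

```python
# Minimal least-squares trend forecast of the next production-rate sample.
# This is an illustrative placeholder for the ML-based forecasting in the
# text, not the model used by any particular product.

def forecast_next(rates: list[float]) -> float:
    """Fit y = a + b*x to the recent samples and extrapolate one step."""
    n = len(rates)
    xs = range(n)
    x_mean = sum(xs) / n
    y_mean = sum(rates) / n
    b = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, rates)) / \
        sum((x - x_mean) ** 2 for x in xs)
    a = y_mean - b * x_mean
    return a + b * n  # predicted rate at the next time step

# A steadily rising workload: 100, 110, 120, 130 msgs/s ...
print(forecast_next([100.0, 110.0, 120.0, 130.0]))  # 140.0
```

The point of forecasting is that the replica count can be raised before the spike arrives, rather than after consumer lag has already built up.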
Intelligent Auto-scaling of Kafka Consumers with Workload Prediction | Ming Sheu, ProphetStor Data Services Inc.
1. Intelligent Autoscaling of Kafka Consumers with Workload Prediction
Kafka Summit Americas 2021
Ming Sheu
EVP, Product
ProphetStor Data Services, Inc.
11. Federator.ai Intelligent Autoscaling with Workload Prediction
[Diagram: a Kafka Cluster coordinated by a ZooKeeper Service. Producers (Producer 1, Producer 2) write to a Kafka Broker hosting Topic 1 (Partition 0, Partition 1) and Topic 2 (Partition 0); a Consumer Group (Consumer 1, Consumer 2, Consumer 3) reads from the partitions.]
• Use message production rate as a workload indicator
• Make predictions on the workload
• Calculate the right number of consumers based on workload prediction and target KPI metrics
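The last bullet can be sketched as a small sizing function: given a predicted production rate and an assumed per-consumer throughput (a stand-in for the target KPI metrics), compute the replica count, capped at the partition count since consumers in a group beyond the number of partitions sit idle. The parameter names are illustrative:

```python
import math

def desired_replicas(predicted_rate: float,
                     per_consumer_throughput: float,
                     partition_count: int,
                     min_replicas: int = 1) -> int:
    """Consumers needed to keep up with the predicted production rate.

    Capped at partition_count: in a Kafka consumer group, each partition
    is assigned to at most one consumer, so extra consumers stay idle.
    """
    needed = math.ceil(predicted_rate / per_consumer_throughput)
    return max(min_replicas, min(needed, partition_count))

# 140 msgs/s predicted, each consumer handles ~40 msgs/s, topic has 6 partitions.
print(desired_replicas(140.0, 40.0, partition_count=6))  # 4
```

An autoscaler would feed the forecast output into this calculation on every evaluation cycle and patch the consumer deployment's replica count accordingly.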
[Diagram: autoscaling feedback loop driven by Message Production Rate, Consumer Lag, and Workload Prediction.]