SlideShare a Scribd company logo
1 of 24
Download to read offline
NIELS ROETERT
Solutions Architect,
StormForge
BENEDIKT
STEMMILDT
TalentFormation,
CTO
FEATURING
Triple 20 IT – How to
reduce costs on target
while increasing speed
and quality.
WEBINAR
Today’s Presenters
BENEDIKT STEMMILDT
CTO
NIELS ROETERT
Solutions Architect
● Software architect, full-stack
developer and speaker
● 10+ years experience in cloud,
continuous delivery, software
development, architecture
● Based in Hamburg, Germany
● Solutions Architect and speaker
● 30+ years experience in Linux,
UNIX, Kubernetes, microservices
● Based in Wageningen,
Netherlands
3
Agenda 1
Benedikt: Triple 20 IT – How to reduce costs
on target while increasing speed and quality.
2
3
Niels: ML for cost efficiency in K8s - Optimize
Live demo
Wrap-up/Q&A
w w w . s t o r m f o r g e . i o t a l e n t f o r m a t i o n . c o m
a w s . a m a z o n . c o m / p a r t n e r s
NIELS ROETERT
Solutions Architect,
StormForge
BENEDIKT
STEMMILDT
TalentFormation,
CTO
FEATURING
Triple 20 IT – How to
reduce costs on target
while increasing speed
and quality.
WEBINAR
Today’s Presenters
BENEDIKT STEMMILDT
CTO
NIELS ROETERT
Solutions Architect
● Software architect, full-stack
developer and speaker
● 10+ years experience in cloud,
continuous delivery, software
development, architecture
● Based in Hamburg, Germany
● Solutions Architect and speaker
● 30+ years experience in Linux,
UNIX, Kubernetes, microservices
● Based in Wageningen,
Netherlands
3
Agenda 1
Benedikt: Triple 20 IT – How to reduce costs
on target while increasing speed and quality.
2
3
Niels: ML for cost efficiency in K8s - Optimize
Live demo
Wrap-up/Q&A
w w w . s t o r m f o r g e . i o t a l e n t f o r m a t i o n . c o m
a w s . a m a z o n . c o m / p a r t n e r s
Put the ‘Auto’ in Autoscaling
Stop wasting human/compute resources
Niels Roetert
Solutions Architect
The Kubernetes Efficiency Challenge
● Configuring applications to run efficiently on
Kubernetes is difficult
● Developers often have to guess at resource
settings or end up using defaults
● Platform teams are left to manage
resources without knowing the needs of the
application
● “Do more with less” is the new reality, but
teams feel they must choose between
reliability and cost
What are Kubernetes Autoscalers
● Vertical Pod Autoscaler (VPA): Adjusts container CPU and memory requests and limits
based on usage, optimizing resource allocation.
● Horizontal Pod Autoscaler (HPA): Scales pod replicas based on CPU, memory, or
custom metrics, maintaining application performance and availability.
● Cluster Autoscaler: Adds or removes nodes in the cluster based on resource demands
and utilization, ensuring efficient resource usage and cost-effectiveness.
3
The Vertical Pod Autoscaler (VPA), Horizontal Pod Autoscaler (HPA),
and Cluster Autoscaler are components in Kubernetes that help
manage and scale resources automatically based on workload
demands and cluster resource utilization.
Most teams start with
horizontal pod
autoscaling (HPA)
4
What should my HPA
target utilization be
set to?
Are my pods the right size?
Am I just multiplying my
inefficiencies?
How am I going to
configure this across
hundreds of services?
Why is it called “auto”
scaling anyway?!
Without autoscaling, over-provisioning or risk are inevitable
5
Scenario 1: Conservative & wasteful Scenario 2: Aggressive & risky
Requests and Limits?
● Requests: The minimum resources (CPU and memory) a container is guaranteed.
● Limits: The maximum resources a container can consume before being throttled or
terminated.
● Relevance: They ensure efficient resource distribution and prevent resource contention
in a cluster.
● Usage: They help maintain application performance, stability, and prevent
over-provisioning or under-provisioning of resources.
6
Requests and limits are Kubernetes resource settings for containers,
they manage CPU and memory allocation.
targetUtilization
The targetUtilization field in the HPA config specifies the
desired resource utilization percentage. The HPA scales
the pod replicas up or down based on the observed
resource utilization to maintain the target value,
ensuring efficient resource usage and optimal
performance.
7
Kubernetes HPA automatically scales the number of pod replicas
based on observed metrics, like CPU utilization, to maintain optimal
resource usage and application performance.
In a Land of Rainbows and Unicorns
8
They have the knowledge of the application's
performance characteristics, resource usage patterns,
and architecture, which allows them to make informed
decisions about the appropriate resource allocation for
each container.
The development team or DevOps engineers are
responsible for setting the initial requests, limits
and targetUtilization for containers in a Kubernetes
deployment, because:
In a Realm of Chaos and Dragons
● Analyze application requirements.
● Set up a baseline with default Pod/HPA settings.
● Conduct load testing and adjust Pod/HPA.
● Monitor performance and fine-tune Pod/HPA.
● Review and update Pod/HPA settings regularly.
9
Determining the best settings for Pods and the HPA target
utilization in Kubernetes environments for new applications
can be challenging.
StormForge Optimize Live Overview
10
● Reduces costs and improves reliability by
right-sizing Kubernetes application resources.
● Machine learning analyzes CPU and memory
utilization, and provides recommendations to
adjust resources requests up & down to meet
demand as patterns change
● Reduces toil by automatically applying
recommendations automatically, freeing up
engineering resources
● Enables bi-dimensional autoscaling,
providing vertical rightsizing and efficient
horizontal scaling through recommended
target utilization for HPA enabled workloads
● Low barrier to entry making it fast and easy
to get started
RECOMMEND
● CPU and memory
● HPA target utilization
DISCOVER
● Continuously ingest
workloads
● Machine Learning
analyzes Kubernetes
data
IMPLEMENT
● Automatic or manual
● Route through CI/CD
process
INSTALL
11
StormForge Optimize Live: How does it work?
OPTIMIZE LIVE
12
StormForge Architecture
13
How does StormForge ML maximize efficiency without risking
performance or reliability?
Other Solutions StormForge
Focused on cost visibility only Actionable insights and specific
recommendations
Single objective, e.g. cost/resource
only
Multi-objective, reduce
resource/cost while ensuring
performance/reliability
Simplistic, inflexible Configurable - Specify goals, set
guardrails, use for a variety of use
cases
No HPA compatibility HPA compatibility for intelligent
bi-dimensional autoscaling
“Black box” approach Explainable AI fosters trust
What about security?
● We only collect telemetry data, similar to what
Datadog or other SaaS-based observability solutions
collect
● All communication to our SaaS endpoints occur using
SSL/TLS over port 443
● Telemetry data from different organizations is kept
separate
● ML models are trained separately for each customer
using only their own application data
● StormForge is SOC2 compliant
● Example datasets and other details can be provided
for your organization's security team as needed
14
The StormForge difference
INTELLIGENCE
Actionable
recommendations to
optimize resources as
usage varies.
Unlike cloud cost management tools that merely provide visibility, StormForge uses:
15
AUTOMATION
to proactively and
continuously right-size -
improving efficiency &
eliminating cloud waste.
VISIBILITY
Show current utilization
and identify
opportunities for
improvement.
+ +
w w w . s t o r m f o r g e . i o

More Related Content

Similar to Triple 20 IT – How to reduce costs on target while increasing speed and quality

Lessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatternsLessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatterns
Claudiu Barbura
 
Kakarla Sriram K _resume_sep_2016
Kakarla Sriram K _resume_sep_2016Kakarla Sriram K _resume_sep_2016
Kakarla Sriram K _resume_sep_2016
srkkakarla
 
How to Migrate Applications Off a Mainframe
How to Migrate Applications Off a MainframeHow to Migrate Applications Off a Mainframe
How to Migrate Applications Off a Mainframe
VMware Tanzu
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
Provectus
 

Similar to Triple 20 IT – How to reduce costs on target while increasing speed and quality (20)

Addressing the 8 Key Pain Points of Kubernetes Cluster Management
Addressing the 8 Key Pain Points of Kubernetes Cluster ManagementAddressing the 8 Key Pain Points of Kubernetes Cluster Management
Addressing the 8 Key Pain Points of Kubernetes Cluster Management
 
Scaling AI/ML with Containers and Kubernetes
Scaling AI/ML with Containers and Kubernetes Scaling AI/ML with Containers and Kubernetes
Scaling AI/ML with Containers and Kubernetes
 
Lessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatternsLessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatterns
 
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning InfrastructureML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
Construção de uma plataforma de observabilidade centralizada
Construção de uma plataforma de observabilidade centralizadaConstrução de uma plataforma de observabilidade centralizada
Construção de uma plataforma de observabilidade centralizada
 
Solving the Hidden Costs of Kubernetes with Observability
Solving the Hidden Costs of Kubernetes with ObservabilitySolving the Hidden Costs of Kubernetes with Observability
Solving the Hidden Costs of Kubernetes with Observability
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
Kakarla Sriram K _resume_sep_2016
Kakarla Sriram K _resume_sep_2016Kakarla Sriram K _resume_sep_2016
Kakarla Sriram K _resume_sep_2016
 
Confluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with ReplyConfluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with Reply
 
Container Days 22 - Predictive Autoscaling Patterns with Kubernetes.pdf
Container Days 22 - Predictive Autoscaling Patterns with Kubernetes.pdfContainer Days 22 - Predictive Autoscaling Patterns with Kubernetes.pdf
Container Days 22 - Predictive Autoscaling Patterns with Kubernetes.pdf
 
High Performance Computing Pitch Deck
High Performance Computing Pitch DeckHigh Performance Computing Pitch Deck
High Performance Computing Pitch Deck
 
StorPool Presents at Cloud Field Day 9
StorPool Presents at Cloud Field Day 9StorPool Presents at Cloud Field Day 9
StorPool Presents at Cloud Field Day 9
 
How to Migrate Applications Off a Mainframe
How to Migrate Applications Off a MainframeHow to Migrate Applications Off a Mainframe
How to Migrate Applications Off a Mainframe
 
AWS Well-Architected Framework (nov 2017)
AWS Well-Architected Framework (nov 2017)AWS Well-Architected Framework (nov 2017)
AWS Well-Architected Framework (nov 2017)
 
Pivotal Container Service Overview
Pivotal Container Service Overview Pivotal Container Service Overview
Pivotal Container Service Overview
 
Higher ROI-N
Higher ROI-NHigher ROI-N
Higher ROI-N
 
Building a centralized observability platform
Building a centralized observability platformBuilding a centralized observability platform
Building a centralized observability platform
 
Building a centralized observability platform
Building a centralized observability platformBuilding a centralized observability platform
Building a centralized observability platform
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
 

Recently uploaded

Recently uploaded (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Triple 20 IT – How to reduce costs on target while increasing speed and quality

  • 1. NIELS ROETERT Solutions Architect, StormForge BENEDIKT STEMMILDT TalentFormation, CTO FEATURING Triple 20 IT – How to reduce costs on target while increasing speed and quality. WEBINAR
  • 2. Today’s Presenters BENEDIKT STEMMILDT CTO NIELS ROETERT Solutions Architect ● Software architect, full-stack developer and speaker ● 10+ years experience in cloud, continuous delivery, software development, architecture ● Based in Hamburg, Germany ● Solutions Architect and speaker ● 30+ years experience in Linux, UNIX, Kubernetes, microservices ● Based in Wageningen, Netherlands
  • 3. 3 Agenda 1 Benedikt: Triple 20 IT – How to reduce costs on target while increasing speed and quality. 2 3 Niels: ML for cost efficiency in K8s - Optimize Live demo Wrap-up/Q&A
  • 4. w w w . s t o r m f o r g e . i o t a l e n t f o r m a t i o n . c o m a w s . a m a z o n . c o m / p a r t n e r s
  • 5. NIELS ROETERT Solutions Architect, StormForge BENEDIKT STEMMILDT TalentFormation, CTO FEATURING Triple 20 IT – How to reduce costs on target while increasing speed and quality. WEBINAR
  • 6. Today’s Presenters BENEDIKT STEMMILDT CTO NIELS ROETERT Solutions Architect ● Software architect, full-stack developer and speaker ● 10+ years experience in cloud, continuous delivery, software development, architecture ● Based in Hamburg, Germany ● Solutions Architect and speaker ● 30+ years experience in Linux, UNIX, Kubernetes, microservices ● Based in Wageningen, Netherlands
  • 7. 3 Agenda 1 Benedikt: Triple 20 IT – How to reduce costs on target while increasing speed and quality. 2 3 Niels: ML for cost efficiency in K8s - Optimize Live demo Wrap-up/Q&A
  • 8. w w w . s t o r m f o r g e . i o t a l e n t f o r m a t i o n . c o m a w s . a m a z o n . c o m / p a r t n e r s
  • 9. Put the ‘Auto’ in Autoscaling Stop wasting human/compute resources Niels Roetert Solutions Architect
  • 10. The Kubernetes Efficiency Challenge ● Configuring applications to run efficiently on Kubernetes is difficult ● Developers often have to guess at resource settings or end up using defaults ● Platform teams are left to manage resources without knowing the needs of the application ● “Do more with less” is the new reality, but teams feel they must choose between reliability and cost
  • 11. What are Kubernetes Autoscalers ● Vertical Pod Autoscaler (VPA): Adjusts container CPU and memory requests and limits based on usage, optimizing resource allocation. ● Horizontal Pod Autoscaler (HPA): Scales pod replicas based on CPU, memory, or custom metrics, maintaining application performance and availability. ● Cluster Autoscaler: Adds or removes nodes in the cluster based on resource demands and utilization, ensuring efficient resource usage and cost-effectiveness. 3 The Vertical Pod Autoscaler (VPA), Horizontal Pod Autoscaler (HPA), and Cluster Autoscaler are components in Kubernetes that help manage and scale resources automatically based on workload demands and cluster resource utilization.
  • 12. Most teams start with horizontal pod autoscaling (HPA) 4 What should my HPA target utilization be set to? Are my pods the right size? Am I just multiplying my inefficiencies? How am I going to configure this across hundreds of services? Why is it called “auto” scaling anyway?!
  • 13. Without autoscaling, over-provisioning or risk are inevitable 5 Scenario 1: Conservative & wasteful Scenario 2: Aggressive & risky
  • 14. Requests and Limits? ● Requests: The minimum resources (CPU and memory) a container is guaranteed. ● Limits: The maximum resources a container can consume before being throttled or terminated. ● Relevance: They ensure efficient resource distribution and prevent resource contention in a cluster. ● Usage: They help maintain application performance, stability, and prevent over-provisioning or under-provisioning of resources. 6 Requests and limits are Kubernetes resource settings for containers, they manage CPU and memory allocation.
  • 15. targetUtilization The targetUtilization field in the HPA config specifies the desired resource utilization percentage. The HPA scales the pod replicas up or down based on the observed resource utilization to maintain the target value, ensuring efficient resource usage and optimal performance. 7 Kubernetes HPA automatically scales the number of pod replicas based on observed metrics, like CPU utilization, to maintain optimal resource usage and application performance.
  • 16. In a Land of Rainbows and Unicorns 8 They have the knowledge of the application's performance characteristics, resource usage patterns, and architecture, which allows them to make informed decisions about the appropriate resource allocation for each container. The development team or DevOps engineers are responsible for setting the initial requests, limits and targetUtilization for containers in a Kubernetes deployment, because:
  • 17. In a Realm of Chaos and Dragons ● Analyze application requirements. ● Set up a baseline with default Pod/HPA settings. ● Conduct load testing and adjust Pod/HPA. ● Monitor performance and fine-tune Pod/HPA. ● Review and update Pod/HPA settings regularly. 9 Determining the best settings for Pods and the HPA target utilization in Kubernetes environments for new applications can be challenging.
  • 18. StormForge Optimize Live Overview 10 ● Reduces costs and improves reliability by right-sizing Kubernetes application resources. ● Machine learning analyzes CPU and memory utilization, and provides recommendations to adjust resources requests up & down to meet demand as patterns change ● Reduces toil by automatically applying recommendations automatically, freeing up engineering resources ● Enables bi-dimensional autoscaling, providing vertical rightsizing and efficient horizontal scaling through recommended target utilization for HPA enabled workloads ● Low barrier to entry making it fast and easy to get started
  • 19. RECOMMEND ● CPU and memory ● HPA target utilization DISCOVER ● Continuously ingest workloads ● Machine Learning analyzes Kubernetes data IMPLEMENT ● Automatic or manual ● Route through CI/CD process INSTALL 11 StormForge Optimize Live: How does it work?
  • 21. 13 How does StormForge ML maximize efficiency without risking performance or reliability? Other Solutions StormForge Focused on cost visibility only Actionable insights and specific recommendations Single objective, e.g. cost/resource only Multi-objective, reduce resource/cost while ensuring performance/reliability Simplistic, inflexible Configurable - Specify goals, set guardrails, use for a variety of use cases No HPA compatibility HPA compatibility for intelligent bi-dimensional autoscaling “Black box” approach Explainable AI fosters trust
  • 22. What about security? ● We only collect telemetry data, similar to what Datadog or other SaaS-based observability solutions collect ● All communication to our SaaS endpoints occur using SSL/TLS over port 443 ● Telemetry data from different organizations is kept separate ● ML models are trained separately for each customer using only their own application data ● StormForge is SOC2 compliant ● Example datasets and other details can be provided for your organization's security team as needed 14
  • 23. The StormForge difference INTELLIGENCE Actionable recommendations to optimize resources as usage varies. Unlike cloud cost management tools that merely provide visibility, StormForge uses: 15 AUTOMATION to proactively and continuously right-size - improving efficiency & eliminating cloud waste. VISIBILITY Show current utilization and identify opportunities for improvement. + +
  • 24. w w w . s t o r m f o r g e . i o