SlideShare a Scribd company logo
1 of 12
Container Management Platform
[K8S]
PaaS Journey
By Uladzimir Palkhouski
https://www.linkedin.com/in/uladzimirpalkhouski/
there is always a reason behind...
Challenges of countless projects:
- application resiliency issues
- low resource utilization and cost in-efficiency
- operational inefficiency in using non-unified technology stack for managing different layers and
deploying applications
- low deployment velocity and elasticity
- security and compliance issues (host & app level access and audit)
- operational overhead in managing infrastructure
Fundamental Principles
● Cattle, no Pet
● Immutable Infrastructure
● Codified Infrastructure
● Golden Image
● OOB Resiliency
● OOB Telemetry
is there anything we can do in a dramatically
different way?
… for a single ultimate goal
let application developers focus on application development and business capabilities
... while somebody else (or something else) takes care about infrastructure maintenance, networking,
capacity planning, resiliency, telemetry, security and compliance, etc…
responding to the demand – K8S based
container management PaaS
- best-of-breed container scheduler – K8S
- KOPS and AWS based OSS K8S distribution
- OSS-based addons ecosystem (fluentd, weave scope, heapster, etc.)
- rolling cluster update to answer most of operational challenges
- unified addons, resources, applications and services deployment
(helm)
- 100% OSS, no proprietary closed products
- full CNCF K8S conformance (read as no lock, can migrate to other
distributions) Source: https://redislabs.com/redis-enterprise-
documentation/administering/kubernetes/upgrading-redis-enterprise-
cluster-kubernetes-deployment-operator/
fundamental principles > platform capabilities
- Cattle Host -> no pet hosts, any node can be killed any time if misbehaving. Workloads will by
rescheduled on alternative nodes
- Immutable Infrastructure → rolling cluster update. Through mechanics of rolling update any
compliance, security hardening or configuration management issue is addressed. Lift & shift container
- Golden Image → backed into the cluster definition
- Codified infrastructure → clusters, addons, resources, applications - all declaratively defined
- Build-in resiliency and telemetry - out of the box open source addons that require none to low effort
on product team side
Container management PaaS is essentially an integrated family of cloud-native capabilities that lets you
increase speed and reliability, improve security and focus on delivery
capabilities overview
journey
- Oct 2016 – realized that there is a need for container scheduler. Chosen Rancher for the cloud and scheduler
agnostic approach
- June 2017 – realized that Rancher does not deliver in accordance with expectations (health + readiness
checks, granular control over workloads and rolling service updates)
- May – Sep 2017 – OpenShift evaluation
- Nov 2017 – taken decision about vanilla K8S, started POC
- Feb 2018 – started K8S productionalization
- Sep 2018 – finalizing productionalization
today
- Unified cluster operations, 24/7 monitoring with PD, office hours support in place
- 4 clusters in place (2 prod)
- Overall capacity of 50+ nodes
- 10+ products / services hosted
- Unified stack of addons for performance monitoring, DNS management, ingress controller, centralized logging
- 3 engineers + 1 architect in the team - 24/7 support included - to prove validity of fundamental principles
(and economy of scale!)
- Product teams are excited!
mistakes made
- not implementing cluster-level DR strategy early enough (etcd backup) – we killed cluster twice, both times
due to unexpected behavior of tooling (KOPS – split brain, HELM – resource termination during deletion of
failed deployment) and overconfidence
- toolset overconfidence: took traefik as ingress controller and ended up with 4 ingress controllers for a single
environment, HTTP + TCP, internal + external
- too broad scope: monitoring and security addons, rich networking capabilities, clusters maintenance, teams
support
- no time for OSS contribution – not sustainable approach
- rolling update is still hard – regular failures and need for manual interventions, maintenance windows
agreements, etc.
- Stitch-free Cloud Native and Cloud (AWS) Integration is still a challenge:
- environment segregation (VPC or account based)
- provisioning of related services (RDS, RedShift, Lambdas) as part of unified deployment stack
positioning against competitive offerings
what would you like to see next time?
- Automatic cluster provisioning and rolling update mechanics DEMO (Terraform, KOPS, Jenkins) ?
- Reach application deployment capabilities DEMO (HELM, Kubectl, Jenkins) ?
- Routing & Networking techniques DEMO (Ingress Controllers, DNS Management) ?
- Telemetry capabilities DEMO (DataDog, Weave.Scope, Prometheus) ?

More Related Content

What's hot

RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...
RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...
RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...
Redis Labs
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
Amazon Web Services Korea
 

What's hot (20)

Nutanix basic
Nutanix basicNutanix basic
Nutanix basic
 
Open Source Summit NorthAmerica 2017 Stateful Apps in Containers
Open Source Summit NorthAmerica 2017 Stateful Apps in ContainersOpen Source Summit NorthAmerica 2017 Stateful Apps in Containers
Open Source Summit NorthAmerica 2017 Stateful Apps in Containers
 
Experiencing the hyperconverged
Experiencing the hyperconvergedExperiencing the hyperconverged
Experiencing the hyperconverged
 
7 - Monitoring Kubernetes with Elastic
7 - Monitoring Kubernetes with Elastic7 - Monitoring Kubernetes with Elastic
7 - Monitoring Kubernetes with Elastic
 
Web scale IT - Nutanix
Web scale IT - NutanixWeb scale IT - Nutanix
Web scale IT - Nutanix
 
Elastic on a Hyper-Converged Infrastructure for Operational Log Analytics
Elastic on a Hyper-Converged Infrastructure for Operational Log AnalyticsElastic on a Hyper-Converged Infrastructure for Operational Log Analytics
Elastic on a Hyper-Converged Infrastructure for Operational Log Analytics
 
RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...
RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...
RedisConf18 - Redis Cluster Provisioning with Kubernetes Service-Catalog Exte...
 
Botvinnik webinar
Botvinnik webinarBotvinnik webinar
Botvinnik webinar
 
Nutanix Technology Bootcamp
Nutanix Technology BootcampNutanix Technology Bootcamp
Nutanix Technology Bootcamp
 
Containerisation and DEIS
Containerisation and DEISContainerisation and DEIS
Containerisation and DEIS
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategy
 
An Easy Path to Kubernetes on Nutanix
An Easy Path to Kubernetes on Nutanix An Easy Path to Kubernetes on Nutanix
An Easy Path to Kubernetes on Nutanix
 
Overview of v cloud case studies
Overview of v cloud case studiesOverview of v cloud case studies
Overview of v cloud case studies
 
Coursera's Adoption of Cassandra
Coursera's Adoption of CassandraCoursera's Adoption of Cassandra
Coursera's Adoption of Cassandra
 
The Evolution of VMTurbo, now Turbonomic, Product Releases
The Evolution of VMTurbo, now Turbonomic, Product ReleasesThe Evolution of VMTurbo, now Turbonomic, Product Releases
The Evolution of VMTurbo, now Turbonomic, Product Releases
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
 
Cassandra summit 2015 - Simplifying Streaming Analytics
Cassandra summit 2015 - Simplifying Streaming AnalyticsCassandra summit 2015 - Simplifying Streaming Analytics
Cassandra summit 2015 - Simplifying Streaming Analytics
 
CEPH & OPENSTACK - Red Hat's Winning Combination for Enterprise Clouds
CEPH & OPENSTACK - Red Hat's Winning Combination for Enterprise CloudsCEPH & OPENSTACK - Red Hat's Winning Combination for Enterprise Clouds
CEPH & OPENSTACK - Red Hat's Winning Combination for Enterprise Clouds
 
Nutanix and microsoft_webinar_oct_28
Nutanix and microsoft_webinar_oct_28Nutanix and microsoft_webinar_oct_28
Nutanix and microsoft_webinar_oct_28
 
How Lenovo and Nutanix are delivering the invisible infrastructure
How Lenovo and Nutanix are delivering the invisible infrastructureHow Lenovo and Nutanix are delivering the invisible infrastructure
How Lenovo and Nutanix are delivering the invisible infrastructure
 

Similar to Kubernetes - Container Management PaaS Journey

Resume_Achhar_Kalia
Resume_Achhar_KaliaResume_Achhar_Kalia
Resume_Achhar_Kalia
Achhar Kalia
 
Client Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & Infra
Client Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & InfraClient Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & Infra
Client Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & Infra
Rawud Manasseh
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentation
Rodrigo Missiaggia
 
Prioritizing Complexities of Enterprise Kubernetes.pdf
Prioritizing Complexities of Enterprise Kubernetes.pdfPrioritizing Complexities of Enterprise Kubernetes.pdf
Prioritizing Complexities of Enterprise Kubernetes.pdf
Kedarnath76
 

Similar to Kubernetes - Container Management PaaS Journey (20)

Winning Governance Strategies for the Technology Disruptions of our Time
Winning Governance Strategies for the Technology Disruptions of our TimeWinning Governance Strategies for the Technology Disruptions of our Time
Winning Governance Strategies for the Technology Disruptions of our Time
 
PaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer Demand
PaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer DemandPaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer Demand
PaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer Demand
 
Stephen miller resume
Stephen miller resumeStephen miller resume
Stephen miller resume
 
Cloud-native Application Lifecycle Management
Cloud-native Application Lifecycle ManagementCloud-native Application Lifecycle Management
Cloud-native Application Lifecycle Management
 
What HPC can learn from DevOps?
What HPC can learn from DevOps?What HPC can learn from DevOps?
What HPC can learn from DevOps?
 
Stephen miller resume
Stephen miller resumeStephen miller resume
Stephen miller resume
 
Stephen miller resume
Stephen miller resumeStephen miller resume
Stephen miller resume
 
Stephen miller resume
Stephen miller resumeStephen miller resume
Stephen miller resume
 
Stephen miller resume
Stephen miller resumeStephen miller resume
Stephen miller resume
 
Enabling Fast IT using Containers, Microservices and DAVROS models: an overview
Enabling Fast IT using Containers, Microservices and DAVROS models: an overviewEnabling Fast IT using Containers, Microservices and DAVROS models: an overview
Enabling Fast IT using Containers, Microservices and DAVROS models: an overview
 
AWS TechConnect 2018 - Container Adoption
AWS TechConnect 2018 - Container AdoptionAWS TechConnect 2018 - Container Adoption
AWS TechConnect 2018 - Container Adoption
 
Resume_Achhar_Kalia
Resume_Achhar_KaliaResume_Achhar_Kalia
Resume_Achhar_Kalia
 
Enabling Business Agility with SUSE CaaS Platform
Enabling Business Agility with SUSE CaaS PlatformEnabling Business Agility with SUSE CaaS Platform
Enabling Business Agility with SUSE CaaS Platform
 
Client Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & Infra
Client Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & InfraClient Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & Infra
Client Solutions Executive - Niche Skilled on AWS Cloud, Digital Apps & Infra
 
Oracle - Continuous Delivery NYC meetup, June 07, 2018
Oracle - Continuous Delivery NYC meetup, June 07, 2018Oracle - Continuous Delivery NYC meetup, June 07, 2018
Oracle - Continuous Delivery NYC meetup, June 07, 2018
 
(ENT306) Application Portfolio Migration | AWS re:Invent 2014
(ENT306) Application Portfolio Migration | AWS re:Invent 2014(ENT306) Application Portfolio Migration | AWS re:Invent 2014
(ENT306) Application Portfolio Migration | AWS re:Invent 2014
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentation
 
Prioritizing Complexities of Enterprise Kubernetes.pdf
Prioritizing Complexities of Enterprise Kubernetes.pdfPrioritizing Complexities of Enterprise Kubernetes.pdf
Prioritizing Complexities of Enterprise Kubernetes.pdf
 
Elastic-Engineering
Elastic-EngineeringElastic-Engineering
Elastic-Engineering
 
Nicholas Noor - Cloud-Infra Architect
Nicholas Noor - Cloud-Infra ArchitectNicholas Noor - Cloud-Infra Architect
Nicholas Noor - Cloud-Infra Architect
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Kubernetes - Container Management PaaS Journey

  • 1. Container Management Platform [K8S] PaaS Journey By Uladzimir Palkhouski https://www.linkedin.com/in/uladzimirpalkhouski/
  • 2. there is always a reason behind... Challenges of countless projects: - application resiliency issues - low resource utilization and cost in-efficiency - operational inefficiency in using non-unified technology stack for managing different layers and deploying applications - low deployment velocity and elasticity - security and compliance issues (host & app level access and audit) - operational overhead in managing infrastructure
  • 3. Fundamental Principles ● Cattle, no Pet ● Immutable Infrastructure ● Codified Infrastructure ● Golden Image ● OOB Resiliency ● OOB Telemetry is there anything we can do in a dramatically different way?
  • 4. … for a single ultimate goal let application developers focus on application development and business capabilities ... while somebody else (or something else) takes care about infrastructure maintenance, networking, capacity planning, resiliency, telemetry, security and compliance, etc…
  • 5. responding to the demand – K8S based container management PaaS - best-of-breed container scheduler – K8S - KOPS and AWS based OSS K8S distribution - OSS-based addons ecosystem (fluentd, weave scope, heapster, etc.) - rolling cluster update to answer most of operational challenges - unified addons, resources, applications and services deployment (helm) - 100% OSS, no proprietary closed products - full CNCF K8S conformance (read as no lock, can migrate to other distributions) Source: https://redislabs.com/redis-enterprise- documentation/administering/kubernetes/upgrading-redis-enterprise- cluster-kubernetes-deployment-operator/
  • 6. fundamental principles > platform capabilities - Cattle Host -> no pet hosts, any node can be killed any time if misbehaving. Workloads will by rescheduled on alternative nodes - Immutable Infrastructure → rolling cluster update. Through mechanics of rolling update any compliance, security hardening or configuration management issue is addressed. Lift & shift container - Golden Image → backed into the cluster definition - Codified infrastructure → clusters, addons, resources, applications - all declaratively defined - Build-in resiliency and telemetry - out of the box open source addons that require none to low effort on product team side Container management PaaS is essentially an integrated family of cloud-native capabilities that lets you increase speed and reliability, improve security and focus on delivery
  • 8. journey - Oct 2016 – realized that there is a need for container scheduler. Chosen Rancher for the cloud and scheduler agnostic approach - June 2017 – realized that Rancher does not deliver in accordance with expectations (health + readiness checks, granular control over workloads and rolling service updates) - May – Sep 2017 – OpenShift evaluation - Nov 2017 – taken decision about vanilla K8S, started POC - Feb 2018 – started K8S productionalization - Sep 2018 – finalizing productionalization
  • 9. today - Unified cluster operations, 24/7 monitoring with PD, office hours support in place - 4 clusters in place (2 prod) - Overall capacity of 50+ nodes - 10+ products / services hosted - Unified stack of addons for performance monitoring, DNS management, ingress controller, centralized logging - 3 engineers + 1 architect in the team - 24/7 support included - to prove validity of fundamental principles (and economy of scale!) - Product teams are excited!
  • 10. mistakes made - not implementing cluster-level DR strategy early enough (etcd backup) – we killed cluster twice, both times due to unexpected behavior of tooling (KOPS – split brain, HELM – resource termination during deletion of failed deployment) and overconfidence - toolset overconfidence: took traefik as ingress controller and ended up with 4 ingress controllers for a single environment, HTTP + TCP, internal + external - too broad scope: monitoring and security addons, rich networking capabilities, clusters maintenance, teams support - no time for OSS contribution – not sustainable approach - rolling update is still hard – regular failures and need for manual interventions, maintenance windows agreements, etc. - Stitch-free Cloud Native and Cloud (AWS) Integration is still a challenge: - environment segregation (VPC or account based) - provisioning of related services (RDS, RedShift, Lambdas) as part of unified deployment stack
  • 12. what would you like to see next time? - Automatic cluster provisioning and rolling update mechanics DEMO (Terraform, KOPS, Jenkins) ? - Reach application deployment capabilities DEMO (HELM, Kubectl, Jenkins) ? - Routing & Networking techniques DEMO (Ingress Controllers, DNS Management) ? - Telemetry capabilities DEMO (DataDog, Weave.Scope, Prometheus) ?