Jeff Carpenter (DataStax)
DoK Day North America 2022 @ KubeCon
“The Kubernetes Native Database”
Cloud Native Database
Kubernetes Native Database
Serverless Database
Managing Cloud Native Data on Kubernetes
● Coming Dec 2022
● This talk is based on Chapter 7, “The Kubernetes Native Database”
Cloud Native Data Principles
1. Leverage compute / network / storage as commodity APIs
2. Separate the control and data planes
3. Make observability easy
4. Make the default configuration secure
5. Prefer declarative configuration
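Principle 5 (prefer declarative configuration) can be illustrated with a minimal reconciliation sketch: the user declares a desired state, and a controller works out the steps needed to converge the observed state toward it. The function name `reconcile` and the replica-count model are assumptions made for illustration; they are not taken from the talk or from any particular operator.

```python
from typing import Dict, List


def reconcile(desired: Dict[str, int], observed: Dict[str, int]) -> List[str]:
    """Return the actions needed to move `observed` replica counts to `desired`.

    Both arguments map a (hypothetical) component name to a replica count.
    """
    actions: List[str] = []
    for name in sorted(set(desired) | set(observed)):
        want = desired.get(name, 0)
        have = observed.get(name, 0)
        if want > have:
            actions.append(f"scale-up {name} {have}->{want}")
        elif want < have:
            actions.append(f"scale-down {name} {have}->{want}")
        # Equal counts need no action, so repeated runs are idempotent.
    return actions


if __name__ == "__main__":
    desired = {"compute": 3, "storage": 3}
    observed = {"compute": 1, "storage": 3}
    for action in reconcile(desired, observed):
        print(action)
```

Because the caller only states the target, the same call can be repeated safely: once the observed state matches, the list of actions is empty.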
TiDB
● MySQL Compatible
● Hybrid Transactional and Analytical Processing (HTAP)
● Separation of compute and storage
○ TiDB - compute
○ TiKV - row-based key-value storage
○ TiFlash - columnar storage
● Spark Integration
● K8s only
TiDB Operator
● Operator controls all components
● Optional extension to K8s Scheduler
● Manages CRDs including TidbCluster, TidbMonitor
TiDB Cluster
● TiDB resource
○ Allows specification of TiDB / TiKV / TiFlash instances and supporting infrastructure
○ Monitor with Prometheus / Grafana stack (not shown)
● Not fully cloud-native
○ Could use object storage instead of PVs
○ Could use etcd instead of Discovery Service
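To make the idea of a cluster resource concrete, here is a hedged sketch that builds a simplified manifest as a Python dictionary. The helper `build_cluster_manifest` is hypothetical, and the spec layout is a deliberately reduced assumption: it leaves out images, storage sizes, and supporting components, so it should be checked against the TiDB Operator documentation before any real use.

```python
import json
from typing import Any, Dict


def build_cluster_manifest(name: str, tidb: int, tikv: int, tiflash: int) -> Dict[str, Any]:
    """Build a simplified, illustrative manifest for a TiDB cluster resource.

    The spec layout is an assumption and omits required settings; it is not
    a complete or validated manifest.
    """
    for label, count in (("tidb", tidb), ("tikv", tikv), ("tiflash", tiflash)):
        if count < 0:
            raise ValueError(f"{label} replicas must be non-negative, got {count}")
    return {
        "apiVersion": "pingcap.com/v1alpha1",
        "kind": "TidbCluster",
        "metadata": {"name": name},
        "spec": {
            "tidb": {"replicas": tidb},        # compute layer
            "tikv": {"replicas": tikv},        # row-based storage
            "tiflash": {"replicas": tiflash},  # columnar storage
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_cluster_manifest("demo", 2, 3, 1), indent=2))
```

The point of the sketch is the shape of the interaction: the user states how many instances of each component should exist, and the operator is responsible for making that true.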
Astra DB
● Apache Cassandra factored into microservices and available as a managed service
● Uses object storage instead of PVs for a true serverless architecture
● Leverages etcd and Prometheus/Grafana stack
● API access via Stargate
○ REST, GraphQL, Docs, gRPC
● Multi-tenant, multi-cluster
Astra DBInstallation
● Astra DB operator deploys multi-tenant clusters using DBInstallation resource
● Ingress routes incoming traffic by tenant to specific Coordinator / Data Service instances
○ Metadata stored in etcd (not shown)
● Authentication delegated to IAM service
● Data Services use local PVs for caching, object storage for longer term persistence
● Compaction Service processes data files in object storage in the background
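The split between local PVs for caching and object storage for persistence resembles a read-through cache. The sketch below shows that general pattern only; the class `TieredStore` and its methods are hypothetical, both tiers are in-memory dictionaries, and nothing here describes how Astra DB is actually implemented.

```python
from typing import Dict, Optional


class TieredStore:
    """Read-through cache: a fast local tier in front of a durable remote tier.

    Both tiers are plain dictionaries here; in a real system the local tier
    would be a persistent volume and the remote tier an object store.
    """

    def __init__(self) -> None:
        self.local: Dict[str, bytes] = {}   # stands in for a local PV cache
        self.remote: Dict[str, bytes] = {}  # stands in for object storage

    def write(self, key: str, data: bytes) -> None:
        # Persist to the durable tier first, then warm the cache.
        self.remote[key] = data
        self.local[key] = data

    def evict(self, key: str) -> None:
        # Dropping a cached copy never loses data; the remote tier still has it.
        self.local.pop(key, None)

    def read(self, key: str) -> Optional[bytes]:
        if key in self.local:
            return self.local[key]
        data = self.remote.get(key)
        if data is not None:
            self.local[key] = data  # repopulate the cache on a miss
        return data


if __name__ == "__main__":
    store = TieredStore()
    store.write("file-1", b"rows")
    store.evict("file-1")
    print(store.read("file-1"))
```

Since the local tier holds only copies, a node and its volume can be replaced without data loss, which is what makes the data services easier to scale up and down.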
What Makes a Database Kubernetes Native?
● Maximum leverage of Kubernetes APIs
○ StatefulSets, Deployments, etcd, Ingress, Scheduler
● Automated, declarative management
○ Via operators and CRDs
● Observable through standard APIs
○ E.g. Prometheus
● Secure by default
○ E.g. no default passwords
The Future of Kubernetes Native Databases
● Microservices / serverless
● Multi-cluster / Multi-cloud
● Multi-tenant
● Community based
● Open source
What Databases Need from Kubernetes
● Improved StatefulSets
● Resources to manage multi-tenancy and multi-cluster
● Additional hypervisor support
● Compute resource management (e.g. quotas)
● Better disk initialization (e.g. striping)
Thank you!
Special thanks to:
● Ed Huang, PingCAP
● Jake Luciani, DataStax
