Link: https://youtu.be/IoEJvsl1ZqM
https://go.dok.community/slack
https://dok.community/
From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE)
“Zero downtime” and “always-on” are illusions. All systems fail sooner or later, whether it’s a regional e-commerce website or a major cloud region hosting thousands of applications. That’s why, instead of chasing these illusions, it’s worth focusing on the nines of availability.
Based on true stories, this session walks you through modern data availability and reliability patterns used by architects whose applications withstood major cloud outages. With the focus on the data storage layer and Kubernetes, you’ll learn:
* How to architect the data layer in Kubernetes with the server, zone, and region-level resiliency in mind.
* How to find a compromise between latency and availability for multi-region deployments.
* How to ensure the data layer remains reliable (i.e., always returns expected data) even during a major incident.
-----
Denis Magda has spent half of his career working on distributed systems, applications, and databases. His experience spans from the development of distributed database engines and high-performance applications to training and education on the topic of distributed and cloud computing. Presently, Denis runs the Developer Relations team at Yugabyte and serves a PMC Member for Apache Ignite. He started his professional career at Sun Microsystems and Oracle, where he led one of the Java development groups and worked on technology evangelism efforts.
3. DoK Day Europe 2022 @ KubeCon
Going cloud native?
Fasten your seat belts!
○ Zone-level outages are commonplace
○ Region-level outages become the new
normal
○ Starting 2011, AWS had a region-wide
incident at least once a year:
○ It can take 4+ hours to recover
https://aws.amazon.com/premiumsupport/tec
hnology/pes/
4. DoK Day Europe 2022 @ KubeCon
Stuff that is NOT
under your control
○ Or what causes major cloud outages
○ Hardware & networking failures
○ Bugs or outages in cloud services:
○ Amazon Kinesis outage that cascaded to
dependent services
○ Human errors:
○ S3 outage caused by a wrong command
executed by an AWS team member
5. DoK Day Europe 2022 @ KubeCon
Stuff that is under
YOUR control
1. Accept the fact that a cloud storm can happen to you
2. Design multi-AZ (at least) and multi-region (better) solutions
6. DoK Day Europe 2022 @ KubeCon
What about speed and
latency?
○ Zones are interconnected through a high-performance network:
○ GCP: round-trip network latency is under 5 milliseconds on the 95th
percentile
○ Regions are interconnected through a regional network (~latencies for
GCP):
○ US West (Oregon)->US East (North Virginia): 60ms
○ US West (Oregon)->London: 130ms
○ US West (Oregon)->Mumbai: 220ms
○ US West (Oregon)->Tokyo: 90ms
7. DoK Day Europe 2022 @ KubeCon
Data Management Patterns for
Reliability and Availability
8. DoK Day Europe 2022 @ KubeCon
Geo-distributed apps
require HA data layer
Applications that span multiple geographic locations for
high-availability, compliance and performance needs
9. DoK Day Europe 2022 @ KubeCon
Multi-Region Database
Deployment
With Read Replicas
○ Primary database instance is in one
geography:
○ Spans multiple zones or local regions
○ Handles both reads and writes
○ Read replicas are in other distant locations:
○ Takes over in case of a primary instance failure
○ Serve reads faster for local users
10. DoK Day Europe 2022 @ KubeCon
Multiple Standalone
Databases
With Async Replication
○ Each distant geography has its own
distributed database instance
○ Async replication between standalone
databases:
○ Bidirectional and unidirectional
○ But synchronously within a single database
instance
11. DoK Day Europe 2022 @ KubeCon
Geo-Partitioned
Multi-Region
Database Deployment
○ Single database instance that spans various
geographies
○ Data is pinned to a group of nodes from a
specific location (NA, EMEA, APAC)
○ Low latency for queries within a single
geography
○ High latency for cross-region queries
12. DoK Day Europe 2022 @ KubeCon
Learn More?!
○ Architecting for Scale by O’Reilly
○ Designing Data-Intensive Applications by O’Reilly
○ Deep dive into a multi-region DBs deployment
options
13. DoK Day Europe 2022 @ KubeCon
Thank You!
Join our community
yugabyte.com/slack
Try for yourself
cloud.yugabyte.com
@denismagda