How to provide enterprise high availability with MariaDB Platform

How to provide enterprise
high availability with
MariaDB Platform
SHANE K JOHNSON
Senior Director of Product Marketing
MariaDB Corporation

Agenda
● Fundamentals
● Replication
● Clustering
● Automatic failover
Basic concepts
● Read scaling
● Topologies
● Optimizations
Advanced strategies

MariaDB Server
InnoDB / MyRocks
Applications
C JDBC ODBC Node.js

Applications
Containers
MariaDB Platform
MariaDB MaxScale
CDC
MariaDB Server
InnoDB / MyRocks
MariaDB Server
ColumnStore
Transactional Analytical
Kubernetes (Helm) Docker (Compose)
C JDBC ODBC Node.js
Ingest streaming data
Kafka connector
Administration
SQL Diagnostic
Manager
SQLyog
MariaDB Backup
MariaDB Flashback
Import bulk data
Spark connector
C/Java/Python API

MariaDB technology
● Replication and clustering
● Sharding
● Compression
● Encryption
● Auditing
● Point-in-time rollback
● Versioned tables
● CTEs and window functions
● JSON and GIS functions
● Oracle compatibility
MariaDB Server
● Enterprise high availability
○ Automatic failover
○ Transaction replay
● Enterprise scalability
○ Read-write splitting
○ Replication server
● Enterprise security
○ Data masking
○ Query blocking
○ Result limiting
● Enterprise integration
○ Change-data-capture
○ Kafka connector
MariaDB MaxScale
● Columnar storage
● Distributed data
● Massively parallel processing
● Regression functions
● Spark connector
● Kafka connector
● Pentaho Kettle connector
MariaDB ColumnStore

Terminology
Automatic failover: a standby database was automatically promoted to primary
because the previous primary failed (or became otherwise unavailable)
Manual switchover: a standby database was manually promoted to primary by a
DBA to, for example, perform a rolling upgrade
Automatic rejoin: a previously failed primary database is recovered and
automatically reconfigured as a standby

Nines
99.9%
99.99%
99.999%
99.9999%
526.20 minutes (8.77 hours)
52.60 minutes
5.26 minutes
0.52 minutes (31.65 seconds)

Standby
Replication
(async/semi-sync)
Clustering
(synchronous)
Replication and clustering
Primary Primary
Primary
Primary
Standby Standby

Global transaction IDs
GTID = domain ID + server ID + sequence number
1. prevents conflicts between multiple masters
2. enable standbys to resume replication

Binary log (binlog)
Commit ID GTID Server ID Event type Position End position
100 0-1-200 1 Query 0 150
100 0-1-201 1 Query 151 500
100 0-1-202 1 Query 501 600
101 0-1-203 1 Query 601 800
101 0-1-204 1 Query 801 1000
Logical view of the binlog

Sequence
1. The standby IO thread requests binlog events, includes its current GTID
2. The primary returns binlog events for the next GTID(s)
3. The standby IO thread writes the binlog events to its relay log
4. The standby SQL thread reads the binlog events from its relay log
5. The standby SQL thread executes the binlog events and updates its current GTID

Primary
TX (GTID 1)
TX (GTID 2)
Standby
Relay log
1. request next transaction (GTID = 2)
TX (GTID 1)
TX (GTID 2)
TX (GTID 3)
Binary log
2. read next
transaction (GTID = 3)
3. reply with next transaction (GTID = 3)
TX (GTID 3)
4. write next transaction
(GTID = 3)
Sequence

Replication: async or
semi-sync

Standby
Replication
(asynchronous)
Replication
(semi-synchronous)
Clustering
(synchronous)
Replication: async vs. semi-sync
Primary Primary
Primary
Primary
Primary
Standby Standby Standby Standby Standby
async sync
async sync

Automatic failover: async replication
Standby 1
(GTID=2)
Standby 2
(GTID=1)
MariaDB MaxScale
(database proxy)
MariaDB MaxScale
(database proxy)
Primary
(GTID=3)
Standby 1
(GTID=2)
Standby 2
(GTID=1)
Primary
(GTID=3)

Automatic failover: async replication
MariaDB MaxScale
(database proxy)
MariaDB MaxScale
(database proxy)
MariaDB MaxScale
(database proxy)
Primary
(GTID=2)
Standby 2
(GTID=1)
Standby 1
(GTID=2)
Standby 2
(GTID=1)
Primary
(GTID=3)
Standby 1
(GTID=2)
Standby 2
(GTID=1)
Primary
(GTID=3)

Automatic failover: semi-sync replication
MariaDB MaxScale
(database proxy)
MariaDB MaxScale
(database proxy)
Standby 1
(GTID=2)
Standby 2
(GTID=3)
Primary
(GTID=3)
Standby 1
(GTID=2)
Standby 2
(GTID=3)
Primary
(GTID=3)

Automatic failover: semi-sync replication
MariaDB MaxScale
(database proxy)
MariaDB MaxScale
(database proxy)
MariaDB MaxScale
(database proxy)
Standby 1
(GTID=2)
Primary
(GTID=3)
Standby 1
(GTID=2)
Standby 2
(GTID=3)
Primary
(GTID=3)
Standby 1
(GTID=2)
Standby 2
(GTID=3)
Primary
(GTID=3)

Parameters: semi-sync replication
Variable Values Default
rpl_semi_sync_master_enabled 0 (OFF) | 1 (ON) 0 (OFF)
rpl_semi_sync_master_timeout 0 to n (ms) 10000 (10 seconds)

Use cases
Read-intensive: product catalogs
Mixed: shopping carts
Write-intensive: clickstream data
Asynchronous
Read-intensive: customer profiles
Mixed: inventory and pricing
Write-intensive: checkouts
Semi-synchronous

Concepts
Group communication ensures total ordering of messages sent from multiple nodes
Write sets contain all of the rows modified by a transaction, created during the commit phase
Global transaction ordering assigns writes sets a GTID (UUID + sequence number) so writes
are applied in the same order on every node
Certification ensures write sets are applied on all nodes or rejected on all nodes with
deterministic testing

Sequence
1. Synchronous
a. Originating node: create a write set
b. Originating node: assign a global transaction ID to the write set and replicate it
c. Originating node: apply the write set and commit the transaction
2. Asynchronous
a. Other nodes: certify the write set
b. Other nodes: apply the write set and commit the transaction

Sequence
Node
Row 3
Row 2
Row 1
Transaction
1. get
writes
Node
Node
2. send
writes
3. certify and apply
writes

Automatic failover
Node 1
(Priority = 1)
Node 2
(Priority = 2)
Node 3
(Priority = 3)
Node 1
(Priority = 1)
Node 2
(Priority = 2)
Node 3
(Priority = 3)
Node 2
(Priority = 2)
Node 3
(Priority = 3)
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)

Use cases
Read intensive: account status
Mixed: package tracking
Write-intensive: payments
Clustering/synchronous

Use cases
Read-intensive: product catalogs
Mixed: shopping carts
Write-intensive: clickstream data
Replication/async
Read-intensive: customer profiles
Mixed: inventory and pricing
Write-intensive: checkouts
Replication/Semi-sync
Read intensive: account status
Mixed: package tracking
Write-intensive: payments
Clustering/synchronous

Parameters
auto_failover TRUE | FALSE FALSE
auto_rejoin TRUE | FALSE FALSE
switchover_on_low_disk_space TRUE | FALSE FALSE
failcount 1 to n 5
monitor_interval 100 to n (ms) 2000 (2 seconds)
verify_master_failure TRUE | FALSE FALSE
servers_no_promotion server names (CSV) N/A

What’s new
Connection failover: connection is migrated to the new primary
Delayed retry: retry queries after automatic failover has completed
Transaction replay: replay transactions from start if failover occurs mid transaction
Optimistic transactions: start transactions on standbys for session failover

What’s new
master_reconnection TRUE | FALSE FALSE
delayed_retry TRUE | FALSE FALSE
delayed_retry_timeout 0 to n (s) 10
transaction_replay TRUE | FALSE FALSE
optimistic_trx TRUE | FALSE FALSE
max_sescmd_history 0 to n 50

Read-write splitting: replication
Primary Standby Standby Primary Standby Standby Primary Standby Standby
writes reads
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)

Read-write splitting: clustering
Node
(role=master)
Node
(role=slave)
writes reads
Node
(role=slave)
Node
(role=master)
Node
(role=slave)
Node
(role=slave)
Node
(role=master)
Node
(role=slave)
Node
(role=slave)
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)

Read consistency: replication
If read consistency is required, enable causal reads.
● Takes advantage of GTID (MASTER_GTID_WAIT function)
● Waits for a standby to catch up to the client
● If it doesn’t catch up in time, the query is routed to the primary
causal_reads TRUE | FALSE FALSE
causal_reads_timeout 0 to n (s) 10

Read consistency: clustering
wsrep_sync_wait 0 (DISABLED)
1 (READ)
2 (UPDATE and DELETE)
3 (READ, UPDATE and DELETE)
4 (INSERT and REPLACE)
5 (READ, INSERT and REPLACE)
6 (UPDATE, DELETE, INSERT and REPLACE)
7 (READ, UPDATE, DELETE, INSERT and REPLACE)
8 (SHOW), 9-15 (1-7 + SHOW)
0 (DISABLED)
If read consistency is required, set the wsrep_sync_wait system variable to 1.

Multiple data centers: replication
Data Center (DC1, Active) Data Center (DC2, Passive)
Standby Standby Primary Primary Standby Standby
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)

Multiple data centers: clustering
Data Center (DC1, Active) Data Center (DC2, Passive)
Node 1
(P1: priority=1,
P2: priority=3)
Node 2
(P1: priority=2,
P2: priority=2)
Node 3
(P1: priority=3,
P2: priority=1)
Clustering
(synchronous replication)
MariaDB MaxScale
(Proxy)
MariaDB MaxScale
(Proxy)

Dedicated read scaling: clustering
Primary Binlog
Server
Sec 1 Sec 2 Sec n
Standby
Writes
(port 3307)
Reads
(port 3308)
Standby
Cluster 1 Cluster 2
MariaDB MaxScale
(Proxy)

Reference deployment
Node 4
(arbiter)
Node 5 Node 6 Node 7
MariaDB MaxScale
(active)
MariaDB MaxScale
(active)
MariaDB MaxScale
(passive)
Standby
(1-hour delay)
Standby
(1-day delay)
Standby
(1-week delay)
Node 1 Node 2 Node 3
US West
US Central
US Central
99.999% availability (5.26 minutes downtime/year), RTO < 5 minutes, RPO = 0 seconds

Dedicated backup: replication
Standby 1
(backups)
Primary Standby 2
(reads)
Standby 3
(reads)
MariaDB
Backup
MariaDB MaxScale
(Proxy)

Replication: binlog
sync_binlog 0 (defer to OS), n (number of group commits to fsync) 0 (deter to OS)
binlog_format STATEMENT | ROW | MIXED MIXED
log_bin_compress 0 (OFF), 1 (ON) 0 (OFF)
1. You can fsync multiple transactions by enabling group commits (sync_binlog=1)
2. You can use the binlog ROW format if transactions take a long time or result in small changes
3. You can compress binlog events to reduce disk and network IO

Replication: parallelization
slave-parallel-mode optimistic | conservative | aggressive | minimal | none conservative
slave-parallel-threads 0 - n 0
binlog_commit_wait_count 0 - n 0
binlog_commit_wait_usec 0 - n 100000 (100ms)
read_binlog_speed_limit 0 (unlimited), n (kb) 0
1. You can execute transactions in parallel on standbys (slave-parallel-threads > 0)
2. You can throttle replication to reduce standby load on the primary

Clustering: async write/flush
innodb_flush_log_at_tx_commit 0 (write and flush once a second)
1 (write and flush during commit)
2 (write during commit, flush once a second)
1
1. You can fsync InnoDB logs asynchronously because synchronous replication
provides durability

How to provide enterprise high availability with MariaDB Platform

Recommended

Recommended

More Related Content

Similar to How to provide enterprise high availability with MariaDB Platform

Similar to How to provide enterprise high availability with MariaDB Platform (20)

Recently uploaded

Recently uploaded (20)

How to provide enterprise high availability with MariaDB Platform