This document discusses data high availability with TiDB. It provides an overview of TiDB's architecture including TiKV for data storage using Raft consensus, Placement Driver (PD) for orchestration, and TiFlash for analytics. It describes how TiDB uses labels to place regions across nodes to achieve high availability and fault tolerance. It also discusses election processes, replication, and automatic failover to maintain high availability of the TiDB cluster.
Data High Availability With TiDB
1. Data High Availability With TiDB
Oct 28th, 2023
Mydbops MyWebinar - 28
Kabilesh PR
Co-Founder, Mydbops
2. About Me
Kabilesh PR
Co-Founder, Mydbops
• Interested in Open Source technologies
• PingCAP Certified TiDB Associate
• Tech Speaker/Blogger
• AWS Certified Database - Specialty
• AWS Community Builder / SME
6. What is TiDB?
• An Open Source distributed SQL database by PingCAP
• Scales out and scales in horizontally (both compute and storage)
• Supports both Transactional and Analytical workloads (HTAP)
• Compatible with the MySQL protocol
• Highly available
• Fault tolerant
9. TiKV - Data Storage
• Stateful
• Persistent storage with RocksDB
• Distributed storage & transaction support
• Strong consistency
• Minimum of 3 nodes (HA by default)
10. What is Raft?
• Raft is a consensus-based method to replicate data and maintain HA
• Reliable, Replicated, Redundant, And Fault-Tolerant (RAFT)
• A Raft group has one leader and several followers
• The leader handles read/write operations
• Followers participate in elections and can serve data reads
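To make the leader/follower split concrete, here is a minimal Go sketch of one Raft group and its quorum size. The names (`RaftGroup`, `Peers`, `Quorum`) are invented for this illustration and are not TiKV's actual API.

```go
package main

import "fmt"

// Role of a peer within one Raft group. These names are illustrative;
// TiKV's actual Raft implementation (raft-rs) defines its own types.
type Role int

const (
	Follower Role = iota
	Candidate
	Leader
)

// RaftGroup is a toy model of a single Raft group: one leader plus followers.
type RaftGroup struct {
	Peers map[string]Role // TiKV store ID -> role of its replica
}

// Quorum is the number of replicas that must acknowledge a log entry
// before it counts as committed: a strict majority of the group.
func (g *RaftGroup) Quorum() int {
	return len(g.Peers)/2 + 1
}

func main() {
	g := &RaftGroup{Peers: map[string]Role{
		"tikv-1": Leader,   // handles read/write operations
		"tikv-2": Follower, // votes in elections, can serve reads
		"tikv-3": Follower,
	}}
	fmt.Println("quorum size:", g.Quorum()) // 2 of the 3 replicas must ack
}
```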
11. TiKV - Regions and Raft
• Region: the smallest unit of data for replication
• Each Region is replicated 3 times (by default) and distributed among the TiKV nodes
• Regions are automatically balanced across the TiKV nodes
• The replicas of one Region form a Raft group
• The default Region size is 96 MiB
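As a rough illustration of these points, the sketch below models a Region as a key range with three replicas and a split check against the 96 MiB default. `Region`, `needsSplit`, and the field names are made up for this example, not TiKV's metadata types.

```go
package main

import "fmt"

// Default Region split size in TiKV (96 MiB).
const defaultRegionSplitSize = 96 << 20

// Region is a toy model of a TiKV Region: a contiguous key range whose
// replicas together form one Raft group.
type Region struct {
	StartKey, EndKey []byte
	Replicas         []string // store IDs holding a copy of this Region
	SizeBytes        int64
}

// needsSplit reports whether the Region has grown past the split threshold,
// after which TiKV would split it into two smaller Regions.
func (r *Region) needsSplit() bool {
	return r.SizeBytes >= defaultRegionSplitSize
}

func main() {
	r := Region{
		StartKey:  []byte("a"),
		EndKey:    []byte("m"),
		Replicas:  []string{"tikv-1", "tikv-2", "tikv-3"}, // 3 replicas by default
		SizeBytes: 100 << 20,                              // 100 MiB of data
	}
	fmt.Println("split needed:", r.needsSplit()) // true: 100 MiB > 96 MiB
}
```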
13. Replication With Regions
• The leader replicates log entries to the followers
• It waits for a quorum of acks and then marks the write as durable
• Committed log entries are applied to the RocksDB KV store
• TiFlash syncs from the Raft log as a Raft learner
14. Replication Process
• PROPOSE - The write request at the leader is proposed as a Raft log entry
• APPEND - The leader writes the Raft log locally to its Raft Engine
• REPLICATE - The Raft log entry is replicated to the followers
• APPEND - Each follower writes the Raft log locally to its Raft Engine
• COMMIT - Followers ack back to the leader once the Raft log is durable
• APPLY - The write is applied from the Raft log to TiKV's RocksDB
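The following Go sketch walks through these steps in order for a single write. It is a toy model under simplifying assumptions (in-memory "log" and "KV store", synchronous acks); all names are invented here, and the real flow lives inside TiKV's raftstore and Raft Engine.

```go
package main

import "fmt"

type entry struct {
	index int
	data  string
}

type peer struct {
	id      string
	raftLog []entry           // stands in for the Raft Engine (Raft log storage)
	kv      map[string]string // stands in for the RocksDB KV engine
}

func (p *peer) appendLog(e entry) { p.raftLog = append(p.raftLog, e) } // APPEND
func (p *peer) apply(k, v string) { p.kv[k] = v }                      // APPLY

func main() {
	leader := &peer{id: "tikv-1", kv: map[string]string{}}
	followers := []*peer{
		{id: "tikv-2", kv: map[string]string{}},
		{id: "tikv-3", kv: map[string]string{}},
	}

	e := entry{index: 1, data: "k1=v1"} // PROPOSE: the write becomes a Raft log entry
	leader.appendLog(e)                 // APPEND: the leader persists it locally

	acks := 1 // the leader's own durable copy counts towards the quorum
	for _, f := range followers {
		f.appendLog(e) // REPLICATE + APPEND on each follower
		acks++         // COMMIT: the follower acks once its log write is durable
	}

	quorum := (1+len(followers))/2 + 1
	if acks >= quorum {
		leader.apply("k1", "v1") // APPLY: the committed entry goes to the KV engine
		for _, f := range followers {
			f.apply("k1", "v1") // followers apply independently after commit
		}
		fmt.Printf("write committed with %d/%d acks and applied\n", acks, 1+len(followers))
	}
}
```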
16. Election Scenarios
Raft ensures that each Region has only one leader at a time. A new election can be triggered by:
• Heartbeat failures - network failures, node crash
• New Region split
• Node restart
• Election timeout
17. Election Process
Raft maintains terms; a leader serves until its term ends.
The leader emits heartbeats to its followers.
When heartbeats fail due to a node crash or network failure,
a follower changes its role to CANDIDATE and increments the TERM.
It votes for itself and then sends voting requests to the other nodes along with the TERM value.
On gaining a MAJORITY of votes it becomes the leader and resumes sending heartbeats.
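A minimal Go sketch of this flow is shown below: a follower times out, bumps the term, votes for itself, requests votes, and becomes leader on a majority. The `node`/`requestVote` names are invented for the sketch; real Raft also compares log freshness before granting a vote, which is omitted here.

```go
package main

import "fmt"

type node struct {
	id          string
	term        int
	votedInTerm map[int]bool // whether this node already voted in a given term
}

// requestVote grants the vote if the candidate's term is not stale and
// this node has not voted in that term yet.
func (n *node) requestVote(term int) bool {
	if term < n.term || n.votedInTerm[term] {
		return false
	}
	n.term = term
	n.votedInTerm[term] = true
	return true
}

func main() {
	peers := []*node{
		{id: "tikv-2", term: 5, votedInTerm: map[int]bool{}},
		{id: "tikv-3", term: 5, votedInTerm: map[int]bool{}},
	}

	// Heartbeats from the old leader stop: this follower's election timer fires.
	candidate := &node{id: "tikv-1", term: 5, votedInTerm: map[int]bool{}}
	candidate.term++ // increment the TERM
	candidate.votedInTerm[candidate.term] = true
	votes := 1 // vote for itself

	for _, p := range peers {
		if p.requestVote(candidate.term) { // voting request carries the new TERM
			votes++
		}
	}

	if votes > (len(peers)+1)/2 { // MAJORITY: become leader, resume heartbeats
		fmt.Printf("%s elected leader for term %d with %d votes\n",
			candidate.id, candidate.term, votes)
	}
}
```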
19. Placement Driver - HA
• PD consists of at least 3 nodes
• Automatic failover with the embedded etcd cluster
• etcd uses Raft to provide strong consistency
• The PD leader serves the requests
• PD maintains the labels
21. Labels for High Availability
• Labels are a way to tell the TiDB cluster how to place Regions
• By default, PD places Region replicas on TiKV nodes without topology awareness
• With labels, we can guide PD to place replicas based on DC, rack, and node
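In practice the topology is declared with TiKV's `server.labels` and PD's `replication.location-labels` settings. The Go sketch below only illustrates the idea behind label-aware placement (spread replicas across distinct values of the coarsest label, e.g. one replica per DC); `store` and `placeReplicas` are invented for this example and do not reflect PD's actual scheduler, which also falls back to finer labels such as rack and host.

```go
package main

import "fmt"

type store struct {
	id     string
	labels map[string]string // e.g. {"dc": "dc1", "rack": "r1", "host": "h1"}
}

// placeReplicas picks up to `replicas` stores, taking at most one store per
// distinct value of the given location label.
func placeReplicas(stores []store, replicas int, label string) []string {
	var picked []string
	usedLoc := map[string]bool{}
	for _, s := range stores {
		if len(picked) == replicas {
			break
		}
		loc := s.labels[label]
		if !usedLoc[loc] {
			usedLoc[loc] = true
			picked = append(picked, s.id)
		}
	}
	return picked
}

func main() {
	stores := []store{
		{"tikv-1", map[string]string{"dc": "dc1", "rack": "r1", "host": "h1"}},
		{"tikv-2", map[string]string{"dc": "dc1", "rack": "r2", "host": "h2"}},
		{"tikv-3", map[string]string{"dc": "dc2", "rack": "r1", "host": "h3"}},
		{"tikv-4", map[string]string{"dc": "dc3", "rack": "r1", "host": "h4"}},
	}
	// One replica per DC: tikv-1 (dc1), tikv-3 (dc2), tikv-4 (dc3).
	fmt.Println(placeReplicas(stores, 3, "dc"))
}
```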