1. Database in the Cloud Era
PolarDB
A database architecture for the Cloud
Date: April 9th, 2019
Time: 15:50-17:50
Venue: 7003 @ Parisian Grand Ballroom, The Parisian Macao
2. Manyi Lu
Senior Director @ Alibaba Cloud
Bio:
Manyi Lu has 20 years of experience in the database field. She currently works at Alibaba Cloud, leading MySQL RDS and PolarDB development. Previously, she was an engineering director at Oracle, heading the MySQL optimizer team. She has also held various product manager and engineering manager positions at Sun Microsystems.
3. POLARDB: a Cloud Native Database
Emerging Hardware
• NVM
• RDMA
• FPGA
Auto scaling
• Scaling up/down
• Pay by Usage
• Zero Downtime
Security
• Encryption
• Audit
• Access Control
Intelligence
• Self-configuration
• Self-optimization
• Self-diagnosis
• Self-healing
CLOUD NATIVE
User Oriented
4. Cloud Native Architecture
• Scale compute and storage independently
• Shared storage
• Cross-AZ failover
• Optimize division of functionality between storage and compute
• Tight integration with other cloud components like metering, monitoring, and the control plane
• Optimize for hardware in the data centers
• Compatible with MySQL/PostgreSQL, etc.
• Security
[Diagram: PolarProxy (intelligent proxy, 100% compatible) in front of PolarDB, which sits on PolarFS and PolarStore (storage optimized for database)]
5. PolarStore: Architecture Overview
- Designed for Emerging Hardware
- Low Latency Oriented
- Active R/W - Active RO
- High Availability
Key Components: 1. libpfs 2. PolarSwitch 3. ChunkServer 4. PolarCtrl
[Diagram: on each host, POLARDB accesses its volume through libpfs and PolarSwitch; volumes are divided into chunks stored on ChunkServers and replicated with ParallelRaft; PolarCtrl holds the metadata, with separate data and control routes]
6. PolarStore: Design for Emerging Hardware
- No context switch
- OS bypass & zero-copy, network over RDMA
- Parallel random I/O absorbed by Optane
- Excellent performance with fewer long-tail latency issues
- No need for over-provisioning
- WAL log in 3D XPoint Optane
[Diagram: POLARDB writes via libpfs to shared memory and over the RDMA network to three Chunkservers, each equipped with an RDMA NIC, memory, Optane, and NVMe SSDs]
7. Dynamic Scaling
Fast scaling:
- Upgrade from 2 vCPU to 32 vCPU in only 5 minutes
- Add more replicas in only 5 minutes
Lower cost: 30%-50% off
[Diagram: MySQL master and replicas each on local storage vs. POLARDB master and replicas on shared storage]
[Chart: total cost of 4 vCPU / 32 GB memory / 500 GB storage by replica count; RDS MySQL grows from 6,940 (1 replica) to 39,844 (10 replicas), while POLARDB grows from 4,949 to 20,949]
8. Shared-Nothing Logical Replication vs. Shared-Storage Physical Replication
[Diagram: with shared-nothing logical replication, master and slave each keep their own data, redo log, and binlog on local storage and ship the binlog; with shared-storage physical replication, master and slaves share one copy of the data and redo log on POLARSTORE]
Physical Replication is much more reliable than Logical Replication
9. Shared-Nothing Logical Replication vs. Shared-Storage Physical Replication
10. HTAP - Parallel Query
Reduce latency of complex queries: one query is executed by multiple workers on the server, with a parallel scan in the storage engine.
[Chart: DBT-3 Query 6 linear scalability; latency (log scale, 2-1024) vs. 1-32 workers for tpch5/10/20/40, each curve tracking its ideal linear-scalability line]
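The one-query/multiple-workers idea above can be sketched as a partitioned scan with partial aggregation. This is a toy illustration, not PolarDB's actual executor; the table, filter, and worker count are made up.

```python
from concurrent.futures import ThreadPoolExecutor

# Toy row store: (row_id, revenue) pairs standing in for a table like lineitem.
ROWS = [(i, i % 97) for i in range(100_000)]

def scan_partition(part):
    # Each worker scans one slice of the table and applies the
    # filter and aggregate locally (a partial SUM, as in DBT-3 Q6).
    return sum(rev for _, rev in part if rev > 50)

def parallel_query(rows, workers=4):
    # Split the table into contiguous ranges, one per worker,
    # then merge the partial sums on the coordinator.
    step = (len(rows) + workers - 1) // workers
    parts = [rows[i:i + step] for i in range(0, len(rows), step)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(scan_partition, parts))

# The parallel plan must return the same answer as a serial scan.
assert parallel_query(ROWS) == scan_partition(ROWS)
```

Partitioning at the storage layer is what makes the scan itself parallel; the merge step here is trivial because SUM is decomposable.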
11. Single-Master Architecture
- Single endpoint
- Transparent failover
- Attack protection
- Causal consistent read
- Read/write split
- High availability
- Load balance
- Security
[Diagram: application connects to a proxy cluster in front of the master and replicas on shared storage]
12. Read/Write Split - Session Consistency
I am Manyi Lu. I previously worked in the MySQL optimizer team at Oracle.
We at Alibaba Cloud have been offering relational databases as a service for many years. Based on our extensive experience with RDS, we have developed a cloud native database, which we believe can better serve our cloud customers.
What makes PolarDB unique:
It is built on emerging hardware. A cloud native database allows us to tightly integrate software and hardware in a way that is hard to achieve on premises. PolarDB's storage layer leverages a range of modern hardware: NVMe, RDMA, and FPGA.
One of the top benefits of moving to the cloud, compared to on-premises, is that users can seamlessly scale up and down based on their business needs.
Security is always a concern when users move to the cloud. Rest assured, we offer transparent data encryption, auditing, and access control.
Intelligence: since we offer a managed service, we manage hundreds of thousands of database instances, and we need automation both to reduce our maintenance cost and to give our users a better experience. We have an extensive monitoring system in place, and on top of it automatic failure detection, self-diagnosis, and self-healing. Cloud is all about economy of scale; we must reduce human involvement to achieve that.
PolarStore is distributed shared storage consisting of a cluster of ChunkServers. Data is divided into chunks; each chunk has three identical copies, and the ParallelRaft protocol guarantees consistency between the replicas. PolarFS is a file system developed specifically for PolarDB; it allows the database engine to access PolarStore as if it were local storage.
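The chunk layout described above can be sketched as follows. This is a toy model: the chunk size, the server names, and the rendezvous-hashing placement are all assumptions for illustration; the talk does not describe PolarCtrl's actual placement policy.

```python
import hashlib

CHUNK_SIZE = 10 * 2**30  # assume 10 GB chunks (illustrative)
REPLICAS = 3             # each chunk is stored on three ChunkServers
CHUNKSERVERS = ["cs-%d" % i for i in range(12)]  # hypothetical cluster

def chunk_index(offset):
    """Which chunk of a volume a byte offset falls into."""
    return offset // CHUNK_SIZE

def placement(volume, chunk):
    """Pick three distinct ChunkServers for a chunk via toy rendezvous
    hashing: score every server against the chunk id and take the top 3."""
    def score(cs):
        h = hashlib.sha256(("%s/%d/%s" % (volume, chunk, cs)).encode())
        return int(h.hexdigest(), 16)
    return sorted(CHUNKSERVERS, key=score)[:REPLICAS]

# A given chunk always maps to the same three servers.
replicas = placement("volume-1", chunk_index(0))
```

In the real system, the three copies are kept consistent with ParallelRaft rather than being written independently; the point here is only the chunking and 3-way placement.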
PolarDB supports RoCE Ethernet networks, where an application passes through the RDMA network to write the contents of its memory directly into the memory of the target machine, or to read data directly from the target machine's memory into its own. In between, the communication protocol codec and relay mechanism are both handled by the RDMA NIC, without any participation from the CPU.
PolarFS uses a full user-space I/O stack, including RDMA and SPDK, to avoid the overhead of the kernel network stack and storage stack.
PolarDB uses leading hardware technology. It uses an Optane storage card as a cache in front of NVMe SSDs, which ensures stable, low write latency and high throughput while keeping the whole system cost-effective.
PolarDB uses an RDMA network between chunk servers, and between the DB nodes and the storage layer. RDMA has more or less removed networking as the performance bottleneck: over RDMA, an application can write the contents of its memory directly into the memory of the target machine, or read data directly from the target machine's memory into its own, without any participation from the CPU.
When your database load increases, you will want to add more replicas to support it. In POLARDB, adding more replicas does not require extra disk space: all server instances use the same shared data files, whereas with traditional MySQL replication each server keeps its own copy of the data. So as the number of replicas grows, the cost savings of POLARDB grow with it. Scaling out is also much faster, since there is no need to copy data when adding replicas.
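The arithmetic behind "savings grow with the replica count" can be made concrete with a toy cost model. The prices below are invented illustrative units, not real Alibaba Cloud pricing; only the structure (storage paid once vs. once per node) reflects the argument above.

```python
COMPUTE = 1600  # assumed per-node compute cost (4 vCPU / 32 GB), per month
STORAGE = 1700  # assumed cost of one 500 GB copy of the data, per month

def rds_cost(replicas):
    # Shared-nothing: master plus each replica stores a full copy.
    nodes = 1 + replicas
    return nodes * (COMPUTE + STORAGE)

def polardb_cost(replicas):
    # Shared storage: every node pays for compute, the data is stored once.
    nodes = 1 + replicas
    return nodes * COMPUTE + STORAGE

for n in (1, 5, 10):
    saving = 1 - polardb_cost(n) / rds_cost(n)
    print("%2d replicas: saving %.0f%%" % (n, 100 * saving))
```

With these assumed prices the savings land roughly in the 30%-50% range the slide claims, and they approach `STORAGE / (COMPUTE + STORAGE)` as the replica count grows.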
POLARDB uses physical replication instead of the logical replication used in traditional MySQL replication. This means the redo log, which InnoDB writes in order to recover from failures, is also used to replicate changes to other servers. So while logical replication writes both a logical log (the binlog) and a physical log (the redo log) to local storage on every server, all POLARDB servers share the same redo log files.
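One reason physical replication is considered more reliable is that logical replay can diverge on non-deterministic statements, while applying the physical result cannot. The sketch below is a toy model of that difference, not MySQL's actual binlog or redo formats.

```python
import time

def execute(db, stmt):
    """Master-side execution: run the statement, and return a 'redo record'
    (the resulting value) that physical replication would ship."""
    key, func = stmt
    db[key] = func()
    return (key, db[key])

master, logical_replica, physical_replica = {}, {}, {}
stmt = ("t", time.time)  # non-deterministic: "store the current time"

redo = execute(master, stmt)

# Logical replication re-executes the statement on the replica,
# so it may observe a different clock value than the master did.
execute(logical_replica, stmt)

# Physical replication applies the resulting value, byte for byte,
# so the replica is always identical to the master.
physical_replica[redo[0]] = redo[1]
assert physical_replica == master
```

Real logical replication mitigates this with row-based logging, but the redo log sidesteps the problem entirely by describing page changes rather than statements.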
Left: binlog replication is statement based, so a DDL statement starts on the replica only AFTER it has completed on the master. The master can execute other transactions and generate log against the new schema as soon as the DDL operation completes; on the slave, the DDL operation starts late, and the slave must accumulate all the log from the master until the DDL has completed.
Right: PolarDB physical replication. No data changes are needed on the replica thanks to the shared disk, but MDL locking is still required, along with invalidation of the dictionary cache on the replica.
Currently, we only support a single master with multiple replicas.
Single endpoint: one endpoint; the proxy handles load balancing and read/write splitting.
Transparent failover: when the master fails, fail over to an RO node. On an AZ failure, fail over to another AZ.
Causal consistent read: read your own writes. MySQL asynchronous/semi-synchronous replication does not provide this, due to replication lag.
On commit of an update (e.g. UPDATE user SET ...), the master includes the LSN in the same packet as the commit ack and sends it to the proxy.
As an example, take an application that updates one row, commits the update, and then tries to read the same row. We need to make sure that the recently committed version of the row is visible at the read replica where it is to be read. To achieve this, the master returns the log sequence number of the update to the proxy, and the proxy tells the read replica to make sure the corresponding redo log record has been applied before the read is performed.
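That read-your-own-writes flow can be sketched as follows. This is a toy model with invented class and method names; in the real system the redo application is asynchronous and the proxy would wait (or hedge to the master) rather than poll.

```python
class Master:
    def __init__(self):
        self.lsn = 0
        self.data = {}
    def commit(self, key, value):
        self.lsn += 1
        self.data[key] = value
        return self.lsn  # LSN piggybacked on the commit ack

class Replica:
    def __init__(self):
        self.applied_lsn = 0
        self.data = {}
    def apply(self, lsn, key, value):
        # Redo log application (asynchronous in the real system).
        self.data[key] = value
        self.applied_lsn = lsn
    def read(self, key, min_lsn):
        # Refuse stale reads: the caller asked for at least min_lsn.
        if self.applied_lsn < min_lsn:
            raise RuntimeError("replica lagging behind session LSN")
        return self.data[key]

class Proxy:
    def __init__(self, master, replica):
        self.master, self.replica = master, replica
        self.session_lsn = 0  # highest LSN this session has written
    def write(self, key, value):
        self.session_lsn = self.master.commit(key, value)
    def read(self, key):
        try:
            return self.replica.read(key, self.session_lsn)
        except RuntimeError:
            return self.master.data[key]  # fall back to the master

m, r = Master(), Replica()
proxy = Proxy(m, r)
proxy.write("x", 1)
v1 = proxy.read("x")  # replica lags, so this is served by the master
r.apply(1, "x", 1)
v2 = proxy.read("x")  # now the replica is caught up and serves the read
```

The key design point is that causality is tracked per session with a single integer, so the proxy never needs to understand the data itself, only compare LSNs.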