vFabric SQLFire Introduction

SQLFire Scalable SQL instead of NoSQL Jags Ramnarayan Chief Architect, GemFire Products Jags Ramnarayan

Agenda Various NoSQL attributes and why SQL SQLFire features + Demo Scalability patterns Hash partitioning Entity groups and collocation Scaling behavior using “data aware stored procedures” Consistency model How we do distributed transactions Shared nothing persistence

3 We Challenge the traditional RDBMS design NOT SQL First write to LOG Second write to Data files Buffers primarily tuned for IO ,[object Object]

Design roots don’t necessarily apply today

Disk synchronization bottlenecksConfidential

“Shared nothing” commodity clusters focus shifts to memory, distributing data and clustering Scale by partitioning the data and move behavior to data nodes HA within cluster and across data centers Add capacity to scale dynamically Common themes in next-gen DB architectures 4 NoSQL, Data Grids, Data Fabrics, NewSQL Confidential

What is different ? ,[object Object]

Column family (inspired by Google BigTable)

Most focus on making model less rigid than SQL

Consistency model is not ACIDLow scale Very high scale High scale Tunable Consistency Eventual STRICT – Full ACID (RDB) 5

What is our take with SQLFire? Eventual consistency is too difficult for the average developer Write(A,1)  Read(A) may return 2 or (1,2) SQL : Flexible, easily understood, strong type system essential for integrity as well as query engine efficiency

SQLFire Replicated, partitioned tables in memory. Redundancy through memory copies. Data resides on disk when you explicitly say so Powerful SQL engine: standard SQL for select, DML DDL has SQLF extensions Leverages GemFire data grid engine.

SQLFire Applications access the distributed DB using JDBC, ADO.NET Consistency model is FIFO, Tunable Distributed transactions without global locks

SQLFire Asynchronous replication over WAN Synchronous replication within cluster Clients failover, failback Easily integrate with existing DBs - caching framework to read through, write through or write behind

SQLFire When nodes are added, data and behavior is rebalanced without blocking current clients "Data aware procedures“ - standard Java stored procedures with "data aware" and parallelism extensions

Flexible Deployment Topologies Java Application cluster can host an embedded clustered database by just changing the URL jdbc:sqlfire:;mcast-port=33666;host-data=true Confidential 11

Flexible Deployment Topologies Confidential 12

Explore features through example Assume, thousands of flight rows, millions of flightavailability records

SQLF Creating Tables CREATE TABLE FLIGHTS ( FLIGHT_ID CHAR(6) NOT NULL PRIMARY KEY, SEGMENT_NUMBER INTEGER NOT NULL , ORIG_AIRPORT CHAR(3), DEPART_TIME TIME, … ) ; Hash partitioned on PK by default Table Partitioned Table Partitioned Table Partitioned Table SQLF SQLF SQLF

CREATE TABLE FLIGHTAVAILABILITY ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , FLIGHT_DATE DATE NOT NULL , ECONOMY_SEATS_TAKEN INTEGER DEFAULT 0, …) PARTITION BY COLUMN (FLIGHT_ID) COLOCATE WITH (FLIGHTS) CREATE TABLE FLIGHTS ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , ORIG_AIRPORT CHAR(3), DEPART_TIME TIME, …) PARTITION BY COLUMN (FLIGHT_ID); CREATE TABLE Airlines AIRLINE CHAR(2) NOT NULL PRIMARY KEY, AIRLINE_FULL VARCHAR(24), BASIC_RATE DOUBLE PRECISION, DISTANCE_DISCOUNT DOUBLE PRECISION,…. ) CREATE TABLE FLIGHTS ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , ORIG_AIRPORT CHAR(3), DEPART_TIME TIME, …) PARTITION BY COLUMN (FLIGHT_ID)REDUNDANCY 1; CREATE TABLE Airlines AIRLINE CHAR(2) NOT NULL PRIMARY KEY, AIRLINE_FULL VARCHAR(24), BASIC_RATE DOUBLE PRECISION, DISTANCE_DISCOUNT DOUBLE PRECISION,…. ) REPLICATE; Replicated Table Replicated Table Replicated Table Table Redundant Partition Redundant Partition Partitioned Table Partitioned Table Redundant Partition Partitioned Table SQLF SQLF SQLF SQLF Creating Tables Colocated Partition Colocated Partition Colocated Partition

CREATE TABLE Airlines AIRLINE CHAR(2) NOT NULL PRIMARY KEY, AIRLINE_FULL VARCHAR(24), BASIC_RATE DOUBLE PRECISION, DISTANCE_DISCOUNT DOUBLE PRECISION,…. ) Table SQLF SQLF SQLF SQLF Creating Tables

CREATE TABLE Airlines AIRLINE CHAR(2) NOT NULL PRIMARY KEY, AIRLINE_FULL VARCHAR(24), BASIC_RATE DOUBLE PRECISION, DISTANCE_DISCOUNT DOUBLE PRECISION,…. ) REPLICATE; Replicated Table Replicated Table Replicated Table SQLF SQLF SQLF SQLF Creating Tables

SQLF Creating Tables CREATE TABLE FLIGHTS ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , ORIG_AIRPORT CHAR(3), DEPART_TIME TIME, PARTITION BY COLUMN (FLIGHT_ID); Table Replicated Table Replicated Table Replicated Table Partitioned Table Partitioned Table Partitioned Table SQLF SQLF SQLF

CREATE TABLE FLIGHTS ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , ORIG_AIRPORT CHAR(3), DEPART_TIME TIME, …) PARTITION BY COLUMN (FLIGHT_ID)REDUNDANCY 1; Table Replicated Table Replicated Table Replicated Table Partitioned Table Partitioned Table Partitioned Table Redundant Partition Redundant Partition Redundant Partition SQLF SQLF SQLF SQLF Creating Tables

CREATE TABLE FLIGHTAVAILABILITY ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , FLIGHT_DATE DATE NOT NULL , ECONOMY_SEATS_TAKEN INTEGER DEFAULT 0, …) PARTITION BY COLUMN (FLIGHT_ID) COLOCATE WITH (FLIGHTS) Table Replicated Table Replicated Table Replicated Table Partitioned Table Partitioned Table Partitioned Table Colocated Partition Colocated Partition Colocated Partition Redundant Partition Redundant Partition Redundant Partition SQLF SQLF SQLF SQLF Creating Tables

By default, it is only the data dictionary that is persisted to disk. Table Replicated Table Replicated Table Replicated Table Partitioned Table Partitioned Table Partitioned Table Colocated Partition Colocated Partition Colocated Partition Redundant Partition Redundant Partition Redundant Partition SQLF SQLF SQLF SQLF Creating Tables

CREATE TABLE FLIGHTAVAILABILITY ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , FLIGHT_DATE DATE NOT NULL , ECONOMY_SEATS_TAKEN INTEGER DEFAULT 0, …) PARTITION BY COLUMN (FLIGHT_ID) COLOCATE WITH (FLIGHTS) PERSISTENT ; Table Replicated Table Replicated Table Replicated Table Partitioned Table Partitioned Table Partitioned Table Redundant Partition Redundant Partition Redundant Partition SQLF SQLF SQLF SQLF Creating Tables Colocated Partition Colocated Partition Colocated Partition

Partitioning Options To partition using the Primay Key, use: (Primary Key’s Java implementation must hash evenly across its range) PARTITION BY PRIMARY KEY CREATE TABLE FLIGHTS ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , ORIG_AIRPORT CHAR(3), DEPART_TIME TIME, … ) PARTITION BY PRIMARY KEY;

Partitioning Options When you wish to partition on a column or columns that are not the primary key, use: PARTITION BY COLUMN (column-name [ , column-name ]*) CREATE TABLE FLIGHTAVAILABILITY ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , FLIGHT_DATE DATE NOT NULL , ECONOMY_SEATS_TAKEN INTEGER DEFAULT 0, …) PARTITION BY COLUMN (FLIGHT_ID);

Partitioning Options You can partition entries based on a range of values of one of the columns: PARTITION BY RANGE (column-name ) ( VALUES BETWEEN value AND value [ , VALUES BETWEEN value AND value ]*) CREATE TABLE FLIGHTAVAILABILITY ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , FLIGHT_DATE DATE NOT NULL , ECONOMY_SEATS_TAKEN INTEGER DEFAULT 0, …) PARTITION BY RANGE ( economy_seats_taken ) ( VALUES BETWEEN 0 AND 50, VALUES BETWEEN 50 AND 100, VALUES BETWEEN 100 AND 500);

Partitioning Options You can explicitly partition entries based on a list of potential values of a column: PARTITION BY LIST ( column-name ) ( VALUES ( value [ , value ]* ) [ , VALUES ( value [ , value ]* ) ]* ) CREATE TABLE Orders (OrderId INT NOT NULL, ItemId INT, NumItems INT, CustomerId INT, OrderDate DATE, Priority INT, Status CHAR(10), CONSTRAINT Pk_Orders PRIMARY KEY (OrderId) CONSTRAINT Fk_Items FOREIGN KEY (ItemId) REFERENCES Items(ItemId)) PARTITION BY LIST ( Status ) ( VALUES ( 'pending', 'returned' ), VALUES ( 'shipped', 'received' ), VALUES ( 'hold' ));

Default Partitioning Yes Start Use explicit directives Is partitioning declared? No Is the referenced table partitioned on the foreign key? Yes Colocate with referenced table Yes Are there foreign keys? No If no PARTITION BY clause is specified, GemFire SQLF will automatically partition and colocate tables based on this algorithm. Yes Partition by primary key Is there a primary key? Hashing is performed on the Java implementation of the column’s type. No Yes Partition by the first UNIQUE column Are there UNIQUE columns? No Partition by internally generated row id

Demo default partitioned tables, colocation, persistent tables

Scaling with Partitioned tables

Hash partitioning for linear scaling Key Hashing provides single hop access to its partition But, what if the access is not based on the key … say, joins are involved

Hash partitioning only goes so far Consider this query : Select * from flights, flightAvailability where <equijoin flights with flightAvailability> and flightId ='xxx'; If both tables are hash partitioned the join logic will need execution on all nodes where flightavailability data is stored Distributed joins are expensive and inhibit scaling joins across distributed nodes could involve distributed locks and potentially a lot of intermediate data transfer across nodes EquiJOIN of rows across multiple nodes is not supported in SQLFire 1.0

Partition aware DB design Designer thinks about how data maps to partitions The main idea is to: minimize excessive data distribution by keeping the most frequently accessed and joined data collocated on partitions Collocate transaction working set on partitions so complex 2-phase commits/paxos commit is eliminated or minimized. Read Pat Helland’s “Life beyond Distributed Transactions” and the Google MegaStore paper

Partition aware DB design Turns out OLTP systems lend themselves well to this need Typically it is the number of entities that grows over time and not the size of the entity. Customer count perpetually grows, not the size of the customer info Most often access is very restricted and based on select entities given a FlightID, fetch flightAvailability records given a customerID, add/remove orders, shipment records Identify partition key for “Entity Group” "entity groups": set of entities across several related tables that can all share a single identifier flightIDis shared between the parent and child tables CustomerID shared between customer, order and shipment tables

Partition aware DB design Entity groups defined in SQLFire using “colocation” clause Entity group guaranteed to be collocated in presence of failures or rebalance Now, complex queries can be executed without requiring excessive distributed data access

Partition Aware DB design STAR schema design is the norm in OLTP design Fact tables (fast changing) are natural partitioning candidates Partition by: FlightID … Availability, history rows colocated with Flights Dimension tables are natural replicated table candidates Replicate Airlines, Countries, Cities on all nodes Dealing with Joins involving M-M relationships Can the one side of the M-M become a replicated table? If not, run the Join logic in a parallel stored procedure to minimize distribution Else, split the query into multiple queries in application

vFabric SQLFire Introduction

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Similar to vFabric SQLFire Introduction

Similar to vFabric SQLFire Introduction (20)

Recently uploaded

Recently uploaded (20)

vFabric SQLFire Introduction

Editor's Notes