2 - Trafodion and Hadoop HBase

Trafodion
Transactional SQL-on-HBase
Trafodion and Hadoop / HBase
www.trafodion.org
HP © Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Trafodion innovation built upon Hadoop stack
Leverages Hadoop and
HBase for core modules
• Maintains API compatibility
• Inherited scalability and
availability
Differentiation
• ANSI SQL via ODBC/JDBC
• Relational schema abstraction
• Distributed transaction protection
• Mature SQL technology
• Automatic parallelism
Hadoop Trafodion
Client Application using
ODBC/JDBC on
Windows/Linux
Client Services for ODBC and JDBC
HBase
HP © Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject 2 to change without notice.
Hive
HDFS
Zookeeper
SQL Compiler / Optimizer / Executor
Distributed Transaction Manager
+

HBase vs. Trafodion comparison
HBase Trafodion + HBase
Data abstraction Key and value pair Relational schema
Physical Layout Column family store where
row data is stored together by
cells
Same except there is a single column
family with space-saving column
encoding
Column values Uninterpreted array of bytes Explicitly defined and enforced data
types
ACID Guarantee Single row atomicity Multi- SQL statements, tables, and
rows defined as part of transaction
Language API Get/put/delete SQL (Trafodion invokes native HBase
API)
Row Key Index Single (string) row key Composite (multi-column) row key
Secondary Indexes Not supported Arbitrary secondary key columns

Salting of row keys
How it works
• HBase table gets created, pre-split with one
region per salt value
• A hash value column, “_SALT_”, is added as a
prefix to the row key
• Salting is transparent to SQL statements
– Automatically computed during insert/update
statements
– Predicates automatically generated where feasible
– Minimal overhead for direct lookup by key value
Benefits
• Even data distributions across HBase regions
• Avoids region hotspots caused by insertion of
data in row key order
INSERT(s) SELECT(s)
HBase
Region
PART 1 PART 2 PART 3 PART 4
HDFS
CREATE TABLE t(a integer not null primary key, b
integer) SALT USING 4 PARTITIONS;
HBase
Region
HDFS
HBase
Region
HDFS
HBase
Region
HDFS

Trafodion and Hadoop – Better Together!
Leverages and extends Hadoop for transactional SQL workloads
Complete: Full-function ANSI SQL
Reuse existing SQL skills and improve developer productivity
Protected: Distributed ACID transactions
Guarantees data consistency across multiple rows, tables, SQL statements
Efficient: Optimized for low-latency read and write
transactions
Supports real-time transaction processing applications
Flexible: Schema flexibility and multi-structured data
Seamlessly integrates structured, unstructured, and semi-structured data
Interoperable: Standard ODBC/JDBC access
Works with existing tools and applications
Open: Hadoop and Linux distribution neutral
Easy to add to your existing infrastructure and no vendor lock-in
Reuse SQL
skills
Scale without
complexity
Complements
Hadoop
Reduce
Costs
Real-time
Performanc
e
+

See for yourself…
Come discover and develop on Trafodion
www.trafodion.org

2 - Trafodion and Hadoop HBase

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to 2 - Trafodion and Hadoop HBase

Similar to 2 - Trafodion and Hadoop HBase (20)

Recently uploaded

Recently uploaded (20)

2 - Trafodion and Hadoop HBase

Editor's Notes