DeathStar
Easy, Dynamic, Multi-tenant HBase via
YARN
Ishan Chhabra, Nitin Aggarwal
Rocketfuel Inc.
In a not so distant past…
1000 node cluster
Rogue Applications
Cannot customize per application
Hard to capacity plan or support new applications
Key Insight: HBase Multi-Tenancy and Access Patterns
Pattern 1: Online Operational Store (a single service backed by HBase)
Pattern 2: Mutable Materialized View (multiple streams joined in HBase, consumed by multiple data pipelines)
Pattern 3: Transient Cache (a prep stage loads HBase; later pipeline stages read from it)
The Common Solution:
Separate Clusters
Non-uniform network usage
Different DFSs, leading to a lot of copying of data
Low cluster utilization
High lead time for new applications
Run HBase on YARN
Built on top of Slider
Solution:
DeathStar
Provisioning Model: Hangar and App Clusters 1, 2, 3 on a shared HDFS + YARN base layer
(grid/deathstar): $ git commit
Capacity planning and configuration discussion
Create simple JSON config
As applications mature from hangar to their cluster
Dynamic Cluster:
Make API call to start, stop and scale cluster
Static Cluster:
Good to go
Clusters Today: HDFS (1000 machines) + YARN, running lsv-hangar (20), lsv-arp (100), lsv-factdata (80), lsv-rtb-aux (100), lsv-attribution (80), lsv-user-features (60), lsv-user-geo-features (10), lsv-helios-hbase (10)
Strict Isolation
Common HDFS Layer
Bulkload
MapReduce over snapshots
Fits into organization’s capacity planning model
Dynamic config and cluster size changes
Clusters out of thin air
Hot swap a new cluster (human error / corruption)
Easier HBase version upgrades and testing
Temporary scale up for backfill
“Dynamic” enables interesting use cases
Key Challenges and Solutions
Another failure mode
Taken care of by auto restarts
RM HA in the works
Early Days: Bugs
Slider did not acknowledge container allocations correctly
Fixed recently in 0.8: SLIDER-828
Early Days: Bugs
Zombie Regionservers
Not easily reproducible, still debugging
Long running apps a secondary use case
Logging, an unsolved problem
Store logs on local disks, considering ELK
Usability
YARN/Slider lack certain scheduling constraints
At most x instances per node for spread and availability
Custom patch in-house
Rolling restarts for config changes
Solved recently in 0.8: SLIDER-226
Data Locality
Metrics Reporting
Custom hadoop metrics OpenTSDB reporter
App name passed via config
Multi-Tenancy?
Shared table: a mutable materialized view fed by multiple streams and read by multiple data pipelines
Conclusion:
Is it for me?
Conclusion:
Is it worth it?
Thank you!
Questions?
Reach us at:
ishan@rocketfuel.com
naggarwal@rocketfuel.com

DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN

Editor's Notes

  • #2 Welcome to this talk on DeathStar, our in-house solution to easily and instantly provision HBase clusters via YARN.
  • #3 To understand why we are where we are today, let's go back a little in time. Not too far, just a year ago.
  • #4 A 1000-node HBase cluster co-located with our Hadoop analytics cluster powered lots of interesting and business-critical applications (“high rising skyscrapers”).
  • #5 But once in a while, a rogue application would destroy the cluster, giving us sleepless nights. It had become so bad that we ourselves were uncomfortable recommending usage or deploying our own applications.
  • #6 Additionally, a lot of important HBase properties (like block cache, memstore, etc.) are set at the regionserver (worker) level and cannot be customized per application in a shared cluster, e.g. some applications need more block cache, some don’t need memstore due to bulkloads, etc.
  • #7 It was hard to understand what cluster resources an application was consuming in terms of compute, memory and network, and hard to capacity plan for the growth of the cluster and its applications.
  • #8 Given this set of challenges, we took a step back and tried to understand how applications were using HBase.
  • #9 Use Case 1: People would build services that used HBase as their main storage engine. Only the service interacted with the cluster. No sharing.
  • #10 Use Case 2: HBase is very good at storing large amounts of data and providing point updates and reads. Multiple streams of incoming data are joined and aggregated, and made available to multiple data pipelines to consume further. Usually a single writer and multiple readers.
  • #11 Use Case 3: A large out-of-memory cache for a data pipeline (a series of MapReduce or Spark jobs). A prep stage loads the data into the cache, further stages use it, and the data is not needed after the pipeline finishes and can be cleaned up.
  • #12 Given these access patterns, we noticed that most tables do not need to be shared among multiple entities. In some companies, this problem is solved by having separate clusters.
  • #13 But we did not like that approach, due to these problems. High lead time reduces productivity and slows down engineering; we like to move fast. We buy a standard machine: some applications require high memory, others don’t, so resources in the individual clusters are often wasted. Data is written to the HBase cluster by MR jobs, leading to a lot of copying between the different clusters’ DFSs. Copying data between clusters usually leads to a non-uniform distribution of network traffic, and can bottleneck TOR switches.
  • #14 Hence, we decided to skip the separate-clusters solution and jump to running HBase in YARN containers. I hope I don’t have to clarify what YARN is to this audience. We built the solution on top of the Apache Slider project.
  • #15 We set up a simple provisioning model to make it easy for new applications to get started. The cluster runs a base layer of HDFS and YARN. We have a 30-odd-node HBase cluster called “Hangar” which is free to experiment on and has no SLAs. So if my girlfriend dumps me, I may decide to overcome my anger by wiping out the cluster, and nobody can complain. Fortunately that hasn’t happened yet. Coming back, once a developer has built a prototype, we work with them and provision a separate cluster for them.
  • #16 As applications mature from Hangar to their own cluster, we first sit down with the developer to understand how many HBase regionservers the application needs, and what configuration it needs in terms of block cache, memstore, RPC handler threads and other tunable HBase parameters. The developer then creates a simple config and commits it into our codebase (see the illustrative config sketch after these notes).
  • #17 Now there are two possibilities. If it is a static cluster (a cluster that is always running), then you are good to go; the system automatically brings up the cluster. If it is a dynamic cluster (a cluster that can be started and stopped at any time), then the application makes API calls to start, stop and scale the cluster.
  • #18 Using this model we are running 8 clusters in production today, and growing. The clusters vary in size and configuration, from 10 containers, all the way up to 100 containers.
  • #19 We found various benefits in moving to this model, some of which we had imagined, some that we hadn’t.
  • #20 We try to go with a model of one HBase cluster per application as much as possible. This gives us strict isolation between the various applications, and prevents the “godzilla” problem that we mentioned before.
  • #21 All of these clusters share a common HDFS layer, which is also used by our MR and Spark jobs. This avoids the problems of data copying and non-uniform network usage. A lot of our data pipelines load data into their HBase clusters via bulkloads, and run MapReduce over snapshots for fast access, which becomes very easy given the shared HDFS cluster (see the bulkload and snapshot-scan sketches after these notes).
  • #22 This provisioning model plays well with our organization’s capacity planning model. Each team and its subteams get a share of the YARN cluster, which is enforced via hierarchical queues in YARN. A developer uses resources assigned to his team to provision his cluster, and can negotiate for more with his team lead as needed.
  • #23 Being able to change the size of a cluster and its configuration instantly via a simple config is truly liberating for developers. For example, this is just another day in the life of a developer where he is increasing the number of nodes, and decreasing the block cache and memory allocation for his application because the application needs more RPCs and would not benefit from block cache. These changes are easy and frequent.
  • #24 And finally, one can quickly bring up clusters out of thin air, and dissolve them as needed.
  • #25 Being able to dynamically bring up and tear down clusters with different configurations enables many interesting use cases that we see in production. It becomes much easier to bring up a temporary cluster with a newer version of HBase to test applications, making the HBase version upgrade process easier and surprise-free. We have also, interestingly, seen cases where an application has written some bad or incorrect data and wants to backfill and rewrite it. The cluster is provisioned to sustain the read/write throughput of standard runs of the application, and it would take a lot of time to backfill data. In these cases, we simply scale up the cluster temporarily to allow for faster backfills, and then scale it down to support the usual throughput, making the process a lot easier for the developer. Finally, in cases where there is massive corruption of data due to human error, one can simply create a new cluster, bring it up to speed, and hot swap it with the existing cluster for an application.
  • #26 We created this solution due to our urgent needs, and started a year ago at a time when YARN and Slider were still very young. They are still very young. Let's discuss some of the key challenges, some of which are solved and some of which aren’t. This is to give you a fair idea so that you can make an informed decision if you take this route.
  • #27 By adding another system, YARN, between HBase and bare metal, we have added another set of failure modes. We have seen RM failures and restarts more often than we would like in production. This is taken care of today by our monitoring engine, which automatically restarts clusters when they fail, so that we don’t have to do it manually. RM HA is in the works on the production YARN cluster to reduce failures.
  • #28 Slider is still a young project, and there can be critical bugs that affect the entire YARN cluster and HBase applications. For example, Slider had a bug where it would hoard containers and not acknowledge allocations correctly to YARN. This led to undesirable preemption in our YARN cluster for other users. The problem was reported and fixed upstream, and the solution is part of Slider 0.8, released recently.
  • #29 We also infrequently see a weird issue of zombie regionservers running on machines when they should have shut down properly. It is not easily reproducible; we are still debugging and will push a fix upstream when we find it.
  • #30 A more fundamental problem is that long-running applications have traditionally been a secondary use case for YARN, given that MR was the first and most widely used application on YARN. This is visible sometimes. E.g., until recently, YARN would aggregate the logs and store them on HDFS only when the application finished, which makes sense for MR, but not for long-running applications. Long-running applications create huge logs on local disks, the aggregation process would invariably crash due to the huge amounts of data being copied, and the logs would not be available for inspection in real time. This was improved in YARN 2.6 to rotate and move logs every 12 hours, but it is still not enough. Logs are very important for us to understand the behavior of HBase clusters, so we decided to store them on a local disk on the node running the regionserver, in a unique directory, and today we log in directly to the machine to check logs when needed. We are now considering ELK to aggregate and search the logs in real time more easily.
  • #31 In general, usability is lacking given that Slider is such a young project, and we have ended up building various pieces around it. E.g., it is hard to find the location of the HBase master page, given that the master is provisioned on a random node with a random port, and locating it requires a bunch of steps. We built a simple UI to locate it programmatically and provide a simple “jump board”. Similarly, Slider requires one to specify all the configuration for every HBase cluster. We built a hierarchical config system where we maintain a set of sane base configurations for HBase, and every application-specific config essentially inherits from it and overrides parts as needed (see the layered-config sketch after these notes). And so on.
  • #32 Another big area of improvement needed for Slider and YARN is adding various kinds of scheduling constraints. For example, we needed the notion of scheduling at most x (usually 2-3) instances of HBase regionservers on a single physical machine, to increase availability of the cluster in case of machine failure. When we started, Slider would usually place almost all of the required containers on 2-3 nodes! We have a custom patch in-house to support this, which we are working on contributing upstream.
  • #33 Slider/YARN did not have the ability to rolling-restart an HBase cluster, which we find very useful for pushing config changes to a live cluster. This has been added recently to Slider and YARN.
  • #34 Another important point to consider is the locality of data on HDFS and the scheduling of containers. There is no support for this today, and the containers are scheduled randomly. However, if the containers are stable and not killed, you end up with locality over time, due to compactions and new writes going to the local disk first. We have been thinking about making changes to Slider and HBase to get maximum locality: Slider would try to schedule containers onto the machines that have the data for the cluster, and a custom region balancer would try to maximize data locality for the regions. We haven’t felt a strong need for it yet.
  • #35 Finally, HBase metrics are very important for us to fix problems and understand running clusters in general. We use OpenTSDB in-house for our metrics storage and visualization needs. To make this new model play well with OpenTSDB, we created a custom Hadoop metrics reporter to send metrics to OpenTSDB (see the metrics-sink sketch after these notes). We pass in the application name via config, which makes it easy to differentiate metrics for different HBase clusters in the UI.
  • #36 Now that we have made multi-tenancy easy for use cases where different applications use different HBase clusters, what about the case where they want to share an HBase table?
  • #37 For example, what about the application pattern we talked about earlier, of creating a mutable view by combining and aggregating multiple streams of data, which is then accessed by multiple data pipelines? We currently don’t face many problems for this use case, since the data pipelines usually use MapReduce over snapshots, which does not impact the HBase regionservers. We have started to see some early problems for a few use cases, though, and we plan to solve them by using features like per-user RPC queues in HBase, and adding our own in a similar vein as needed.
  • #38 So finally, having seen all the challenges behind the scenes, do I think this is the right solution for people in the audience? If you are an intermediate to advanced user of HBase with multiple applications, then I would definitely say yes. The stack is maturing quickly, and the value provided is immense.
  • #39 And is it worth it? Hell yeah! If you want to get a good night's sleep.
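
Illustrative sketch for note #16: the "simple JSON config" a developer commits is, in spirit, a Slider-style resources descriptor that names the HBase components and their sizes. The component names, property keys and numbers below are illustrative assumptions, not Rocketfuel's actual file, and the exact schema depends on the Slider version in use.

{
  "schema": "http://example.org/specification/v2.0.0",
  "metadata": {},
  "global": {},
  "components": {
    "HBASE_MASTER": {
      "yarn.component.instances": "1",
      "yarn.memory": "2048",
      "yarn.vcores": "1"
    },
    "HBASE_REGIONSERVER": {
      "yarn.component.instances": "40",
      "yarn.memory": "8192",
      "yarn.vcores": "2"
    }
  }
}

Scaling a cluster then amounts to changing "yarn.component.instances" for HBASE_REGIONSERVER and re-applying the config (or, for a dynamic cluster, making the equivalent start/stop/scale API call).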
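
Sketch for note #21 (bulkload): a minimal MapReduce job that writes HFiles and bulkloads them into a table, so data lands on the shared HDFS layer without going through the regionservers' write path. The table name, column family, input format and paths are hypothetical, and the exact bulkload API differs slightly between HBase versions.

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Admin;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.RegionLocator;
import org.apache.hadoop.hbase.client.Table;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.HFileOutputFormat2;
import org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles;
import org.apache.hadoop.hbase.util.Bytes;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class BulkLoadJob {

  // Turns one "rowkey<TAB>value" input line into a Put (assumes well-formed input).
  static class ToPutMapper extends Mapper<LongWritable, Text, ImmutableBytesWritable, Put> {
    @Override
    protected void map(LongWritable offset, Text line, Context ctx)
        throws IOException, InterruptedException {
      String[] parts = line.toString().split("\t", 2);
      byte[] row = Bytes.toBytes(parts[0]);
      Put put = new Put(row);
      put.addColumn(Bytes.toBytes("f"), Bytes.toBytes("v"), Bytes.toBytes(parts[1]));
      ctx.write(new ImmutableBytesWritable(row), put);
    }
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    Path input = new Path("/data/user_features/input");       // hypothetical input
    Path hfiles = new Path("/tmp/user_features_hfiles");      // hypothetical staging dir
    TableName tableName = TableName.valueOf("user_features"); // hypothetical table

    try (Connection conn = ConnectionFactory.createConnection(conf);
         Admin admin = conn.getAdmin();
         Table table = conn.getTable(tableName);
         RegionLocator locator = conn.getRegionLocator(tableName)) {

      Job job = Job.getInstance(conf, "user-features-bulkload");
      job.setJarByClass(BulkLoadJob.class);
      job.setMapperClass(ToPutMapper.class);
      job.setMapOutputKeyClass(ImmutableBytesWritable.class);
      job.setMapOutputValueClass(Put.class);
      job.setInputFormatClass(TextInputFormat.class);
      FileInputFormat.addInputPath(job, input);
      FileOutputFormat.setOutputPath(job, hfiles);

      // Wires in the reducer, total-order partitioner and HFileOutputFormat2 so
      // that the generated HFiles line up with the table's current regions.
      HFileOutputFormat2.configureIncrementalLoad(job, table, locator);

      if (!job.waitForCompletion(true)) {
        System.exit(1);
      }

      // Moves the finished HFiles into the regions; the regionservers' write path
      // (WAL, memstore) is never touched.
      new LoadIncrementalHFiles(conf).doBulkLoad(hfiles, admin, table, locator);
    }
  }
}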
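
Sketch for note #21 (MapReduce over snapshots): a job that scans a table snapshot directly from the shared HDFS layer via TableSnapshotInputFormat, so heavy analytical reads never touch the live regionservers. The snapshot name, restore directory and output path are hypothetical.

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil;
import org.apache.hadoop.hbase.mapreduce.TableMapper;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;
import org.apache.hadoop.mapreduce.lib.reduce.LongSumReducer;

public class SnapshotScanJob {

  // Emits ("rows", 1) per row; a real pipeline would do its aggregation here.
  static class RowCountMapper extends TableMapper<Text, LongWritable> {
    private static final Text ROWS = new Text("rows");
    private static final LongWritable ONE = new LongWritable(1L);

    @Override
    protected void map(ImmutableBytesWritable key, Result value, Context context)
        throws IOException, InterruptedException {
      context.write(ROWS, ONE);
    }
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    Job job = Job.getInstance(conf, "scan-over-snapshot");
    job.setJarByClass(SnapshotScanJob.class);

    Scan scan = new Scan();
    scan.setCacheBlocks(false); // block cache is irrelevant for a one-pass MR scan

    // Reads the snapshot's HFiles directly from HDFS, so the live HBase cluster
    // sees no read load. The snapshot name and restore directory are made up.
    TableMapReduceUtil.initTableSnapshotMapperJob(
        "user_features_snapshot", scan, RowCountMapper.class,
        Text.class, LongWritable.class, job,
        true, new Path("/tmp/snapshot-restore"));

    job.setReducerClass(LongSumReducer.class); // sums the per-row 1s into a row count
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(LongWritable.class);
    job.setOutputFormatClass(TextOutputFormat.class);
    FileOutputFormat.setOutputPath(job, new Path("/tmp/snapshot-scan-output"));

    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}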
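
Sketch for note #31 (hierarchical configs): one simple way to get the "sane base plus per-application overrides" behavior is to layer Hadoop Configuration resources, since properties added later override earlier ones. The file paths and the property shown are hypothetical; this is only a sketch of the idea, not Rocketfuel's actual config system.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;

public class LayeredHBaseConfig {

  public static Configuration forApp(String appConfigDir) {
    Configuration conf = HBaseConfiguration.create();           // HBase + Hadoop defaults
    conf.addResource(new Path("conf/base/hbase-site.xml"));     // shared, sane base settings
    conf.addResource(new Path(appConfigDir, "hbase-site.xml")); // app-specific overrides win
    return conf;
  }

  public static void main(String[] args) {
    Configuration conf = forApp("conf/lsv-user-features");
    // Because later resources override earlier ones, the app file only needs to
    // list what differs from the base, e.g. block cache or handler counts.
    System.out.println(conf.get("hfile.block.cache.size"));
  }
}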
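
Sketch for note #35: the general shape of a custom Hadoop metrics2 sink that forwards HBase metrics to OpenTSDB over its line-based "put" protocol and tags each datapoint with the application name read from the sink configuration. This is an illustration of the approach, not Rocketfuel's reporter; the property names, metric naming and error handling are assumptions, and a production sink would keep a persistent connection and buffer writes.

import java.io.IOException;
import java.io.PrintWriter;
import java.net.Socket;

import org.apache.commons.configuration.SubsetConfiguration;
import org.apache.hadoop.metrics2.AbstractMetric;
import org.apache.hadoop.metrics2.MetricsRecord;
import org.apache.hadoop.metrics2.MetricsSink;
import org.apache.hadoop.metrics2.MetricsTag;

public class OpenTsdbSink implements MetricsSink {
  private String tsdHost;
  private int tsdPort;
  private String appName; // e.g. "lsv-user-features", passed in via config

  @Override
  public void init(SubsetConfiguration conf) {
    // Hypothetical properties set in hadoop-metrics2-hbase.properties, e.g.
    //   hbase.sink.opentsdb.class=com.example.metrics.OpenTsdbSink
    //   hbase.sink.opentsdb.host=tsd.example.com
    //   hbase.sink.opentsdb.appname=lsv-user-features
    tsdHost = conf.getString("host", "localhost");
    tsdPort = conf.getInt("port", 4242);
    appName = conf.getString("appname", "unknown");
  }

  @Override
  public void putMetrics(MetricsRecord record) {
    long tsSeconds = record.timestamp() / 1000;
    StringBuilder tags = new StringBuilder(" app=").append(appName);
    for (MetricsTag tag : record.tags()) {
      if (tag.value() != null && !tag.value().isEmpty()) {
        tags.append(' ').append(tag.name()).append('=').append(tag.value());
      }
    }
    // One OpenTSDB "put" line per metric in the record; a real sink would also
    // sanitize tag values and reuse a long-lived connection.
    try (Socket socket = new Socket(tsdHost, tsdPort);
         PrintWriter out = new PrintWriter(socket.getOutputStream(), true)) {
      for (AbstractMetric metric : record.metrics()) {
        String name = record.context() + "." + metric.name().replace(' ', '_');
        out.println("put " + name + " " + tsSeconds + " " + metric.value() + tags);
      }
    } catch (IOException e) {
      // A real sink would buffer and retry; dropping keeps the sketch short.
    }
  }

  @Override
  public void flush() {
    // No-op: this sketch writes synchronously in putMetrics().
  }
}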