HBase Disaster Recovery Solution
at Huawei
Ashish Singhi
About.html
• Senior Technical Leader at Huawei
• Around 6 years of experience in Big Data related projects
• Apache HBase Committer
Agenda
• Why Disaster Recovery?
• Backup Vs Disaster Recovery
• HBase Disaster Recovery
• Solution
• Miscellaneous
• Future Work
Why Disaster Recovery?
Cost of Downtime
Backup Vs Disaster Recovery
Two different problems and solutions

                   Backup                        Disaster Recovery
Process            Archive items to cold media   Replicate to secondary site
Infrastructure     Medium level                  Duplicate of active cluster (high level)
Cost               Affordable                    Expensive
Restore process    One to few at a time          One to everything
Restore time       Slow                          Fast
Production usage   Common                        Rare
HBase Disaster Recovery
• HBase disaster recovery is based on replication, which mirrors data across a network in real time.
• The technology is used to move data from a local source location to one or more target locations.
• Replication over WAN has become an ideal technology for disaster recovery, preventing data loss in the event of a failure.
Deployment Strategies
Active – Standby Cluster

[Diagram] Two HBase clusters linked by replication:
• Active Cluster (ZooKeeper /hbase/clusterState: active) serves read and write client requests
• Standby Cluster (ZooKeeper /hbase/clusterState: standby) serves only read client requests
• Data written to the active cluster is replicated to the standby cluster
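The role a cluster plays is driven by the cluster-state flag kept in its ZooKeeper. As a minimal sketch (not HBase code; names and paths are illustrative), a client or gateway could route operations by that flag:

```python
# Illustrative sketch (not HBase code): decide which operations a cluster
# accepts based on the cluster-state flag kept under /hbase/clusterState
# ("active" or "standby") in its ZooKeeper.

def allowed_operations(cluster_state: str) -> set:
    """Active clusters take reads and writes; standby clusters only reads."""
    if cluster_state == "active":
        return {"read", "write"}
    if cluster_state == "standby":
        return {"read"}
    raise ValueError(f"unknown cluster state: {cluster_state}")

# A client (or gateway) routes writes to whichever cluster is active.
clusters = {"cluster-a": "active", "cluster-b": "standby"}
write_target = next(name for name, state in clusters.items()
                    if "write" in allowed_operations(state))
print(write_target)  # cluster-a
```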
Replication

[Diagram] WAL-based replication pipeline:
• On each source-cluster Region Server, the Replication Source Manager runs a Replication Source/End Point per peer
• Replication state is tracked in ZooKeeper under …/peers/, …/rs/ and …/hfile-refs/
• WAL entries are read, batched and shipped to a Replication Sink on a peer-cluster Region Server, which applies them to the table; bulk loaded data is replicated as well
• Each peer cluster can be configured with its own tableCfs (the tables/column families to replicate)
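The per-peer tableCfs configuration means each replication source only ships the edits its peer cares about, in batches. A minimal sketch of that filtering-and-batching idea (not HBase code; the data shapes are invented for illustration):

```python
# Illustrative sketch (not HBase code): a replication source ships WAL
# entries to a peer in batches, keeping only the tables/column families
# (tableCfs) configured for that peer.

def filter_for_peer(wal_entries, table_cfs):
    """Keep entries whose (table, cf) is configured for the peer.
    table_cfs maps table name -> set of CFs, or None meaning all CFs."""
    kept = []
    for table, cf, row in wal_entries:
        if table in table_cfs:
            cfs = table_cfs[table]
            if cfs is None or cf in cfs:
                kept.append((table, cf, row))
    return kept

def make_batches(entries, size):
    """Ship entries in fixed-size batches, as a replication source would."""
    return [entries[i:i + size] for i in range(0, len(entries), size)]

wal = [("t1", "cf1", "r1"), ("t1", "cf2", "r2"), ("t2", "cf1", "r3")]
peer_cfg = {"t1": {"cf1"}, "t2": None}   # replicate t1:cf1 and all of t2
batches = make_batches(filter_for_peer(wal, peer_cfg), 2)
print(batches)  # [[('t1', 'cf1', 'r1'), ('t2', 'cf1', 'r3')]]
```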
Sync DDL Operations
• Synchronize the table properties across clusters
• Any change in the source cluster reflects immediately in the peer clusters
• Does not break the replication
• An additional option with the DDL command to sync
• Internally syncs those changes to the peer clusters
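The idea above can be sketched as follows (illustrative Python, not HBase code; the schema dictionaries and the `sync_peers` flag simply stand in for the DDL-command option the slide mentions):

```python
# Illustrative sketch (not HBase code): mirror a DDL change (an altered
# table property) from the source cluster's schema to every peer cluster,
# so schemas stay in sync and replication does not break.

def alter_table(schemas, table, prop, value, sync_peers=True):
    """schemas maps cluster name -> {table -> {property -> value}}.
    'sync_peers' stands in for the extra option on the DDL command."""
    schemas["source"][table][prop] = value
    if sync_peers:
        for cluster, tables in schemas.items():
            if cluster != "source" and table in tables:
                tables[table][prop] = value

schemas = {
    "source": {"t1": {"TTL": 86400}},
    "peer1":  {"t1": {"TTL": 86400}},
}
alter_table(schemas, "t1", "TTL", 3600)
print(schemas["peer1"]["t1"]["TTL"])  # 3600
```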
Sync Security related Data
• Synchronize security-related HBase data across the clusters
• Any update to the source cluster's ACL, Quota or Visibility Labels table reflects immediately in the peer clusters
• A custom WAL entry filter is added in replication for this
• Does not break security for HBase data access
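A WAL entry filter of this kind boils down to a predicate over the table an entry targets. A loose sketch (illustrative Python, not HBase's `WALEntryFilter` interface; the set of system table names is an assumption for the example):

```python
# Illustrative sketch (not HBase code): a custom WAL entry filter that
# lets edits to the security system tables (ACL, quota, visibility
# labels) pass through replication.

SECURITY_TABLES = {"hbase:acl", "hbase:quota", "hbase:labels"}  # assumed names

def security_filter(entry):
    """Return the entry if it targets a security table, else None
    (loosely mimicking a WAL-entry-filter contract)."""
    return entry if entry["table"] in SECURITY_TABLES else None

entries = [{"table": "hbase:acl", "row": "user1"},
           {"table": "t1", "row": "r1"}]
replicated = [e for e in entries if security_filter(e) is not None]
print(replicated)  # [{'table': 'hbase:acl', 'row': 'user1'}]
```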
Read Only Cluster
• Enable a cluster to serve only read requests
• A coprocessor based solution
• Standby cluster will serve all the read requests
• Standby cluster will serve write requests only if the request comes from:
• A super user
• An accepted IP (from a configured list)
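The write gate a coprocessor-style hook enforces on the standby cluster can be sketched like this (illustrative Python, not HBase coprocessor code; user and IP values are invented):

```python
# Illustrative sketch (not HBase code): the write-gate check a
# coprocessor-style hook could run on a standby cluster, rejecting writes
# unless the caller is a super user or comes from an accepted IP.

SUPER_USERS = {"hbase"}        # assumed super-user list
ACCEPTED_IPS = {"10.0.0.5"}    # assumed allowlist

def write_allowed(user, client_ip, cluster_state="standby"):
    """Active clusters accept all writes; standby clusters gate them."""
    if cluster_state == "active":
        return True
    return user in SUPER_USERS or client_ip in ACCEPTED_IPS

print(write_allowed("hbase", "192.168.1.1"))   # True  (super user)
print(write_allowed("alice", "10.0.0.5"))      # True  (accepted IP)
print(write_allowed("alice", "192.168.1.1"))   # False (rejected)
```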
Cluster Recovery

[Diagram] On recovery/failover the clusters swap roles:
• The former standby cluster becomes active (/hbase/clusterState: standby → active) and now serves read and write client requests
• The former active cluster becomes standby (/hbase/clusterState: active → standby) and serves only read client requests
• Replication then runs from the new active cluster to the new standby cluster
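The role swap amounts to flipping the two cluster-state flags in ZooKeeper. A minimal sketch (illustrative Python, not HBase or ZooKeeper client code):

```python
# Illustrative sketch (not HBase code): on failover, flip the
# /hbase/clusterState flags so the standby takes over writes and the
# (recovered) old active serves only reads.

def failover(states, old_active, old_standby):
    """states maps cluster name -> its '/hbase/clusterState' value."""
    states[old_active] = "standby"
    states[old_standby] = "active"
    return states

states = {"cluster-a": "active", "cluster-b": "standby"}
failover(states, "cluster-a", "cluster-b")
print(states)  # {'cluster-a': 'standby', 'cluster-b': 'active'}
```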
Miscellaneous
• Increased the default replication.source.ratio to 0.5
• Adaptive hbase.replication.rpc.timeout
• Active cluster HDFS server configurations are maintained in the standby cluster's ZooKeeper for bulk loaded data replication
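For context, replication.source.ratio bounds what fraction of the peer cluster's region servers a source will ship edits to; raising the default to 0.5 makes half of them eligible sinks. A sketch of that selection (illustrative Python, not HBase code):

```python
# Illustrative sketch (not HBase code): replication.source.ratio limits
# how many peer-cluster region servers a source picks as sinks; with the
# default raised to 0.5, roughly half the region servers are eligible.
import math
import random

def choose_sinks(peer_region_servers, ratio=0.5, seed=None):
    """Pick ceil(ratio * n) sink region servers at random."""
    n = max(1, math.ceil(ratio * len(peer_region_servers)))
    rng = random.Random(seed)
    return rng.sample(peer_region_servers, n)

sinks = choose_sinks([f"rs{i}" for i in range(10)], ratio=0.5, seed=42)
print(len(sinks))  # 5
```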
Future work
• Move HBase Replication tracking from ZooKeeper to
HBase table (HBASE-15867)
• Copy bulk loaded data to peer with data locality
• Replication data network bandwidth throttling.
Thank You!
Email: ashishsinghi@apache.org
Twitter: ashishsinghi89

Presented at hbaseconasia2017.
