Availability and Integrity in hadoop (Strata EU Edition)

Data Availability and Integrity
in Apache Hadoop
Steve Loughran
@steveloughran
stevel@apache.org

© Hortonworks Inc. 2012

Questions Hadoop Ops teams ask

• Can Hadoop keep my data safe?

• Can Hadoop keep my data available?

• What happens when things go wrong?

• Can you improve this?

Page 2

Can Hadoop Keep My Data Safe?
Switch

ToR Switch ToR Switch ToR Switch
file

block1
Name block2 DataNode DataNode
Node block3
…

DataNode DataNode

2ary
Name DataNode DataNode
Node

(Job
Tracker) DataNode DataNode

Page 3

Replication handles data integrity
• CRC32 checksum per 512 bytes
• Verified across datanodes on write
• Verified on all reads
• Background verification of all blocks (~weekly)
• Corrupt blocks re-replicated
• All replicas corrupt  operations team
intervention

2009: Yahoo! lost 19 out of 329M blocks on 20K
servers –bugs now fixed
Page 4

Harder: Switch failure
Switch

ToR Switch ToR Switch ToR Switch
file

block1
Name block2 DataNode DataNode
Node block3
…

DataNode DataNode

2ary
Node

(Job

Page 5

Bonded 1 GbE >1 switch
Avoids hardware problems, not software

Page 6

NameNode failure rare but costs
ToR Switch

1. Try to reboot/restart
NN IP

2. Bring up new Shared storage for
NameNode server Name filesystem image and
NN IP Node
-with same IP journal ("edit log")
-or restart DataNodes

2ary
Name (Secondary NN receives
Node
streamed journal and checkpoints
filesystem image)

Yahoo!: 22 NameNode failures on 25 clusters in 18 months = .99999 availability

Page 7

What to improve

• Address costs of NameNode failure in Hadoop 1

• Add live NN failover (HDFS 2.0)

• Eliminate shared storage (HDFS 2.x)

• Add resilience to the entire stack

Page 8

Full Stack HA
add resilience to planned/unplanned outages of
layers underneath

9

HA in Hadoop 1 (HDP1)
Use existing HA clustering technologies to add
cold failover of key manager services:
VMWare vSphere HA
RedHat HA Linux

10

RedHat HA Linux
ToR Switches

NN IP Name
DataNode DataNode
Node
IP1

NN IP Name
DataNode DataNode
Node
IP2

2NN IP 2ary
IP3 Node

JT IP
(Job
IP4

HA Linux: heartbeats & failover

Page 11

Linux HA Implementation

• Replace init.d script with “Resource Agent” script

• Probe deep state of HDFS, Job Tracker

• Detection & handling of hung process hard

• Test in virtual + physical environments

• Testing with physical clusters

Page 12

Yes, but does it work?

public void testKillHungNN() {
assertRestartsHDFS {
nnServer.kill(19,
"/var/run/hadoop/hadoop-hadoop-namenode.pid")
}
}

Groovy JUnit tests
“Tools of Chaos” to break remote hosts and
infrastructures

Page 13

And how long does it take?

Small cluster: 1-3 minutes

Medium Cluster: 2-4 Minutes

Where Medium == A Petabyte or less

Cold Failover is good enough for small/medium clusters
14

“Full Stack”: IPC client
Configurable retry & time to block
ipc.client.connect.max.retries
dfs.client.retry.policy.enabled

1. Blocking works for most clients (HBase, Pig…)

2. Failure-aware applications can tune/disable

3. Job tracker added “Safe Mode” for outages

Page 15

Putting it all together: Demo

Page 16

HA in Hadoop HDFS 2

Page 17

Hadoop 2.0 HA

Zoo-
Keeper Standby
Active IP1
Active
Failure- DataNode
Controller NN

Zoo-
Keeper

Active
Standby Standby
Active
Failure- DataNode
Controller NN IP2
Zoo-
Keeper

Page 18

When will HDFS 2 be ready?
Moving from alpha to beta ... production in 2013

Download and play with early releases!

Page 19

Moving forward
• Retry policies for all remote client
protocols/libraries in the stack.

• Dynamic (zookeeper?) service lookup

• YARN needs HA of Resource Manager, individual
MR clusters

• “No more Managers”

Page 20

Summary
• HDFS handles corruption and partial loss of data
today

• Hadoop 1 now has cold failover for small/medium
clusters

• Hadoop 2 adding hot failover

• Full Stack HA for resilience to outages

Page 21

Single Points of Failure
There's always a SPOF

Q. How do you find it?

A. It finds you

Page 22

Availability and Integrity in hadoop (Strata EU Edition)

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to Availability and Integrity in hadoop (Strata EU Edition)

Similar to Availability and Integrity in hadoop (Strata EU Edition) (20)

More from Steve Loughran

More from Steve Loughran (20)

Recently uploaded

Recently uploaded (20)

Availability and Integrity in hadoop (Strata EU Edition)

Editor's Notes