Fault Tolerance in HDFS
By: Emad Soltani Nezhad
May 2018
Outline:
• What is HDFS?
• Data distribution using HDFS
• HDFS Architecture
• Fault Tolerance in HDFS
What is HDFS?
• Hadoop Distributed File System
• Written in Java
• Open source
• The Swiss Army knife of the 21st century
Data distribution using HDFS
[Diagram: Big Data → HDFS → 64 MB blocks (64 MB, 64 MB, 64 MB) → Server1, Server2, Server3]
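The diagram above can be sketched in a few lines of code. This is a minimal illustration, not HDFS's actual implementation: it assumes the 64 MB block size shown on the slide, and the round-robin assignment and both function names (`split_into_blocks`, `assign_round_robin`) are hypothetical simplifications chosen for clarity.

```python
BLOCK_SIZE = 64 * 1024 * 1024  # 64 MB, the block size shown on the slide


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return (offset, length) pairs that together cover file_size bytes."""
    blocks = []
    offset = 0
    while offset < file_size:
        # The last block may be shorter than block_size.
        length = min(block_size, file_size - offset)
        blocks.append((offset, length))
        offset += length
    return blocks


def assign_round_robin(blocks, servers):
    """Map each block index to a server by cycling through the server list.

    Round-robin is an illustrative assumption; real placement also
    considers replication and rack topology.
    """
    return {i: servers[i % len(servers)] for i in range(len(blocks))}


# A hypothetical 200 MB file spread over the three servers from the slide.
blocks = split_into_blocks(200 * 1024 * 1024)
placement = assign_round_robin(blocks, ["Server1", "Server2", "Server3"])
```

Here the 200 MB file becomes three full 64 MB blocks plus one shorter final block, and the fourth block wraps around to Server1.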
Data distribution using HDFS
• The Mont Blanc panorama is the largest photograph in the world
• Panorama of 70,000 photos
• Image size: 45 TB
• Distributed and processed by an Apache Hadoop cluster of 20 nodes within 10 days
• May 2015
HDFS Architecture
Fault Tolerance in HDFS
NameNode
DataNode
Heartbeat
Dead
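The heartbeat idea on this slide can be sketched as follows: each DataNode reports in periodically, and the NameNode treats a DataNode as dead once it has been silent for longer than a timeout. The class name `HeartbeatMonitor`, its methods, and the 30-second timeout are hypothetical choices for illustration, not HDFS classes or defaults. Time is passed in explicitly so the example is deterministic.

```python
class HeartbeatMonitor:
    """Toy model of a NameNode tracking DataNode heartbeats."""

    def __init__(self, timeout_seconds):
        self.timeout_seconds = timeout_seconds
        self.last_seen = {}  # DataNode id -> time of last heartbeat

    def record_heartbeat(self, datanode_id, now):
        """Remember when this DataNode last reported in."""
        self.last_seen[datanode_id] = now

    def dead_nodes(self, now):
        """Return the DataNodes whose last heartbeat is older than the timeout."""
        return sorted(
            node
            for node, seen in self.last_seen.items()
            if now - seen > self.timeout_seconds
        )


# Illustrative timeline in seconds; the timeout value is an assumption.
monitor = HeartbeatMonitor(timeout_seconds=30)
monitor.record_heartbeat("datanode1", now=0)
monitor.record_heartbeat("datanode2", now=0)
monitor.record_heartbeat("datanode1", now=25)  # datanode2 stays silent
dead = monitor.dead_nodes(now=40)
```

Once a node is marked dead, the blocks it held are under-replicated, which is what triggers re-replication onto the surviving DataNodes.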
Fault Tolerance in HDFS
Rack Awareness
Rack1-Cluster1 Rack2-Cluster2
Data Center1
Data Center2
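Rack awareness matters mainly when choosing where replicas go. The sketch below follows the commonly described HDFS default for three replicas: the first on the writer's node, the second on a node in a different rack, the third on another node in that same remote rack. It is a simplified, deterministic illustration; the function name `place_replicas` and the node names are hypothetical, and real HDFS picks nodes with randomness and load checks that are omitted here.

```python
def place_replicas(writer_node, topology):
    """Choose three nodes for a block's replicas.

    topology maps a rack name to the list of nodes in that rack.
    Assumes at least one other rack has two or more nodes; otherwise
    next() raises StopIteration.
    """
    rack_of = {node: rack for rack, nodes in topology.items() for node in nodes}
    local_rack = rack_of[writer_node]

    # First replica: the node that is writing the data.
    first = writer_node

    # Second and third replicas: two different nodes on one remote rack.
    remote_rack = next(
        rack
        for rack in sorted(topology)
        if rack != local_rack and len(topology[rack]) >= 2
    )
    second, third = topology[remote_rack][0], topology[remote_rack][1]
    return [first, second, third]


# Hypothetical two-rack layout matching the slide's Rack1 and Rack2.
topology = {"Rack1": ["dn1", "dn2"], "Rack2": ["dn3", "dn4"]}
replicas = place_replicas("dn1", topology)
```

With this layout the block survives the loss of either rack, because each rack holds at least one copy.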
Fault Tolerance in HDFS
Rack1-Cluster1
Rack2-Cluster2
Data Center: Large Cluster
HDFS Federation
Rack3-Cluster3
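In HDFS Federation, several independent NameNodes each manage one part of the namespace while sharing the same pool of DataNodes. One way to picture this is a client-side mount table that maps a path prefix to the NameNode responsible for it. The table contents, the NameNode names, and the function `resolve_namenode` below are hypothetical; this is a sketch of the routing idea, not the HDFS client API.

```python
# Hypothetical mount table: path prefix -> NameNode that owns that namespace.
MOUNT_TABLE = {
    "/user": "namenode1",
    "/data": "namenode2",
    "/logs": "namenode3",
}


def resolve_namenode(path, mount_table=MOUNT_TABLE):
    """Return the NameNode whose mount point is the longest prefix of path."""
    matches = [
        prefix
        for prefix in mount_table
        # Match whole path components only, so "/database" does not match "/data".
        if path == prefix or path.startswith(prefix.rstrip("/") + "/")
    ]
    if not matches:
        raise KeyError(f"no NameNode mounted for {path}")
    return mount_table[max(matches, key=len)]


owner = resolve_namenode("/data/images/montblanc.jpg")
```

Because each NameNode is responsible only for its own prefix, losing one of them makes that part of the namespace unavailable but leaves the others usable.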
Fault Tolerance in HDFS
Single Point of Failure
Fault Tolerance in HDFS
NameNode
Redundant
Heartbeat
One Cluster
High Availability
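High availability removes the single point of failure by keeping a redundant standby NameNode ready to take over. A minimal sketch of the failover step is shown below, assuming a health signal derived from heartbeats. The class `NameNode`, the function `fail_over`, and the state names are hypothetical. The key design point is that the old active node is fenced before the standby is promoted, so that two NameNodes never act as active at the same time.

```python
class NameNode:
    """Toy NameNode with a role: 'active', 'standby', or 'fenced'."""

    def __init__(self, name, state):
        self.name = name
        self.state = state


def fail_over(active, standby, active_is_healthy):
    """Return the NameNode that should serve clients after a health check."""
    if active_is_healthy:
        return active
    # Fence the failed node first so it cannot keep writing (avoids split brain).
    active.state = "fenced"
    # Only then promote the standby.
    standby.state = "active"
    return standby


nn1 = NameNode("nn1", "active")
nn2 = NameNode("nn2", "standby")
current = fail_over(nn1, nn2, active_is_healthy=False)
```

After the call, `nn2` is the active NameNode and `nn1` is fenced. Fencing of this kind is sometimes referred to as STONITH.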

Editor's Notes

  • #14 STONITH: Shoot The Other Node In The Head