HDFS: Hadoop Distributed Filesystem

© Hortonworks Inc. 2013
HDFS: Hadoop Distributed FS
Steve Loughran, Hortonworks
stevel@hortonworks.com
@steveloughran
Big Data workshop, June 2013

What is a Filesystem?
• Durable store of data: write, read, probe, delete
• Metadata for organisation: locate, change
• A conceptual model for humans
• API for programmatic access to data & metadata

Unix is the model & POSIX its API
• Directories and files: directories have children, files have data
• API: open, read, seek, write, stat, rename, unlink, flock
• Consistency: all sync()'d changes are globally visible
• Atomic metadata operations: mv, rm, mkdir
Features are also constraints
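
As a rough illustration of the POSIX calls listed above, this Python sketch exercises open, write, seek, read, stat, rename and unlink through the standard `os` module on a local filesystem. The file names and the temporary directory are invented for the example, flock is left out, and it shows the local Unix model only, not HDFS.

```python
import os
import tempfile


def posix_roundtrip(workdir):
    """Exercise the basic POSIX file calls inside workdir; return what was observed."""
    src = os.path.join(workdir, "data.tmp")   # hypothetical file names
    dst = os.path.join(workdir, "data.log")

    fd = os.open(src, os.O_CREAT | os.O_RDWR, 0o600)   # open
    try:
        os.write(fd, b"hello filesystem")              # write 16 bytes
        os.fsync(fd)                                   # push the change to storage
        os.lseek(fd, 6, os.SEEK_SET)                   # seek to byte 6
        tail = os.read(fd, 10)                         # read the next 10 bytes
    finally:
        os.close(fd)

    size = os.stat(src).st_size                        # stat
    os.rename(src, dst)                                # mv: an atomic metadata operation
    renamed = os.path.exists(dst) and not os.path.exists(src)
    os.unlink(dst)                                     # rm
    return tail, size, renamed, os.path.exists(dst)


with tempfile.TemporaryDirectory() as d:
    result = posix_roundtrip(d)
    print(result)
```

Each of these calls is a feature an application may rely on, which is why each one is also a constraint on any filesystem that wants to offer the same API at larger scale.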

Relax constraints → scale and availability
[Figure: scale and availability plotted against distance from the Unix filesystem model & API]
• ext4
• NFS: +cross host; relaxes locks, sync
• HDFS: +data locality; relaxes (seek+write), locks
• S3: +cross-site; relaxes append, metadata ops, consistency

HDFS: goals
• Store petabytes of web data: logs, web snapshots
• Keep per-node costs down to afford more nodes
• Commodity x86 servers, storage (SAS), GbE LAN
• Open source software: O(1) costs
• O(1) operations
• Accept failure as background noise
• Support computation in each server
Written for location-aware applications: MapReduce, Pregel/Giraph & others that can tolerate partial failures

HDFS: what
• Open Source: hadoop.apache.org
• Java code on Linux, Unix, Windows
• Replication rather than RAID
– break file into blocks
– store across servers and racks
– delivers bandwidth and more locations for work
• Background work handles failures
– replication of under-replicated blocks
– rebalancing of unbalanced servers
– checksum verification of stored files
Location data for work schedulers
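
The block and replica ideas above can be sketched in a few lines of Python. This is a toy model under stated assumptions, not the HDFS implementation: the 128 MiB block size and replication factor of 3 are assumed values, the `topology` map of racks to DataNodes is hypothetical, and the placement rule (one replica in one rack, the remainder in the next rack) is a deliberately simplified stand-in for a real rack-aware policy. It needs at least two racks, each with at least two nodes.

```python
BLOCK_SIZE = 128 * 1024 * 1024   # assumed block size; configurable in a real cluster
REPLICATION = 3                  # assumed replication factor


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return (offset, length) pairs covering a file of file_size bytes."""
    return [(off, min(block_size, file_size - off))
            for off in range(0, file_size, block_size)]


def place_replicas(block_index, topology, replication=REPLICATION):
    """Pick `replication` distinct nodes for one block, spanning two racks.

    Simplified rule for illustration only: one replica in a "first" rack,
    the remaining replicas on consecutive nodes of the next rack.
    """
    racks = sorted(topology)
    first_nodes = topology[racks[block_index % len(racks)]]
    second_nodes = topology[racks[(block_index + 1) % len(racks)]]
    chosen = [first_nodes[block_index % len(first_nodes)]]
    start = block_index % len(second_nodes)
    for i in range(replication - 1):
        chosen.append(second_nodes[(start + i) % len(second_nodes)])
    return chosen


def block_locations(file_size, topology):
    """Location data a work scheduler could use to run tasks next to the data."""
    return [{"offset": off, "length": length,
             "nodes": place_replicas(i, topology)}
            for i, (off, length) in enumerate(split_into_blocks(file_size))]


# Hypothetical cluster: two racks of four DataNodes each.
topology = {
    "rack1": ["dn1", "dn2", "dn3", "dn4"],
    "rack2": ["dn5", "dn6", "dn7", "dn8"],
}
file_size = 300 * 1024 * 1024    # a 300 MiB file
blocks = split_into_blocks(file_size)
locations = block_locations(file_size, topology)
for loc in locations:
    print(loc)
```

Because every block lives on several nodes in more than one rack, a scheduler has several candidate hosts for each piece of work, and a reader has several sources of bandwidth.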

[Figure: two racks of four DataNodes, each rack behind a ToR switch and joined by a switch; a third ToR switch connects the NameNode, the secondary NameNode and the (JobTracker). A file is shown split into block1, block2, block3, …]
Hadoop HDFS: replication is the key
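
To make the role of replication concrete, here is a minimal Python sketch of what happens when a DataNode is lost: blocks with too few live replicas are detected and copied to other live nodes. The block map, node names and replication factor of 3 are hypothetical, and target selection here ignores racks and load, which a real system would not.

```python
def under_replicated(block_map, live_nodes, replication=3):
    """Return {block: missing_count} for blocks with fewer live replicas than wanted."""
    missing = {}
    for block, nodes in block_map.items():
        live = [n for n in nodes if n in live_nodes]
        if len(live) < replication:
            missing[block] = replication - len(live)
    return missing


def re_replicate(block_map, live_nodes, replication=3):
    """Return a new block map with lost replicas copied to other live nodes."""
    repaired = {}
    for block, nodes in block_map.items():
        live = [n for n in nodes if n in live_nodes]
        if not live:
            # Every replica is gone: there is nothing left to copy from.
            repaired[block] = []
            continue
        candidates = [n for n in sorted(live_nodes) if n not in live]
        needed = max(0, replication - len(live))
        repaired[block] = live + candidates[:needed]
    return repaired


# Hypothetical state: three blocks, eight DataNodes, then dn6 fails.
block_map = {
    "block1": ["dn1", "dn5", "dn6"],
    "block2": ["dn6", "dn2", "dn3"],
    "block3": ["dn3", "dn7", "dn8"],
}
live_nodes = {"dn1", "dn2", "dn3", "dn4", "dn5", "dn7", "dn8"}

before = under_replicated(block_map, live_nodes)
repaired = re_replicate(block_map, live_nodes)
after = under_replicated(repaired, live_nodes)
print(before, repaired, after)
```

No data is lost as long as at least one replica of each block survives, which is why a single node failure can be treated as background noise rather than an emergency.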

Some of the largest filesystems ever
e.g. Facebook Prineville:
45 PB in one cluster, PUE 1.05

HDFS: Enterprise Checklist
• Auth: Kerberos
• Snapshots (in HDFSv2)
• NFS (in HDFSv2)
• HA metadata server, uses ZooKeeper

HDFS: what next?
• Exabytes in a single cluster
• Cross-cluster, cross-site: what constraints can be relaxed here?
• More efficient cold-data storage
• Evolving application needs
• Networking: 2x1GbE, 4x1GbE, 10GbE
• Power budgets

HDD → HDD + SSD → SSD
• New solid state storage technologies emerging
• When will HDDs go away?
• How to take advantage of mixed storage?
• SSD retains the HDD metaphor, hides the details (access bus, wear levelling)
We need to give the OS and DFS control of the storage, and work with the application
