Hadoop
Edit by Cassell Hsu
2013.04.19
Hadoop
Slave
Master

Slave
Slave
Hadoop
• Master
• NameNode
• JobTracker
• SecondaryNameNode
NameNode
• Where is NameNode?
• Master
• HDFS(Hadoop Distributed File System)
• What is NameNode?
• 資料之位置資訊 ??
• 資料之屬性
NameNode
• 位置資訊 ?
• 所有資料皆存放在 – DataNode
DataNode
• What is DataNode
• 存放資料
• Where is DataNode
• HDFS
• Slaves (and Master)
DataNode

User

128
Mb

DataNode

NameNode
A
64Mb

A
64Mb

B
64Mb

B
64Mb

Check hdfs-site.xml

DataNode
Hadoop

Slaves

Master
NameNode

DataNode

DataNode

DataNode

DataNode
Hadoop
• Master
• NameNode
• JobTracker
• SecondaryNameNode
JobTracker
• What is JobTracker?
• 排程工作 誰來工作?
• Where is JobTracker?
• Master
JobTracker & TaskTracker
JobTracker

TaskTracker

Where

Master

Slaves

What

排程工作

執行工作
Hadoop

Slaves

Master
NameNode

DataNode

JobTracker

DataNode
TaskTracker
Hadoop
• Master
• NameNode
• JobTracker
• SecondaryNameNode
SecondaryNameNode
• What is SecondaryNameNode?
• NameNode 發生錯誤時補救
• Where is SecondaryNameNode?
• Master
Hadoop

Slaves

Master
NameNode

DataNode

JobTracker
SecondaryNode

DataNode
TaskTracker
MapReduce
NameNode
User
Task
A

HDFS

Master
Task

Reduce
Slave1

A1

Final Result
Result1
Result2
Slave2

A2
MapReduce
• 檔案切割
• Hadoop 上區塊切割
• 程式指定

Hadoop 130419075715-phpapp02(1)