Hadoop
Edited by Cassell Hsu
2013.04.19
Hadoop
[Figure: one Master connected to three Slaves.]
Hadoop
• Master
 • NameNode
 • JobTracker
 • SecondaryNameNode
NameNode
• Where is NameNode?
 • Master
 • HDFS (Hadoop Distributed File System)
• What is NameNode?
 • Location information of the data
 • Attributes of the data
NameNode

• Location information?
 • All data is stored on the DataNodes; the NameNode only records where it is
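
As a rough illustration of the two points above, the NameNode can be pictured as a lookup table from a file path to its attributes and to the DataNodes holding each block. This is only a sketch: the class names, the path, and the host names `slave1` to `slave3` are made up for the example and are not Hadoop's real data structures.

```python
from dataclasses import dataclass, field


@dataclass
class BlockInfo:
    block_id: str
    size_mb: int
    datanodes: list  # hypothetical host names that hold a copy of this block


@dataclass
class FileEntry:
    owner: str        # example attribute
    replication: int  # example attribute
    blocks: list = field(default_factory=list)


# Toy "NameNode": path -> metadata only. No file contents are stored here.
namespace = {
    "/user/demo/input.txt": FileEntry(
        owner="demo",
        replication=2,
        blocks=[
            BlockInfo("A", 64, ["slave1", "slave2"]),
            BlockInfo("B", 64, ["slave2", "slave3"]),
        ],
    )
}


def locate(path):
    """Return (block_id, datanodes) pairs for a path, as a client might ask."""
    entry = namespace[path]
    return [(b.block_id, b.datanodes) for b in entry.blocks]


print(locate("/user/demo/input.txt"))
```

The client would then read the actual bytes from the listed DataNodes, not from the NameNode.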
DataNode
• What is DataNode?
 • Stores the data
• Where is DataNode?
 • HDFS
 • Slaves (and Master)
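DataNode
[Figure: a user's 128 MB file goes through the NameNode and is split into two 64 MB blocks, A and B; each block is shown twice, spread over two DataNodes.]
Check hdfs-site.xml (block size and replication are configured there)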
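
The arithmetic in the figure can be sketched as follows. The constants mirror two settings from hdfs-site.xml: the block size (`dfs.block.size` in Hadoop 1.x, `dfs.blocksize` in later releases) and `dfs.replication`. Reading the figure as a replication factor of 2 is an assumption (Hadoop's default is 3), and the rotating placement below is a simplification, since real HDFS placement is rack-aware. The DataNode names are hypothetical.

```python
MB = 1024 * 1024
BLOCK_SIZE = 64 * MB  # mirrors dfs.block.size (64 MB, as in the figure)
REPLICATION = 2       # mirrors dfs.replication; 2 is assumed from the figure


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return the size in bytes of each block; the last block may be smaller."""
    num_blocks = -(-file_size // block_size)  # integer ceiling division
    return [min(block_size, file_size - i * block_size) for i in range(num_blocks)]


def place_replicas(num_blocks, datanodes, replication=REPLICATION):
    """Toy placement: rotate through the DataNodes for each block's copies."""
    if replication > len(datanodes):
        raise ValueError("not enough DataNodes for the requested replication")
    return {
        b: [datanodes[(b + r) % len(datanodes)] for r in range(replication)]
        for b in range(num_blocks)
    }


blocks = split_into_blocks(128 * MB)
print([size // MB for size in blocks])
print(place_replicas(len(blocks), ["datanode1", "datanode2"]))
```

With a 128 MB file this yields two 64 MB blocks, each placed on both DataNodes, which matches the layout drawn in the figure.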
Hadoop
[Figure: the Master runs the NameNode and a DataNode; each of the three Slaves runs a DataNode.]
JobTracker
• What is JobTracker?
 • Schedules the work: decides who does which job
• Where is JobTracker?
 • Master
JobTracker & TaskTracker
            JobTracker        TaskTracker
  Where     Master            Slaves
  What      Schedules work    Executes work
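
The division of labour in the table can be sketched as below. This is a toy model, not Hadoop's scheduler: the real JobTracker hands out tasks in response to TaskTracker heartbeats and prefers nodes that already hold the data, while this sketch simply goes round-robin. The function names, task names, and host names are hypothetical.

```python
from collections import defaultdict
from itertools import cycle


def schedule(tasks, tasktrackers):
    """Toy JobTracker: hand tasks to TaskTrackers in round-robin order."""
    assignment = defaultdict(list)
    for task, tracker in zip(tasks, cycle(tasktrackers)):
        assignment[tracker].append(task)
    return dict(assignment)


def run(assignment):
    """Toy TaskTrackers: each one executes the tasks it was given."""
    return {
        tracker: [f"{task} done" for task in tasks]
        for tracker, tasks in assignment.items()
    }


plan = schedule(["A1", "A2", "A3"], ["slave1", "slave2"])
print(plan)
print(run(plan))
```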
Hadoop
[Figure: the Master runs the NameNode, the JobTracker, and a DataNode; the Slaves run a DataNode and a TaskTracker.]
SecondaryNameNode
• What is SecondaryNameNode?
 • Keeps checkpoints of the NameNode's metadata, used to recover when the NameNode fails
• Where is SecondaryNameNode?
 • Master
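
One way to picture the recovery role above: the NameNode keeps a snapshot of the namespace (the fsimage) plus a log of later changes (the edit log), and the SecondaryNameNode periodically merges the two into a fresh checkpoint. The sketch below only illustrates that merge; the tuple format for log entries and the paths are invented for the example and are not HDFS's on-disk format.

```python
def apply_edits(fsimage, edits):
    """Replay logged operations on top of a namespace snapshot."""
    image = dict(fsimage)  # work on a copy, leave the old snapshot intact
    for op, path, blocks in edits:
        if op == "create":
            image[path] = blocks
        elif op == "delete":
            image.pop(path, None)
        else:
            raise ValueError(f"unknown operation: {op}")
    return image


fsimage = {"/a.txt": ["A"]}
edits = [("create", "/b.txt", ["B"]), ("delete", "/a.txt", None)]

checkpoint = apply_edits(fsimage, edits)
print(checkpoint)
```

If the NameNode's metadata is lost, the latest checkpoint is the starting point for restoring it; changes made after that checkpoint are not covered by it.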
Hadoop
[Figure: the Master runs the NameNode, the JobTracker, the SecondaryNameNode, and a DataNode; the Slaves run a DataNode and a TaskTracker.]
MapReduce
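[Figure: the User submits Task A to the Master (NameNode, HDFS); the Master hands part A1 to Slave1 and part A2 to Slave2; their outputs, Result1 and Result2, are reduced into the Final Result.]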
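
The flow in the figure can be imitated in a few lines of plain Python, using word counting as a stand-in job. This is a sketch of the idea only: it runs in one process, it is not the Hadoop API, and it leaves out the shuffle step that groups intermediate results by key. The function names and the sample input are hypothetical.

```python
from collections import Counter
from functools import reduce


def split_task(lines, num_slaves):
    """Master: cut Task A into one part per slave (A1, A2, ...)."""
    return [lines[i::num_slaves] for i in range(num_slaves)]


def map_phase(part):
    """Slave: count the words in its own part only."""
    counts = Counter()
    for line in part:
        counts.update(line.split())
    return counts


def reduce_phase(partial_results):
    """Reduce: merge the per-slave results into the final result."""
    return reduce(lambda a, b: a + b, partial_results, Counter())


task_a = ["hadoop hdfs", "hadoop mapreduce", "hdfs datanode"]
a1, a2 = split_task(task_a, 2)
result1, result2 = map_phase(a1), map_phase(a2)
final_result = reduce_phase([result1, result2])
print(dict(final_result))
```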
MapReduce

• File splitting
 • Block splitting on Hadoop
   • Specified by the program

Hadoop