Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
14,058 views

Published on

Hadoop Components and Operations Hadoop Distributed File System Unstructured Data§  Shuffle Phase - All name/value pair are sorted and grouped by their keys. Map Map Map Map§  Mapper sending the data to Reducers Map Map Map Map Map Map Map Map§  High Network Activity Map Map Map Map Map Map Map Map§  Reduce Phase – All values associates with a key are process for results, three phases Copy - get intermediate result from each data Shuffle Phase node local disk Merge - to reduce the number of files Key 1 Key 1 Key 1 Key 1 Key 1 Key 1 Key 1 Key 1 Reduce method Key 1 Key 1 Key 1 Key 1 Key 1 Key 2 Key 3 Key 4§  Output Replication Phase - Reducer replicating result to multiple nodes Highest Network Activity Reduce Reduce Reduce Reduce§  Network Activities Dependent on Workload Behavior Result/Output 17

Published in: Technology, Education
  • Be the first to comment

×