SlideShare a Scribd company logo
1 of 20
HDFS 
HADOOP DISTRIBUTED FILE SYSTEM
What is HDFS 
HDFS is a Filesystem designed for storing very large files running on 
cluster of commodity hardware 
It provides fault-tolerant storage layer for Hadoop and other sub-projects 
It is a distributed file system that provides high-throughput access to 
application data 
It stores data reliably even in case of hardware faliure
HDFS Architecture 
There are two types of nodes, works in master-slaves pattern 
ā—¦ Master (or called as Namenode) ā€“ It manages all the slave nodes and assign 
work to slaves 
ā—¦ Slave (or called as Datanode) ā€“ Actual worker nodes, which does actual task
HDFS Architecture
Distributed storage 
Data is distributed across the cluster of nodes 
Data is splitted into smaller pieces and stored across multiple nodes of 
the cluster 
In this way it provides a way to Map-Reduce to process subset of large 
data parallely on multiple nodes 
Distributed storage also provides fault-tolerance capability
ļƒ Namenode Datanode 
Two Daemons Runs for HDFS (for storage) 
ā—¦ NameNode 
ā—¦ This daemon runs on Master 
ā—¦ Namenode stores all the metadata like filename, path, number of blocks, blockIds, block 
locations, number of replicas, slave related config, etc. 
ā—¦ DataNode 
ā—¦ This daemon runs on all the slaves 
ā—¦ Datanodes stores the actual data
ļƒ Namenode Datanode
ļƒ Blocks 
Files are splitted into number of blocks 
Unlike 1 KB block size of OS filesystem, HDFSā€™s default block size is 64 
MB (which can be increased according to the requirements) 
It saves disk seek time, and other advantage is in the case of processing 
(1 Mapper processes 1 block at a time)
ļƒ Blocks
Replication 
In HDFS all the data blocks are replicated across the cluster of nodes 
Ideally one replica is present at one geographical location 
The default replication factor is 3, which can be changed according to 
the requirements. 
We can change replication factor to desired value by editing 
configuration files
ļƒ High Availability 
HDFS provides high availability of data by creating multiple replicas of 
data blocks 
If a network link or node or some hardware goes down data will be 
available from some other path 
If a hardware goes down data can be accessed from other nodes or 
other paths as we have replicated data (default 3 replicas)
ļƒ Reliable Data Storage 
Data is stored quite reliably in HDFS, this feature is related to High 
Availability. 
Since data is highly available due to replication, hence if a node or few 
nodes or some hardware crashes, the data will be stored reliably. 
We can again balance the cluster by making the replication factor to 
desirable value (by running few commands) 
In other words we can say Hadoop (HDFS) is fault tolerant
ļƒ Reliable Data Storage
ļƒ Scalability 
Two ways to scale the Filesystem 
ā—¦ Add more disks on the nodes of the cluster 
ā—¦ For this we need to edit the configuration files and add corresponding entries of newly added 
disks 
ā—¦ Horizontal Scaling 
ā—¦ Another option is to add more nodes in the cluster 
ā—¦ We can add n number of nodes in the cluster on the fly (without any downtime)
File Access & other Operations 
Programming 
CLI (Command Line Interface) 
We can perform almost all the operations which we can do on local 
filesystem 
HDFS provides different access rights rwx to ugo 
We can also browse the filesystem by browser (http://Master-IP:50070)
Read a File 
To read a file from HDFS, client needs to interact with Namenode, as 
namenode is single place where all the the metadata is stored 
Namenode provides the address of the slaves where file is stored 
Client will interact with respective datanodes to read the file 
Namenode also provide a token to the client which it shows to data 
node for authentication
Read a File
Write a File 
To write a file into HDFS, client needs to interact with namenode. 
Namenode provides the address of the slave on which client will start 
writing the data 
As soon as client finishes writing the block, the slave starts copying the 
block to another slave which in tern copy the block to another slave (3 
replicas by default) 
After required replicas are created then it will send the acknowledge to 
the client 
Authentication process (same as file read)
Write a File
Thank You

More Related Content

What's hot

ŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², Facebook
ŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², FacebookŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², Facebook
ŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², Facebook
yaevents
Ā 
Coordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed SystemsCoordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed Systems
Konstantin V. Shvachko
Ā 

What's hot (20)

Distributed file systems dfs
Distributed file systems   dfsDistributed file systems   dfs
Distributed file systems dfs
Ā 
Hadoop at a glance
Hadoop at a glanceHadoop at a glance
Hadoop at a glance
Ā 
ŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², Facebook
ŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², FacebookŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², Facebook
ŠœŠ°ŃŃˆŃ‚Š°Š±ŠøруŠµŠ¼Š¾ŃŃ‚ŃŒ Hadoop Š² Facebook. Š”Š¼ŠøтрŠøŠ¹ ŠœŠ¾Š»ŃŒŠŗŠ¾Š², Facebook
Ā 
HDFS Design Principles
HDFS Design PrinciplesHDFS Design Principles
HDFS Design Principles
Ā 
Hadoop Interacting with HDFS
Hadoop Interacting with HDFSHadoop Interacting with HDFS
Hadoop Interacting with HDFS
Ā 
Interacting with hdfs
Interacting with hdfsInteracting with hdfs
Interacting with hdfs
Ā 
Hadoop HDFS
Hadoop HDFSHadoop HDFS
Hadoop HDFS
Ā 
Snapshot in Hadoop Distributed File System
Snapshot in Hadoop Distributed File SystemSnapshot in Hadoop Distributed File System
Snapshot in Hadoop Distributed File System
Ā 
March 2011 HUG: HDFS Federation
March 2011 HUG: HDFS FederationMarch 2011 HUG: HDFS Federation
March 2011 HUG: HDFS Federation
Ā 
Hadoop and HDFS
Hadoop and HDFSHadoop and HDFS
Hadoop and HDFS
Ā 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
Ā 
Hadoop HDFS Detailed Introduction
Hadoop HDFS Detailed IntroductionHadoop HDFS Detailed Introduction
Hadoop HDFS Detailed Introduction
Ā 
HDFS User Reference
HDFS User ReferenceHDFS User Reference
HDFS User Reference
Ā 
Hadoop HDFS NameNode HA
Hadoop HDFS NameNode HAHadoop HDFS NameNode HA
Hadoop HDFS NameNode HA
Ā 
Coordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed SystemsCoordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed Systems
Ā 
HDFS Trunncate: Evolving Beyond Write-Once Semantics
HDFS Trunncate: Evolving Beyond Write-Once SemanticsHDFS Trunncate: Evolving Beyond Write-Once Semantics
HDFS Trunncate: Evolving Beyond Write-Once Semantics
Ā 
Hadoop Introduction
Hadoop IntroductionHadoop Introduction
Hadoop Introduction
Ā 
Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduce
Ā 
Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
Ā 
Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
Ā 

Viewers also liked

Chapter 8 distributed file systems
Chapter 8 distributed file systemsChapter 8 distributed file systems
Chapter 8 distributed file systems
AbDul ThaYyal
Ā 
Royalton Punta Cana
Royalton Punta Cana Royalton Punta Cana
Royalton Punta Cana
chglat
Ā 
Sono Innamorato Di Te
Sono Innamorato Di TeSono Innamorato Di Te
Sono Innamorato Di Te
dargula70
Ā 
Spencer Bahamas Option
Spencer Bahamas OptionSpencer Bahamas Option
Spencer Bahamas Option
chglat
Ā 
Ocsid test
Ocsid testOcsid test
Ocsid test
enhee_92
Ā 
ŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼Ń
ŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼ŃŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼Ń
ŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼Ń
Alexandr Mikhailov
Ā 
Servant plan II
Servant plan IIServant plan II
Servant plan II
mgiiicm
Ā 

Viewers also liked (20)

Distributed File Systems
Distributed File SystemsDistributed File Systems
Distributed File Systems
Ā 
Distributed File Systems: An Overview
Distributed File Systems: An OverviewDistributed File Systems: An Overview
Distributed File Systems: An Overview
Ā 
Chapter 8 distributed file systems
Chapter 8 distributed file systemsChapter 8 distributed file systems
Chapter 8 distributed file systems
Ā 
Chapter 17 - Distributed File Systems
Chapter 17 - Distributed File SystemsChapter 17 - Distributed File Systems
Chapter 17 - Distributed File Systems
Ā 
Unico Riv Maya
Unico Riv MayaUnico Riv Maya
Unico Riv Maya
Ā 
Royalton Punta Cana
Royalton Punta Cana Royalton Punta Cana
Royalton Punta Cana
Ā 
Š”Ń€ŃƒŠ·ŃŒŃ-рŠµŠ“Š°ŠŗтŠ¾Ń€Ń‹. ŠšŠ°Šŗ Š”ŠœŠ˜ стŠ°Š½Š¾Š²ŃŃ‚ся сŠ¾Ń†ŠøŠ°Š»ŃŒŠ½Ń‹Š¼Šø
Š”Ń€ŃƒŠ·ŃŒŃ-рŠµŠ“Š°ŠŗтŠ¾Ń€Ń‹. ŠšŠ°Šŗ Š”ŠœŠ˜ стŠ°Š½Š¾Š²ŃŃ‚ся сŠ¾Ń†ŠøŠ°Š»ŃŒŠ½Ń‹Š¼ŠøŠ”Ń€ŃƒŠ·ŃŒŃ-рŠµŠ“Š°ŠŗтŠ¾Ń€Ń‹. ŠšŠ°Šŗ Š”ŠœŠ˜ стŠ°Š½Š¾Š²ŃŃ‚ся сŠ¾Ń†ŠøŠ°Š»ŃŒŠ½Ń‹Š¼Šø
Š”Ń€ŃƒŠ·ŃŒŃ-рŠµŠ“Š°ŠŗтŠ¾Ń€Ń‹. ŠšŠ°Šŗ Š”ŠœŠ˜ стŠ°Š½Š¾Š²ŃŃ‚ся сŠ¾Ń†ŠøŠ°Š»ŃŒŠ½Ń‹Š¼Šø
Ā 
R meetup talk
R meetup talkR meetup talk
R meetup talk
Ā 
24 7
24 724 7
24 7
Ā 
Livingwaterspresents
LivingwaterspresentsLivingwaterspresents
Livingwaterspresents
Ā 
Sono Innamorato Di Te
Sono Innamorato Di TeSono Innamorato Di Te
Sono Innamorato Di Te
Ā 
Secrets Playa Mujeres
Secrets Playa MujeresSecrets Playa Mujeres
Secrets Playa Mujeres
Ā 
Kauai
KauaiKauai
Kauai
Ā 
DB - Leadership experiences
DB -  Leadership experiencesDB -  Leadership experiences
DB - Leadership experiences
Ā 
2013 10 31 halloween
2013 10 31 halloween2013 10 31 halloween
2013 10 31 halloween
Ā 
Yasmin2
Yasmin2Yasmin2
Yasmin2
Ā 
Spencer Bahamas Option
Spencer Bahamas OptionSpencer Bahamas Option
Spencer Bahamas Option
Ā 
Ocsid test
Ocsid testOcsid test
Ocsid test
Ā 
ŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼Ń
ŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼ŃŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼Ń
ŠšŃ€ŠµŠ°Ń‚ŠøŠ²Š½Ń‹Š¹. 3-хŠ»ŠµŃ‚ŠøŠµ. ŠæŠ»Š°Š½Ń‹ Š½Š° Š±Š»ŠøŠ¶Š°Š¹ŃˆŠµŠµ Š²Ń€ŠµŠ¼Ń
Ā 
Servant plan II
Servant plan IIServant plan II
Servant plan II
Ā 

Similar to Hdfs

big data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing databig data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing data
preetik9044
Ā 
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Simplilearn
Ā 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File System
Milad Sobhkhiz
Ā 

Similar to Hdfs (20)

Introduction_to_HDFS sun.pptx
Introduction_to_HDFS sun.pptxIntroduction_to_HDFS sun.pptx
Introduction_to_HDFS sun.pptx
Ā 
module 2.pptx
module 2.pptxmodule 2.pptx
module 2.pptx
Ā 
Introduction to HDFS
Introduction to HDFSIntroduction to HDFS
Introduction to HDFS
Ā 
big data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing databig data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing data
Ā 
Hadoop distributed file system
Hadoop distributed file systemHadoop distributed file system
Hadoop distributed file system
Ā 
Hadoop
HadoopHadoop
Hadoop
Ā 
Hadoop
HadoopHadoop
Hadoop
Ā 
Hadoop HDFS Architeture and Design
Hadoop HDFS Architeture and DesignHadoop HDFS Architeture and Design
Hadoop HDFS Architeture and Design
Ā 
Introduction to Hadoop Distributed File System(HDFS).pptx
Introduction to Hadoop Distributed File System(HDFS).pptxIntroduction to Hadoop Distributed File System(HDFS).pptx
Introduction to Hadoop Distributed File System(HDFS).pptx
Ā 
Unit 1
Unit 1Unit 1
Unit 1
Ā 
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Ā 
Apache Hadoop In Theory And Practice
Apache Hadoop In Theory And PracticeApache Hadoop In Theory And Practice
Apache Hadoop In Theory And Practice
Ā 
Big data with HDFS and Mapreduce
Big data  with HDFS and MapreduceBig data  with HDFS and Mapreduce
Big data with HDFS and Mapreduce
Ā 
Hadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsHadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data Analytics
Ā 
Big data interview questions and answers
Big data interview questions and answersBig data interview questions and answers
Big data interview questions and answers
Ā 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File System
Ā 
HADOOP.pptx
HADOOP.pptxHADOOP.pptx
HADOOP.pptx
Ā 
Hadoop data management
Hadoop data managementHadoop data management
Hadoop data management
Ā 
Hadoop
HadoopHadoop
Hadoop
Ā 
Big Data Reverse Knowledge Transfer.pptx
Big Data Reverse Knowledge Transfer.pptxBig Data Reverse Knowledge Transfer.pptx
Big Data Reverse Knowledge Transfer.pptx
Ā 

More from Chirag Ahuja (10)

Deploy hadoop cluster
Deploy hadoop clusterDeploy hadoop cluster
Deploy hadoop cluster
Ā 
Word count example in hadoop mapreduce using java
Word count example in hadoop mapreduce using javaWord count example in hadoop mapreduce using java
Word count example in hadoop mapreduce using java
Ā 
Big data introduction
Big data introductionBig data introduction
Big data introduction
Ā 
Flume
FlumeFlume
Flume
Ā 
Hbase
HbaseHbase
Hbase
Ā 
Pig
PigPig
Pig
Ā 
Hive : WareHousing Over hadoop
Hive :  WareHousing Over hadoopHive :  WareHousing Over hadoop
Hive : WareHousing Over hadoop
Ā 
Mapreduce advanced
Mapreduce advancedMapreduce advanced
Mapreduce advanced
Ā 
MapReduce basic
MapReduce basicMapReduce basic
MapReduce basic
Ā 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
Ā 

Recently uploaded

Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
amitlee9823
Ā 
Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...
amitlee9823
Ā 
Call Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night Stand
Call Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night StandCall Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night Stand
Call Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night Stand
amitlee9823
Ā 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
Ā 
Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
amitlee9823
Ā 
Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...
amitlee9823
Ā 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
Ā 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
Ā 

Recently uploaded (20)

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
Ā 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
Ā 
Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Bommasandra Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Ā 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
Ā 
Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ā˜Ž 7737669865ā˜Ž Book Your One night Stand (B...
Ā 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Ā 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
Ā 
Call Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night Stand
Call Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night StandCall Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night Stand
Call Girls In Attibele ā˜Ž 7737669865 šŸ„µ Book Your One night Stand
Ā 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Ā 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
Ā 
BDSMāš”Call Girls in Mandawali Delhi >ą¼’8448380779 Escort Service
BDSMāš”Call Girls in Mandawali Delhi >ą¼’8448380779 Escort ServiceBDSMāš”Call Girls in Mandawali Delhi >ą¼’8448380779 Escort Service
BDSMāš”Call Girls in Mandawali Delhi >ą¼’8448380779 Escort Service
Ā 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
Ā 
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Ā 
Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Call Girls Indiranagar Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service B...
Ā 
Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call šŸ‘— 7737669865 šŸ‘— Top Class Call Girl Service Ba...
Ā 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
Ā 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Ā 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Ā 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Ā 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
Ā 

Hdfs

  • 2. What is HDFS HDFS is a Filesystem designed for storing very large files running on cluster of commodity hardware It provides fault-tolerant storage layer for Hadoop and other sub-projects It is a distributed file system that provides high-throughput access to application data It stores data reliably even in case of hardware faliure
  • 3. HDFS Architecture There are two types of nodes, works in master-slaves pattern ā—¦ Master (or called as Namenode) ā€“ It manages all the slave nodes and assign work to slaves ā—¦ Slave (or called as Datanode) ā€“ Actual worker nodes, which does actual task
  • 5. Distributed storage Data is distributed across the cluster of nodes Data is splitted into smaller pieces and stored across multiple nodes of the cluster In this way it provides a way to Map-Reduce to process subset of large data parallely on multiple nodes Distributed storage also provides fault-tolerance capability
  • 6. ļƒ Namenode Datanode Two Daemons Runs for HDFS (for storage) ā—¦ NameNode ā—¦ This daemon runs on Master ā—¦ Namenode stores all the metadata like filename, path, number of blocks, blockIds, block locations, number of replicas, slave related config, etc. ā—¦ DataNode ā—¦ This daemon runs on all the slaves ā—¦ Datanodes stores the actual data
  • 8. ļƒ Blocks Files are splitted into number of blocks Unlike 1 KB block size of OS filesystem, HDFSā€™s default block size is 64 MB (which can be increased according to the requirements) It saves disk seek time, and other advantage is in the case of processing (1 Mapper processes 1 block at a time)
  • 10. Replication In HDFS all the data blocks are replicated across the cluster of nodes Ideally one replica is present at one geographical location The default replication factor is 3, which can be changed according to the requirements. We can change replication factor to desired value by editing configuration files
  • 11. ļƒ High Availability HDFS provides high availability of data by creating multiple replicas of data blocks If a network link or node or some hardware goes down data will be available from some other path If a hardware goes down data can be accessed from other nodes or other paths as we have replicated data (default 3 replicas)
  • 12. ļƒ Reliable Data Storage Data is stored quite reliably in HDFS, this feature is related to High Availability. Since data is highly available due to replication, hence if a node or few nodes or some hardware crashes, the data will be stored reliably. We can again balance the cluster by making the replication factor to desirable value (by running few commands) In other words we can say Hadoop (HDFS) is fault tolerant
  • 14. ļƒ Scalability Two ways to scale the Filesystem ā—¦ Add more disks on the nodes of the cluster ā—¦ For this we need to edit the configuration files and add corresponding entries of newly added disks ā—¦ Horizontal Scaling ā—¦ Another option is to add more nodes in the cluster ā—¦ We can add n number of nodes in the cluster on the fly (without any downtime)
  • 15. File Access & other Operations Programming CLI (Command Line Interface) We can perform almost all the operations which we can do on local filesystem HDFS provides different access rights rwx to ugo We can also browse the filesystem by browser (http://Master-IP:50070)
  • 16. Read a File To read a file from HDFS, client needs to interact with Namenode, as namenode is single place where all the the metadata is stored Namenode provides the address of the slaves where file is stored Client will interact with respective datanodes to read the file Namenode also provide a token to the client which it shows to data node for authentication
  • 18. Write a File To write a file into HDFS, client needs to interact with namenode. Namenode provides the address of the slave on which client will start writing the data As soon as client finishes writing the block, the slave starts copying the block to another slave which in tern copy the block to another slave (3 replicas by default) After required replicas are created then it will send the acknowledge to the client Authentication process (same as file read)