Dfs

Distributed
File System
BY: Benlahrech Freiha Hanan
2019/2020

Contenent
• Introduction
• DFS
• How it works
• DFS Concepts
• File service Model
• NoSQL
• Most poppular DFSs
• NFS as an exemple
• Advantages/Challenges
• Conclusion

Introduction
A File System is a subsystem of the operating system that
performs file management activities such as Organization,
Storing, Retrieval, Naming,, sharing, and Protection of files.
Distributed file system (DFS)
• A method of storing and accessing files based in a
client/server architecture.
• A distributed implementation of the classical time-sharing
model of a file system, in which multiple users share files and
storage resources.

DFS
• In a distributed file system, one or more
central servers store files that can be
accessed, with proper authorization rights,
by any number of remote clients in the
network.

Distribution Concept
• Distribute blocks of data sets across multiple nodes.
• Each node has its own computing power;
which gives the ability of DFS to parallel processing data blocks.

Replication Concept
DFS will replicate data blocks on different clusters by copy the same pieces of
information into multiple clusters on different racks.
This will help to achieve Fault Tolerance and High Concurrency

File Service Models
Upload/download Model:
• files move between server and clients
• few operations (read file & write file)
• requires storage at client
• Good if whole file is accessed
Remote access: Model
• files stay at server
• rich interface with many operations
• less space at client,
• Efficient for small accesses

NoSQL
• Database management Non
SQL
• It does not support
relational databases
• Used for distributed
transaction processing
across multiple databases

Most Known Implementation of DFS
• NFS
• MouseFS
• HDFS
• Ceph
• GlusterFS

Local and Remote FS accessible on
NFS Client

The Advantages of DFS
• Scalability
• Fault Tolerance
• High Concurrency

Challenges
• Transparent access
User sees single, global file system regardless of location
• Scalable performance
Performance does not degrade as more clients are added
• Fault Tolerance
Client and server identify and respond appropriately when other crashes
• Consistency
See same directory and file contents on different clients at same time
• Security
Secure communication and user authentication
• Tension across these goals
Example: Caching helps performance, but hurts consistency

Conclusion
• Distributed file system is the new evolved version of
file system
• It can be advantageous because
Distribution of documents becomes easier to multiple
clients
Centralized storage system so client machines are not
using their resources to store files.

References
• https://www.mindtory.com/an-introduction-to-distributed-file-system/
• https://www.slideshare.net/PhilippeJulio/hadoop-architecture/10-
DISTRIBUTED_FILE_SYSTEMS_System_that
• https://slideplayer.com/slide/4910941/
• https://subscription.packtpub.com/book/big_data_and_business_intellige
nce/9781789612899/1/ch01lvl1sec12/understanding-the-supported-
nosql-data-models
• https://www.slideserve.com/elvis/distributed-systems-course-distributed-
file-systems
• https://slideplayer.com/slide/8943606/
• https://www.assignmenthelp.net/distributed_file_system

Dfs

More Related Content

What's hot

Similar to Dfs

Recently uploaded

Dfs

Editor's Notes