SlideShare a Scribd company logo
1 of 21
Distributed
File System
BY: Benlahrech Freiha Hanan
2019/2020
Contenent
• Introduction
• DFS
• How it works
• DFS Concepts
• File service Model
• NoSQL
• Most poppular DFSs
• NFS as an exemple
• Advantages/Challenges
• Conclusion
Introduction
A File System is a subsystem of the operating system that
performs file management activities such as Organization,
Storing, Retrieval, Naming,, sharing, and Protection of files.
Distributed file system (DFS)
• A method of storing and accessing files based in a
client/server architecture.
• A distributed implementation of the classical time-sharing
model of a file system, in which multiple users share files and
storage resources.
DFS
• In a distributed file system, one or more
central servers store files that can be
accessed, with proper authorization rights,
by any number of remote clients in the
network.
How DFS
works?
DFS
Concepts
Distribution Concept
• Distribute blocks of data sets across multiple nodes.
• Each node has its own computing power;
which gives the ability of DFS to parallel processing data blocks.
Replication Concept
DFS will replicate data blocks on different clusters by copy the same pieces of
information into multiple clusters on different racks.
This will help to achieve Fault Tolerance and High Concurrency
Fault Tolerance Concept
High Concurrency Concept
File Service Models
Upload/download Model:
• files move between server and clients
• few operations (read file & write file)
• requires storage at client
• Good if whole file is accessed
Remote access: Model
• files stay at server
• rich interface with many operations
• less space at client,
• Efficient for small accesses
NoSQL
• Database management Non
SQL
• It does not support
relational databases
• Used for distributed
transaction processing
across multiple databases
Most Known Implementation of DFS
• NFS
• MouseFS
• HDFS
• Ceph
• GlusterFS
NFS as an
exemple
NFS Architecture
Local and Remote FS accessible on
NFS Client
The Advantages of DFS
• Scalability
• Fault Tolerance
• High Concurrency
Challenges
• Transparent access
User sees single, global file system regardless of location
• Scalable performance
Performance does not degrade as more clients are added
• Fault Tolerance
Client and server identify and respond appropriately when other crashes
• Consistency
See same directory and file contents on different clients at same time
• Security
Secure communication and user authentication
• Tension across these goals
Example: Caching helps performance, but hurts consistency
Conclusion
• Distributed file system is the new evolved version of
file system
• It can be advantageous because
Distribution of documents becomes easier to multiple
clients
Centralized storage system so client machines are not
using their resources to store files.
References
• https://www.mindtory.com/an-introduction-to-distributed-file-system/
• https://www.slideshare.net/PhilippeJulio/hadoop-architecture/10-
DISTRIBUTED_FILE_SYSTEMS_System_that
• https://slideplayer.com/slide/4910941/
• https://subscription.packtpub.com/book/big_data_and_business_intellige
nce/9781789612899/1/ch01lvl1sec12/understanding-the-supported-
nosql-data-models
• https://www.slideserve.com/elvis/distributed-systems-course-distributed-
file-systems
• https://slideplayer.com/slide/8943606/
• https://www.assignmenthelp.net/distributed_file_system

More Related Content

What's hot

Taking the open cloud to 11
Taking the open cloud to 11Taking the open cloud to 11
Taking the open cloud to 11Joe Brockmeier
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveGluster.org
 
Docker y azure container service
Docker y azure container serviceDocker y azure container service
Docker y azure container serviceFernando Mejía
 
What is a Server
What is a ServerWhat is a Server
What is a ServerKuwait10
 
Directory Write Leases in MagFS
Directory Write Leases in MagFSDirectory Write Leases in MagFS
Directory Write Leases in MagFSMaginatics
 
Maginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical HighlightsMaginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical HighlightsMaginatics
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics
 
Introducción a CloudStack
Introducción a CloudStackIntroducción a CloudStack
Introducción a CloudStackHollman Enciso
 
AMPLIFY MFT: ensuring continuous service availability
AMPLIFY MFT: ensuring continuous service availabilityAMPLIFY MFT: ensuring continuous service availability
AMPLIFY MFT: ensuring continuous service availabilityAxway
 
[WSO2Con EU 2017] File Processing and Websockets with Ballerina
[WSO2Con EU 2017] File Processing and Websockets with Ballerina[WSO2Con EU 2017] File Processing and Websockets with Ballerina
[WSO2Con EU 2017] File Processing and Websockets with BallerinaWSO2
 
JasperWorld 2012: Reinventing Data Management by Max Schireson
JasperWorld 2012: Reinventing Data Management by Max SchiresonJasperWorld 2012: Reinventing Data Management by Max Schireson
JasperWorld 2012: Reinventing Data Management by Max SchiresonMongoDB
 
Efficient data maintaince in GlusterFS using Databases
Efficient data maintaince in GlusterFS using DatabasesEfficient data maintaince in GlusterFS using Databases
Efficient data maintaince in GlusterFS using DatabasesJoseph Elwin Fernandes
 
Writing Space and the Cassandra NoSQL DBMS
Writing Space and the Cassandra NoSQL DBMSWriting Space and the Cassandra NoSQL DBMS
Writing Space and the Cassandra NoSQL DBMSDataStax Academy
 
IWMW 1997: WWW Caching
IWMW 1997: WWW CachingIWMW 1997: WWW Caching
IWMW 1997: WWW CachingIWMW
 
Windows 8 dddd (beekelaar)
Windows 8 dddd (beekelaar)Windows 8 dddd (beekelaar)
Windows 8 dddd (beekelaar)hypervnu
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS
 

What's hot (20)

Txlf2012
Txlf2012Txlf2012
Txlf2012
 
Taking the open cloud to 11
Taking the open cloud to 11Taking the open cloud to 11
Taking the open cloud to 11
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep Dive
 
Docker y azure container service
Docker y azure container serviceDocker y azure container service
Docker y azure container service
 
Server vs client
Server vs clientServer vs client
Server vs client
 
What is a Server
What is a ServerWhat is a Server
What is a Server
 
Directory Write Leases in MagFS
Directory Write Leases in MagFSDirectory Write Leases in MagFS
Directory Write Leases in MagFS
 
Azure storage
Azure storageAzure storage
Azure storage
 
Maginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical HighlightsMaginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
 
Introducción a CloudStack
Introducción a CloudStackIntroducción a CloudStack
Introducción a CloudStack
 
AMPLIFY MFT: ensuring continuous service availability
AMPLIFY MFT: ensuring continuous service availabilityAMPLIFY MFT: ensuring continuous service availability
AMPLIFY MFT: ensuring continuous service availability
 
[WSO2Con EU 2017] File Processing and Websockets with Ballerina
[WSO2Con EU 2017] File Processing and Websockets with Ballerina[WSO2Con EU 2017] File Processing and Websockets with Ballerina
[WSO2Con EU 2017] File Processing and Websockets with Ballerina
 
Storage for VDI
Storage for VDIStorage for VDI
Storage for VDI
 
JasperWorld 2012: Reinventing Data Management by Max Schireson
JasperWorld 2012: Reinventing Data Management by Max SchiresonJasperWorld 2012: Reinventing Data Management by Max Schireson
JasperWorld 2012: Reinventing Data Management by Max Schireson
 
Efficient data maintaince in GlusterFS using Databases
Efficient data maintaince in GlusterFS using DatabasesEfficient data maintaince in GlusterFS using Databases
Efficient data maintaince in GlusterFS using Databases
 
Writing Space and the Cassandra NoSQL DBMS
Writing Space and the Cassandra NoSQL DBMSWriting Space and the Cassandra NoSQL DBMS
Writing Space and the Cassandra NoSQL DBMS
 
IWMW 1997: WWW Caching
IWMW 1997: WWW CachingIWMW 1997: WWW Caching
IWMW 1997: WWW Caching
 
Windows 8 dddd (beekelaar)
Windows 8 dddd (beekelaar)Windows 8 dddd (beekelaar)
Windows 8 dddd (beekelaar)
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 Meetup
 

Similar to Dfs

DFS PPT.pptx
DFS PPT.pptxDFS PPT.pptx
DFS PPT.pptxVMahesh5
 
Distributed file system
Distributed file systemDistributed file system
Distributed file systemNaza hamed Jan
 
Big Data Architecture Workshop - Vahid Amiri
Big Data Architecture Workshop -  Vahid AmiriBig Data Architecture Workshop -  Vahid Amiri
Big Data Architecture Workshop - Vahid Amiridatastack
 
Chapter 17 - Distributed File Systems
Chapter 17 - Distributed File SystemsChapter 17 - Distributed File Systems
Chapter 17 - Distributed File SystemsWayne Jones Jnr
 
Introduction to distributed file systems
Introduction to distributed file systemsIntroduction to distributed file systems
Introduction to distributed file systemsViet-Trung TRAN
 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File SystemMilad Sobhkhiz
 
Microsoft's distributed file system
Microsoft's distributed file systemMicrosoft's distributed file system
Microsoft's distributed file systemSimranSharma215
 
Big Data Storage Concepts from the "Big Data concepts Technology and Architec...
Big Data Storage Concepts from the "Big Data concepts Technology and Architec...Big Data Storage Concepts from the "Big Data concepts Technology and Architec...
Big Data Storage Concepts from the "Big Data concepts Technology and Architec...raghdooosh
 
CS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSCS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSKathirvel Ayyaswamy
 
Introduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud ComputingIntroduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud ComputingRutuja751147
 
Ch16 OS
Ch16 OSCh16 OS
Ch16 OSC.U
 

Similar to Dfs (20)

DFS PPT.pptx
DFS PPT.pptxDFS PPT.pptx
DFS PPT.pptx
 
Chapter-5-DFS.ppt
Chapter-5-DFS.pptChapter-5-DFS.ppt
Chapter-5-DFS.ppt
 
Distributed file system
Distributed file systemDistributed file system
Distributed file system
 
12. dfs
12. dfs12. dfs
12. dfs
 
Nfs
NfsNfs
Nfs
 
11. dfs
11. dfs11. dfs
11. dfs
 
Hdfs
HdfsHdfs
Hdfs
 
Big Data Architecture Workshop - Vahid Amiri
Big Data Architecture Workshop -  Vahid AmiriBig Data Architecture Workshop -  Vahid Amiri
Big Data Architecture Workshop - Vahid Amiri
 
Chapter 17 - Distributed File Systems
Chapter 17 - Distributed File SystemsChapter 17 - Distributed File Systems
Chapter 17 - Distributed File Systems
 
Introduction to distributed file systems
Introduction to distributed file systemsIntroduction to distributed file systems
Introduction to distributed file systems
 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File System
 
Microsoft's distributed file system
Microsoft's distributed file systemMicrosoft's distributed file system
Microsoft's distributed file system
 
Big Data Storage Concepts from the "Big Data concepts Technology and Architec...
Big Data Storage Concepts from the "Big Data concepts Technology and Architec...Big Data Storage Concepts from the "Big Data concepts Technology and Architec...
Big Data Storage Concepts from the "Big Data concepts Technology and Architec...
 
CS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSCS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMS
 
Introduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud ComputingIntroduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud Computing
 
Hadoop data management
Hadoop data managementHadoop data management
Hadoop data management
 
OSCh16
OSCh16OSCh16
OSCh16
 
Ch16 OS
Ch16 OSCh16 OS
Ch16 OS
 
OS_Ch16
OS_Ch16OS_Ch16
OS_Ch16
 
Giraffa - November 2014
Giraffa - November 2014Giraffa - November 2014
Giraffa - November 2014
 

Recently uploaded

XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 

Recently uploaded (20)

XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 

Dfs

  • 1. Distributed File System BY: Benlahrech Freiha Hanan 2019/2020
  • 2. Contenent • Introduction • DFS • How it works • DFS Concepts • File service Model • NoSQL • Most poppular DFSs • NFS as an exemple • Advantages/Challenges • Conclusion
  • 3. Introduction A File System is a subsystem of the operating system that performs file management activities such as Organization, Storing, Retrieval, Naming,, sharing, and Protection of files. Distributed file system (DFS) • A method of storing and accessing files based in a client/server architecture. • A distributed implementation of the classical time-sharing model of a file system, in which multiple users share files and storage resources.
  • 4. DFS • In a distributed file system, one or more central servers store files that can be accessed, with proper authorization rights, by any number of remote clients in the network.
  • 6.
  • 8. Distribution Concept • Distribute blocks of data sets across multiple nodes. • Each node has its own computing power; which gives the ability of DFS to parallel processing data blocks.
  • 9. Replication Concept DFS will replicate data blocks on different clusters by copy the same pieces of information into multiple clusters on different racks. This will help to achieve Fault Tolerance and High Concurrency
  • 12. File Service Models Upload/download Model: • files move between server and clients • few operations (read file & write file) • requires storage at client • Good if whole file is accessed Remote access: Model • files stay at server • rich interface with many operations • less space at client, • Efficient for small accesses
  • 13. NoSQL • Database management Non SQL • It does not support relational databases • Used for distributed transaction processing across multiple databases
  • 14. Most Known Implementation of DFS • NFS • MouseFS • HDFS • Ceph • GlusterFS
  • 17. Local and Remote FS accessible on NFS Client
  • 18. The Advantages of DFS • Scalability • Fault Tolerance • High Concurrency
  • 19. Challenges • Transparent access User sees single, global file system regardless of location • Scalable performance Performance does not degrade as more clients are added • Fault Tolerance Client and server identify and respond appropriately when other crashes • Consistency See same directory and file contents on different clients at same time • Security Secure communication and user authentication • Tension across these goals Example: Caching helps performance, but hurts consistency
  • 20. Conclusion • Distributed file system is the new evolved version of file system • It can be advantageous because Distribution of documents becomes easier to multiple clients Centralized storage system so client machines are not using their resources to store files.
  • 21. References • https://www.mindtory.com/an-introduction-to-distributed-file-system/ • https://www.slideshare.net/PhilippeJulio/hadoop-architecture/10- DISTRIBUTED_FILE_SYSTEMS_System_that • https://slideplayer.com/slide/4910941/ • https://subscription.packtpub.com/book/big_data_and_business_intellige nce/9781789612899/1/ch01lvl1sec12/understanding-the-supported- nosql-data-models • https://www.slideserve.com/elvis/distributed-systems-course-distributed- file-systems • https://slideplayer.com/slide/8943606/ • https://www.assignmenthelp.net/distributed_file_system

Editor's Notes

  1. In Big Data, we deal with multiple clusters (computers) often. One of the main advantages of Big Data which is that it goes beyond the capabilities of one single super powerful server with extremely high computing power. The whole idea of Big Data is to distribute data across multiple clusters and to make use of computing power of each cluster (node) to process information. Distributed file system is a system that can handle accessing data across multiple clusters (nodes). Distributed file systems can be advantageous because they make it easier to distribute documents to multiple clients and they provide a centralized storage system so that client machines are not using their resources to store files.
  2. How Distributed file system (DFS) works? Distributed file system works as follows: Distribution: Distribute blocks of data sets across multiple nodes. Each node has its own computing power; which gives the ability of DFS to parallel processing data blocks. Replication: Distributed file system will also replicate data blocks on different clusters by copy the same pieces of information into multiple clusters on different racks. This will help to achieve the following: Fault Tolerance: recover data block in case of cluster failure or Rack failure. High Concurrency: avail same piece of data to be processed by multiple clients at the same time. It is done using the computation power of each node to parallel process data blocks.
  3. upload/download: files move between server and clients, few operations (read file & write file), simple, requires storage at client, good if whole file is accessed remote access: files stay at server, rich interface with many operations, less space at client, efficient for small accesses
  4. Key/value: This is a persistent dictionary. It is best for when we know the key and we need to retrieve the associated value for the key. Column, wide-column, or column-family: This organizes related data into columns instead of the typical organization in rows. It is best for when we need to query across specific columns in the database. Document: This allows persisting JSON objects (documents), which can include nested objects or arrays of other objects. Graph: This allows you to persist edges and nodes with their properties. It is best for when we need to store and navigate through complex relationships.
  5. What are the Advantages of Distributed File System (DFS)? Distributed file system provides the following main advantages: Scalability: You can scale up your infrastructure by adding more racks or clusters to your system. Fault Tolerance: Data replication will help to achieve fault tolerance in the following cases: Cluster is down Rack is down Rack is disconnected from the network. Job failure or restart. High Concurrency: utilize the compute power of each node to handle multiple client requests (in a parallel way) at the same time. The following figure illustrates the main concept of high concurrency and how it can be achieved by data replication on multiple clusters. Access from multiple clients Same user on different machines can access same files Simplifies sharing Different users on different machines can read/write to same files Simplifies administration One shared server to maintain (and backup) Improve reliability Add RAID storage to server Access from multiple clients Same user on different machines can access same files Simplifies sharing Different users on different machines can read/write to same files Simplifies administration One shared server to maintain (and backup) Improve reliability Add RAID storage to server
  6. Challenges Transparent access Scalable performance Fault Tolerance User sees single, global file system regardless of location Scalable performance Performance does not degrade as more clients are added Fault Tolerance Client and server identify and respond appropriately when other crashes Consistency See same directory and file contents on different clients at same time Security Secure communication and user authentication Tension across these goals Example: Caching helps performance, but hurts consistency