The Google File System
2015-08-22
さとうかずま

GFS architecture

A GFS cluster consists of a single master and multiple chunkservers, and is accessed by multiple clients.

• The master maintains all file system metadata, which includes the locations of chunks
• Chunkservers store the fixed-size chunks into which files are divided

[Figure: clients; master (metadata); chunkservers (chunk, chunk, chunk)]

Ghemawat et al. (2003), The Google File System
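
The division of labour above can be sketched roughly as follows. This is a minimal illustration, not GFS code: the class name `Master`, its two dictionaries, and the `lookup` method are hypothetical names chosen for this sketch, and the 64 MB chunk size is the figure quoted later in the deck. The point is only that the master answers "which chunk, on which chunkservers?" while file data itself lives on the chunkservers.

```python
from dataclasses import dataclass, field

CHUNK_SIZE = 64 * 1024 * 1024  # 64 MB, the chunk size quoted in the deck


@dataclass
class Master:
    """Hypothetical in-memory metadata store (illustration only)."""
    # file path -> ordered list of chunk handles
    file_chunks: dict = field(default_factory=dict)
    # chunk handle -> names of chunkservers holding a replica
    chunk_locations: dict = field(default_factory=dict)

    def lookup(self, path: str, offset: int):
        """Map a byte offset in a file to (chunk handle, replica locations)."""
        index = offset // CHUNK_SIZE  # which fixed-size chunk holds the offset
        handle = self.file_chunks[path][index]
        return handle, self.chunk_locations[handle]


master = Master()
master.file_chunks["/logs/a"] = ["h0", "h1"]
master.chunk_locations["h0"] = ["cs1", "cs2"]
master.chunk_locations["h1"] = ["cs2", "cs3"]

# A byte 70 MB into the file falls in the second chunk (index 1).
handle, servers = master.lookup("/logs/a", 70 * 1024 * 1024)
```

After the lookup, a client would contact one of the returned chunkservers directly for the data, so the master stays off the data path.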

Different points in the design space

A. Treatment of component failures as the norm
B. Optimization for huge files
C. Benefit from co-designing the applications and the file system API

A. Why not treat component failures as the exception?

Among hundreds of servers in a GFS cluster, some are bound to be unavailable at any given time.

Quality
• The system is built from many inexpensive commodity components

Quantity
• Hundreds of servers in a GFS cluster

A. Fault tolerance

GFS provides fault tolerance by:
a. Constant monitoring
b. Replicating crucial data
c. Fast and automatic recovery

a. Exchange of HeartBeat messages [diagram labels: master, clients]
b. Chunk replication
c. No distinction between normal and abnormal termination

Termination   Cause             Restore
normal        killing process   in seconds
abnormal      exception         in seconds
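
Points a and b above can be sketched together. This is a hedged toy model, assuming (as in the GFS paper) that HeartBeat messages flow between the master and the chunkservers and that three replicas per chunk is the default target. The class `HeartbeatMonitor`, the function `under_replicated`, and the 10-second timeout are hypothetical choices for this sketch, not values from the deck.

```python
class HeartbeatMonitor:
    """Tracks when each chunkserver was last heard from (illustration only)."""

    def __init__(self, timeout_s: float = 10.0):  # timeout is an assumed value
        self.timeout_s = timeout_s
        self.last_seen = {}  # server name -> time of last heartbeat

    def record(self, server: str, now: float) -> None:
        self.last_seen[server] = now

    def live_servers(self, now: float) -> set:
        """Servers whose last heartbeat is within the timeout window."""
        return {s for s, t in self.last_seen.items()
                if now - t <= self.timeout_s}


def under_replicated(chunk_locations: dict, live: set, target: int = 3) -> dict:
    """Return {chunk handle: number of missing replicas} for chunks below target."""
    missing = {}
    for handle, servers in chunk_locations.items():
        alive = [s for s in servers if s in live]
        if len(alive) < target:
            missing[handle] = target - len(alive)
    return missing


monitor = HeartbeatMonitor(timeout_s=10.0)
for name in ("cs1", "cs2", "cs3"):
    monitor.record(name, now=0.0)
# cs3 goes silent; the other two keep reporting.
monitor.record("cs1", now=20.0)
monitor.record("cs2", now=20.0)

live = monitor.live_servers(now=25.0)
todo = under_replicated({"h0": ["cs1", "cs2", "cs3"]}, live)
```

In this model a silent chunkserver simply drops out of the live set, and every chunk it held shows up as needing one more replica. No distinction is made between a crash and a deliberate shutdown, which matches point c.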
  

B. Optimization

A chunk size of 64 MB, which is much larger than typical file system block sizes, offers 3 advantages:

• Reduces client-master interaction, because applications mostly read and write large files sequentially
• Reduces network overhead, because a client is more likely to perform many operations on a given chunk
• Reduces the size of the metadata stored on the master, which makes it possible to keep the metadata in memory
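
The third advantage can be made concrete with a rough calculation. The GFS paper states that the master keeps less than 64 bytes of metadata per 64 MB chunk. The sketch below uses 64 bytes as an upper bound and, purely as an assumption for comparison, applies the same per-entry cost to a 4 KiB block size. The function name `metadata_bytes` is hypothetical.

```python
def metadata_bytes(data_bytes: int, unit_bytes: int, bytes_per_entry: int = 64) -> int:
    """Upper bound on metadata size if each chunk or block needs one entry."""
    entries = -(-data_bytes // unit_bytes)  # ceiling division
    return entries * bytes_per_entry


ONE_TIB = 2**40

# 64 MiB chunks: 16,384 entries, about 1 MiB of metadata per TiB of data.
gfs_chunks = metadata_bytes(ONE_TIB, 64 * 2**20)

# 4 KiB blocks (assumed comparison): 268,435,456 entries, about 16 GiB per TiB.
small_blocks = metadata_bytes(ONE_TIB, 4 * 2**10)

print(gfs_chunks, small_blocks, small_blocks // gfs_chunks)
```

Under these assumptions the larger chunk size shrinks the metadata by a factor of 16,384, which is what makes holding it all in the master's memory practical.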
  

C. Benefit from co-designing

Record append operation

Layer: Description
Application: Most files are mutated by appending new data rather than overwriting existing data
GFS mechanism: Record append allows multiple clients to append data to the same file concurrently
API: Multiple clients can append concurrently to a file without extra synchronization between them
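
The client-side view of record append might look like the following single-process toy. It is a sketch under stated assumptions, not the GFS protocol: `AppendOnlyFile`, `record_append`, and `run_clients` are hypothetical names, and an internal lock stands in for the serialization that GFS performs on the server side. Replication, padding, and duplicate records are not modeled. What it does show is that the file, not the caller, picks the offset, so the client threads never coordinate with each other.

```python
import threading


class AppendOnlyFile:
    """Toy stand-in for a file that supports record append (illustration only)."""

    def __init__(self):
        self._data = bytearray()
        self._lock = threading.Lock()  # serialization happens here, not in clients

    def record_append(self, record: bytes) -> int:
        """Append the record atomically and return the offset chosen for it."""
        with self._lock:
            offset = len(self._data)
            self._data += record
            return offset

    def read(self, offset: int, length: int) -> bytes:
        return bytes(self._data[offset:offset + length])


def run_clients(log: AppendOnlyFile, n_clients: int = 4, records_each: int = 50):
    """Start several client threads that append without talking to each other."""
    per_client = [[] for _ in range(n_clients)]

    def client(i: int) -> None:
        for j in range(records_each):
            record = f"client{i}-rec{j:03d};".encode()
            per_client[i].append((log.record_append(record), record))

    threads = [threading.Thread(target=client, args=(i,)) for i in range(n_clients)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return [pair for results in per_client for pair in results]


log = AppendOnlyFile()
pairs = run_clients(log)
```

Each client learns where its record landed only from the returned offset, and every record can be read back intact from that offset regardless of how the threads interleaved.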
