SlideShare a Scribd company logo
1 of 15
Download to read offline
Cumulus:
 Filesystem Backup to the Cloud
Michael Vrable, Stefan Savage, and Geoffrey M.
                    Voelker



      Presented by Hakim Weatherspoon
Good „ole Days


     Replace your tape drives
     with something truly scalable
Amazon S3 to the rescue
In Spring 2006, Amazon released a new
storage API: Put, Get, List, Delete




           Build whatever you want!

                      Quickly
Backing up the new way (S3)
Smart




    Scales


     - no longer our concern... Amazon's concern
     - all servers backup in parallel
Cheap

     - old cost =XXXper year
     - new cost =YYYper year
         where YYY < XXX
     
Thin vs Thick Cloud
• E.g. Amazons S3 vs EMC‟s MozyPro
• Thin
  – Can change provider easier
  – Applications can work across providers
• Thick
  – Better performance
  – Locked into a provider
  – Provider can go out of business
Cumulus
• Simple storage backup utility for Thin Clouds
• Evaluates efficacy of cloud storage
• Working prototype
  – http://www.cs.ucsd.edu/~mvrable/cumulus/
Outline
•   Motivation/Intro
•   Related Work
•   Design
•   Evaluation
•   Thoughts and Conclusions
Related Work
Outline
• Motivation/Intro
• Related Work
• Design
  –   API
  –   Segments
  –   Snapshots
  –   Subfileincrementals
  –   Cleaning
  –   restoring
• Evaluation
• Thoughts and Conclusions
API
• Same as S3
  – Put, Get, List, Delete


• Thin cloud – does not rely on integrated services
  – Can easily change provider and network protocols
  – S3, FTP, SFTP


• WORM Model
  – Write-once, read-many
  – Requires writing new entirely file if changes occur
  – What are the cleaning overheads?
Segments
• Aggregation via Segment Goals
  – Avoid costs due to small files
     • S3 charges on per file bases
     • Many small files
  – Avoid costs in network protocols
     • Small files have higher latency and other overheads
  – Compression
     • inter-file similarities
  – Privacy
     • Hide file boundaries
• Negative consequences?
  – Need an entire segment to write
Snapshots
Sub-File Incrementals
• Only stored changed part of files
• New snapshots point to old objects when data
  unchanged
  • Byte ranges – portions of old objcts to be reused
Segment Cleaning
• Similar to a log-structured file system (LFS)
• Clean based on utilization of segment, α
  – α = 0, no cleaning
  – α = 1, clean with the slightest change
• Cumulus
  –   attempts to find an equilibrium for α
  –   Uses a different process to clean
  –   Marks a local database as “expired”
  –   Then, next snapshot will not refer to expired segment
Restore
• Full Restore
  – Download all segments for a snapshot
• Partial Restore
  – Download snapshot descriptor, metadata, and only
    necessary segments


• What happens if client machine dies?
• How is latest snapshot descriptors identified?
• What about sharing between client machines?

More Related Content

What's hot

Load Balancing
Load BalancingLoad Balancing
Load Balancingnashniv
 
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop SeamonstersScylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop SeamonstersScyllaDB
 
eZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content PublishingeZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content PublishingBertrand Dunogier
 
Cloud brew cloudcamp
Cloud brew cloudcampCloud brew cloudcamp
Cloud brew cloudcampHenry Been
 
Selecting the Right Cloud Host
Selecting the Right Cloud HostSelecting the Right Cloud Host
Selecting the Right Cloud HostAhsan Saleem
 
Windows Azure Service Bus
Windows Azure Service BusWindows Azure Service Bus
Windows Azure Service BusPavel Revenkov
 
AWS Customer Presentation - HotPads
AWS Customer Presentation - HotPadsAWS Customer Presentation - HotPads
AWS Customer Presentation - HotPadsAmazon Web Services
 
A brief introduction to CloudFormation
A brief introduction to CloudFormationA brief introduction to CloudFormation
A brief introduction to CloudFormationSWIFTotter Solutions
 
Percona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPercona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPythian
 
Escalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte IEscalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte ILeandro Silva
 
Experiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and PostgresqlExperiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and PostgresqlOkis Chuang
 
Getting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex QueriesGetting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex Queriestimonk
 
Rencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePointRencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePointRencore
 
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...DataStax Academy
 
Microsoft Azure Media Services
Microsoft Azure Media ServicesMicrosoft Azure Media Services
Microsoft Azure Media ServicesPavel Revenkov
 
Heap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World IssuesHeap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World IssuesKanika Gera
 
Pagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index OptimizationPagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index OptimizationRalf Schwoebel
 

What's hot (20)

Load Balancing
Load BalancingLoad Balancing
Load Balancing
 
Aws, an intro to startups
Aws, an intro to startupsAws, an intro to startups
Aws, an intro to startups
 
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop SeamonstersScylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
 
eZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content PublishingeZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content Publishing
 
Cloud brew cloudcamp
Cloud brew cloudcampCloud brew cloudcamp
Cloud brew cloudcamp
 
Architecture et coût
Architecture et coûtArchitecture et coût
Architecture et coût
 
Selecting the Right Cloud Host
Selecting the Right Cloud HostSelecting the Right Cloud Host
Selecting the Right Cloud Host
 
Windows Azure Service Bus
Windows Azure Service BusWindows Azure Service Bus
Windows Azure Service Bus
 
AWS Customer Presentation - HotPads
AWS Customer Presentation - HotPadsAWS Customer Presentation - HotPads
AWS Customer Presentation - HotPads
 
A brief introduction to CloudFormation
A brief introduction to CloudFormationA brief introduction to CloudFormation
A brief introduction to CloudFormation
 
Percona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPercona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWS
 
Escalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte IEscalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte I
 
Experiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and PostgresqlExperiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and Postgresql
 
Getting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex QueriesGetting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex Queries
 
Performance stack
Performance stackPerformance stack
Performance stack
 
Rencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePointRencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePoint
 
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
 
Microsoft Azure Media Services
Microsoft Azure Media ServicesMicrosoft Azure Media Services
Microsoft Azure Media Services
 
Heap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World IssuesHeap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World Issues
 
Pagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index OptimizationPagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index Optimization
 

Viewers also liked (7)

Creating Resuable Learning Objects for Your Classroom
Creating Resuable Learning Objects for Your ClassroomCreating Resuable Learning Objects for Your Classroom
Creating Resuable Learning Objects for Your Classroom
 
2008 Q1 Google Earnings Slides
2008 Q1 Google Earnings Slides2008 Q1 Google Earnings Slides
2008 Q1 Google Earnings Slides
 
20070930 Google 10 Q
20070930 Google 10 Q20070930 Google 10 Q
20070930 Google 10 Q
 
citigroup October 16, 2008 - Third Quarter Financial Supplement
citigroup October 16, 2008 - Third Quarter  Financial Supplementcitigroup October 16, 2008 - Third Quarter  Financial Supplement
citigroup October 16, 2008 - Third Quarter Financial Supplement
 
Janneth
JannethJanneth
Janneth
 
Ebay News 2002 7 18 Earnings
Ebay News 2002 7 18 EarningsEbay News 2002 7 18 Earnings
Ebay News 2002 7 18 Earnings
 
Web Development or SDK?
Web Development or SDK?Web Development or SDK?
Web Development or SDK?
 

Similar to Quixote

Solaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and TuningSolaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and TuningAdrian Cockcroft
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11CloudExpoEurope
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11aseager
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11aseager
 
Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentationandyman3000
 
Reducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationReducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationAmazon Web Services
 
Our Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent CloudOur Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent CloudHostedbyConfluent
 
How to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterHow to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterTim Lossen
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon AuroraAmazon Web Services
 
Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)Andrejs Prokopjevs
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics PlatformSantanu Dey
 
Getting started with amazon aurora - Toronto
Getting started with amazon aurora - TorontoGetting started with amazon aurora - Toronto
Getting started with amazon aurora - TorontoAmazon Web Services
 
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech TalksDeep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech TalksAmazon Web Services
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailInternet World
 
Benchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public cloudsBenchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public cloudsdata://disrupted®
 
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)Andrejs Prokopjevs
 
Perfmon And Profiler 101
Perfmon And Profiler 101Perfmon And Profiler 101
Perfmon And Profiler 101Quest Software
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon AuroraAmazon Web Services
 

Similar to Quixote (20)

Solaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and TuningSolaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and Tuning
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11
 
Magee Dday2 Fixing App Performance Italiano
Magee Dday2 Fixing App Performance ItalianoMagee Dday2 Fixing App Performance Italiano
Magee Dday2 Fixing App Performance Italiano
 
Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentation
 
Reducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationReducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard Consolidation
 
Our Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent CloudOur Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent Cloud
 
How to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterHow to build a state-of-the-art rails cluster
How to build a state-of-the-art rails cluster
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon Aurora
 
Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
Getting started with amazon aurora - Toronto
Getting started with amazon aurora - TorontoGetting started with amazon aurora - Toronto
Getting started with amazon aurora - Toronto
 
Amazon Aurora (Debanjan Saha) - AWS DB Day
Amazon Aurora (Debanjan Saha) - AWS DB DayAmazon Aurora (Debanjan Saha) - AWS DB Day
Amazon Aurora (Debanjan Saha) - AWS DB Day
 
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech TalksDeep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, Whiptail
 
Benchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public cloudsBenchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public clouds
 
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
 
Perfmon And Profiler 101
Perfmon And Profiler 101Perfmon And Profiler 101
Perfmon And Profiler 101
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon Aurora
 

More from ceiparua

Rosalia de castro
Rosalia de castroRosalia de castro
Rosalia de castroceiparua
 
Moluscomania
MoluscomaniaMoluscomania
Moluscomaniaceiparua
 
Roni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadandoRoni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadandoceiparua
 
Roni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranseRoni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranseceiparua
 
Traballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias CorreaTraballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias Correaceiparua
 
Competencias Basicas Xunta
Competencias Basicas XuntaCompetencias Basicas Xunta
Competencias Basicas Xuntaceiparua
 
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08ceiparua
 
Reciclamos
ReciclamosReciclamos
Reciclamosceiparua
 
Comparámonos
ComparámonosComparámonos
Comparámonosceiparua
 
Carapuchiña vermella xigante
Carapuchiña vermella xiganteCarapuchiña vermella xigante
Carapuchiña vermella xiganteceiparua
 
Cartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez PazCartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez Pazceiparua
 
Empusa Pennata
Empusa PennataEmpusa Pennata
Empusa Pennataceiparua
 
Horta A RúA 08
Horta A RúA 08Horta A RúA 08
Horta A RúA 08ceiparua
 
FelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúAFelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúAceiparua
 

More from ceiparua (14)

Rosalia de castro
Rosalia de castroRosalia de castro
Rosalia de castro
 
Moluscomania
MoluscomaniaMoluscomania
Moluscomania
 
Roni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadandoRoni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadando
 
Roni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranseRoni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranse
 
Traballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias CorreaTraballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias Correa
 
Competencias Basicas Xunta
Competencias Basicas XuntaCompetencias Basicas Xunta
Competencias Basicas Xunta
 
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
 
Reciclamos
ReciclamosReciclamos
Reciclamos
 
Comparámonos
ComparámonosComparámonos
Comparámonos
 
Carapuchiña vermella xigante
Carapuchiña vermella xiganteCarapuchiña vermella xigante
Carapuchiña vermella xigante
 
Cartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez PazCartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez Paz
 
Empusa Pennata
Empusa PennataEmpusa Pennata
Empusa Pennata
 
Horta A RúA 08
Horta A RúA 08Horta A RúA 08
Horta A RúA 08
 
FelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúAFelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúA
 

Recently uploaded

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Recently uploaded (20)

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Quixote

  • 1. Cumulus: Filesystem Backup to the Cloud Michael Vrable, Stefan Savage, and Geoffrey M. Voelker Presented by Hakim Weatherspoon
  • 2. Good „ole Days Replace your tape drives with something truly scalable
  • 3. Amazon S3 to the rescue In Spring 2006, Amazon released a new storage API: Put, Get, List, Delete Build whatever you want! Quickly
  • 4. Backing up the new way (S3) Smart Scales  - no longer our concern... Amazon's concern - all servers backup in parallel Cheap - old cost =XXXper year - new cost =YYYper year where YYY < XXX 
  • 5. Thin vs Thick Cloud • E.g. Amazons S3 vs EMC‟s MozyPro • Thin – Can change provider easier – Applications can work across providers • Thick – Better performance – Locked into a provider – Provider can go out of business
  • 6. Cumulus • Simple storage backup utility for Thin Clouds • Evaluates efficacy of cloud storage • Working prototype – http://www.cs.ucsd.edu/~mvrable/cumulus/
  • 7. Outline • Motivation/Intro • Related Work • Design • Evaluation • Thoughts and Conclusions
  • 9. Outline • Motivation/Intro • Related Work • Design – API – Segments – Snapshots – Subfileincrementals – Cleaning – restoring • Evaluation • Thoughts and Conclusions
  • 10. API • Same as S3 – Put, Get, List, Delete • Thin cloud – does not rely on integrated services – Can easily change provider and network protocols – S3, FTP, SFTP • WORM Model – Write-once, read-many – Requires writing new entirely file if changes occur – What are the cleaning overheads?
  • 11. Segments • Aggregation via Segment Goals – Avoid costs due to small files • S3 charges on per file bases • Many small files – Avoid costs in network protocols • Small files have higher latency and other overheads – Compression • inter-file similarities – Privacy • Hide file boundaries • Negative consequences? – Need an entire segment to write
  • 13. Sub-File Incrementals • Only stored changed part of files • New snapshots point to old objects when data unchanged • Byte ranges – portions of old objcts to be reused
  • 14. Segment Cleaning • Similar to a log-structured file system (LFS) • Clean based on utilization of segment, α – α = 0, no cleaning – α = 1, clean with the slightest change • Cumulus – attempts to find an equilibrium for α – Uses a different process to clean – Marks a local database as “expired” – Then, next snapshot will not refer to expired segment
  • 15. Restore • Full Restore – Download all segments for a snapshot • Partial Restore – Download snapshot descriptor, metadata, and only necessary segments • What happens if client machine dies? • How is latest snapshot descriptors identified? • What about sharing between client machines?
  • 16. Outline • Motivation/Intro • Related Work • Design • Evaluation – Performance Case Study – Monetary Case Study • Thoughts and Conclusions
  • 18. Backup over time (user trace)
  • 19. Backup w/out Segment Cleaning (user trace)
  • 20. Average Daily Storage (fileserver)
  • 21. Average Daily Upload (fileserver)
  • 22. Average Segments per Day (fileserver)
  • 23. Storage overhead for 16MB Segment (fileserver)
  • 26. Monetary Case Study • Storage: $0.15 per GB . Month • Upload: $0.10 per GB • Segment: $0.01 per 1000 files uploaded • We are charged this amount, so please be careful with your labs and projects!!!
  • 28. Costs for Backup (fileserver)
  • 29. Monetary Cost Comparison (user trace)
  • 30. Outline • Motivation/Intro • Related Work • Design • Evaluation • Thoughts and Conclusions
  • 31. Discussion • Thoughts? • Did paper make case for Thin Clouds? • Sharing between clients ignored? • What every happened to P2P?!
  • 33. Next Time • Read NFS and write review: – Design and Implementation of a Network File System, Sandberg, Goldberg, Kleiman, Walsh, and Lyon, USENIX 1985 • Do Lab 0 • Check website for updated schedule