SlideShare a Scribd company logo
1 of 9
Download to read offline
Swift in
    the Small
                 OpenStack Meetup
                           June 29, 2011
        Computer History Museum
                 Mountain View, CA
          Joe Arnold - Cloudscaling
                 twitter: @joearnold
         blog: http://joearnold.com
Wednesday, June 29, 2011

- The theme of tonight is Corporate IT.
- The promise of OpenStack for Corporate IT is the ability to take advantage of
-- all the great tooling,
-- all the great services,
-- all the compatible applications that use infrastructure cloud services as a platform.
- It gives the ability to deploy cloud infrastructure in-house.

- Tonight I’ll be covering OpenStack Object Storage, Swift -- In the Small
- Raise of hands: How many have downloaded and installed either Swift?
Wednesday, June 29, 2011

- Swift is an Object Storage system that was designed for scale.
- This was one of the first clusters we deployed.
- It’s a petabyte of useable storage. It can serve a lot of users.
- For the spinning disks of aluminum, bent sheet metal, forged iron for the racks, strands of
glass, and silicon wafers, etc... A deployment like this is a great deal at $500,000 and a
million dollars.
- But not everyone needs a petabyte out of the gate.
- Even for these deployments, we have staging clusters in the range of 80-100 TB
Wednesday, June 29, 2011

- Challenge for this ‘Corporate IT’ theme is what a small-scale Object Storage (Swift) cluster
would look like.
- What does it take and what compromises are made when scaling down something designed
for large-scale.

- This, for example, is a 4-U, 36 drive system from ComputerLINK. ComputerLINK was nice
enough to provide a demo unit for the meetup tonight.
- I’ll be powering it up in a few minutes and if you’re interested, you can come over and we
can start pulling drives and watch data get replicated around.
Zone 2




                           Zone     Zone             Zone            Zone




Wednesday, June 29, 2011

Why is this a challenge? — Zones

- Swift is designed for large-scale deployments.
- The mechanisms for replication and data distribution are built on the concept that data is
distributed across isolated failure boundaries. These isolated failure boundaries are called
zones.

- Unlike RAID systems, data isn’t chopped up and distributed throughout the system.
- With Swift whole files are distributed throughout the system. Each copy of the data resides
in a different zone.

- Swift stores 3 copies of the data, so at least 4 zones are required. (in case 1 zone fails)
- Preferably 5 zones (so that 2 zones can fail).

- In the big clusters, failure boundaries can be separate racks with their own networking
components.
- In medium deployments, a physical node can represent a zone.

- For smaller deployments with fewer then 4 nodes, drives need to be grouped together to
form pseudo-failure boundaries.
- A grouping of drives is simply declared a zone.

- Here is a scheme for starting small and growing the cluster bit-by-bit (well.. terabyte-by-
terabyte).
4 Disks 4 Zones




Wednesday, June 29, 2011

- For a single storage node the minimum configuration would have 4 drives for data + 1
boot drive.
- Each disk is a zone.
- If a single drive fails, it’s data will be replicated to the remaining 3 drives in the system.
- The system would grow, 4-disks at at time (one in each zone) until the chassis was full.
Zone 1              Zone 2



                                   Zone 3              Zone 4




Wednesday, June 29, 2011

- The strategy here is to split the zones evenly across the two nodes.

- The addition of an additional node does increases availability (assuming that load
balancing is configured),
- but it does does not create a master-slave configuration. If one of the nodes is down ½ of
your zones are unavailable.

- The good news is that if one of the nodes is down (½ of your zones), data is still
accessible.
- This is because because at least one of the zones will still up on the remaining node.

- The bad news is that there is still a 1 in 2 chance that writes will fail
- because at least two of three zones need to be written to for the write to be considered
successful.
Zone 1               ⅓ Zone 4




                                    Zone 2               ⅓ Zone 4



                                    Zone 3               ⅓ Zone 4




Wednesday, June 29, 2011

- The addition of a third node further enables distribution of zones across the nodes.

- Something strange is going on here by putting whole zones in each node,
- but breaking up zone 4 into thirds and distributing across the three nodes.
- This is done to enable smoother rebalancing when going to 4 nodes.

- Again, if a single node is down, data will be available, but there will be a 1 if 5 chance that
a write would fail.
Zone 1                                Zone 2



                             Zone 3                                Zone 4




Wednesday, June 29, 2011

- The strategy of breaking up Zone 4 into thirds with 3 nodes, is to make this transition
easier.

- The cluster can be configured with zone 4 entirely on that new server,
- then the remaining zones can slowly be rebalanced to fold-in the newly vacated drives on
their node.

- Now, if a single node fails, writes will be successful as at least two zones will be available.
Wednesday, June 29, 2011

- Why small-scale Swift?
- Using OpenStack Object Storage is a private-cloud alternative to S3, CloudFiles, etc.
- This enables private cloud builders to start out with a single machine their own data center
and scale-up as their needs grow.

- Why not use RAID?
- Why not use a banana? :) It’s a different storage system, used for different purposes.
- Going with a private deployment of Object Storage gives something that looks and feels
just like Rackspace Cloud Files.
- App developers don’t need to attach a volume to use the storage system and assets can be
served directly to end users or to a CDN.

- The bottom line is that a small deployment can transition smoothly into a larger
deployment.

- The great thing about OpenStack being open-source software is that it gives us the
freedom to build and design systems however we see fit.

More Related Content

Recently uploaded

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 

Recently uploaded (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

Featured

Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

Featured (20)

Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 

OpenStack Meetup - Swift in the Small

  • 1. Swift in the Small OpenStack Meetup June 29, 2011 Computer History Museum Mountain View, CA Joe Arnold - Cloudscaling twitter: @joearnold blog: http://joearnold.com Wednesday, June 29, 2011 - The theme of tonight is Corporate IT. - The promise of OpenStack for Corporate IT is the ability to take advantage of -- all the great tooling, -- all the great services, -- all the compatible applications that use infrastructure cloud services as a platform. - It gives the ability to deploy cloud infrastructure in-house. - Tonight I’ll be covering OpenStack Object Storage, Swift -- In the Small - Raise of hands: How many have downloaded and installed either Swift?
  • 2. Wednesday, June 29, 2011 - Swift is an Object Storage system that was designed for scale. - This was one of the first clusters we deployed. - It’s a petabyte of useable storage. It can serve a lot of users. - For the spinning disks of aluminum, bent sheet metal, forged iron for the racks, strands of glass, and silicon wafers, etc... A deployment like this is a great deal at $500,000 and a million dollars. - But not everyone needs a petabyte out of the gate. - Even for these deployments, we have staging clusters in the range of 80-100 TB
  • 3. Wednesday, June 29, 2011 - Challenge for this ‘Corporate IT’ theme is what a small-scale Object Storage (Swift) cluster would look like. - What does it take and what compromises are made when scaling down something designed for large-scale. - This, for example, is a 4-U, 36 drive system from ComputerLINK. ComputerLINK was nice enough to provide a demo unit for the meetup tonight. - I’ll be powering it up in a few minutes and if you’re interested, you can come over and we can start pulling drives and watch data get replicated around.
  • 4. Zone 2 Zone Zone Zone Zone Wednesday, June 29, 2011 Why is this a challenge? — Zones - Swift is designed for large-scale deployments. - The mechanisms for replication and data distribution are built on the concept that data is distributed across isolated failure boundaries. These isolated failure boundaries are called zones. - Unlike RAID systems, data isn’t chopped up and distributed throughout the system. - With Swift whole files are distributed throughout the system. Each copy of the data resides in a different zone. - Swift stores 3 copies of the data, so at least 4 zones are required. (in case 1 zone fails) - Preferably 5 zones (so that 2 zones can fail). - In the big clusters, failure boundaries can be separate racks with their own networking components. - In medium deployments, a physical node can represent a zone. - For smaller deployments with fewer then 4 nodes, drives need to be grouped together to form pseudo-failure boundaries. - A grouping of drives is simply declared a zone. - Here is a scheme for starting small and growing the cluster bit-by-bit (well.. terabyte-by- terabyte).
  • 5. 4 Disks 4 Zones Wednesday, June 29, 2011 - For a single storage node the minimum configuration would have 4 drives for data + 1 boot drive. - Each disk is a zone. - If a single drive fails, it’s data will be replicated to the remaining 3 drives in the system. - The system would grow, 4-disks at at time (one in each zone) until the chassis was full.
  • 6. Zone 1 Zone 2 Zone 3 Zone 4 Wednesday, June 29, 2011 - The strategy here is to split the zones evenly across the two nodes. - The addition of an additional node does increases availability (assuming that load balancing is configured), - but it does does not create a master-slave configuration. If one of the nodes is down ½ of your zones are unavailable. - The good news is that if one of the nodes is down (½ of your zones), data is still accessible. - This is because because at least one of the zones will still up on the remaining node. - The bad news is that there is still a 1 in 2 chance that writes will fail - because at least two of three zones need to be written to for the write to be considered successful.
  • 7. Zone 1 ⅓ Zone 4 Zone 2 ⅓ Zone 4 Zone 3 ⅓ Zone 4 Wednesday, June 29, 2011 - The addition of a third node further enables distribution of zones across the nodes. - Something strange is going on here by putting whole zones in each node, - but breaking up zone 4 into thirds and distributing across the three nodes. - This is done to enable smoother rebalancing when going to 4 nodes. - Again, if a single node is down, data will be available, but there will be a 1 if 5 chance that a write would fail.
  • 8. Zone 1 Zone 2 Zone 3 Zone 4 Wednesday, June 29, 2011 - The strategy of breaking up Zone 4 into thirds with 3 nodes, is to make this transition easier. - The cluster can be configured with zone 4 entirely on that new server, - then the remaining zones can slowly be rebalanced to fold-in the newly vacated drives on their node. - Now, if a single node fails, writes will be successful as at least two zones will be available.
  • 9. Wednesday, June 29, 2011 - Why small-scale Swift? - Using OpenStack Object Storage is a private-cloud alternative to S3, CloudFiles, etc. - This enables private cloud builders to start out with a single machine their own data center and scale-up as their needs grow. - Why not use RAID? - Why not use a banana? :) It’s a different storage system, used for different purposes. - Going with a private deployment of Object Storage gives something that looks and feels just like Rackspace Cloud Files. - App developers don’t need to attach a volume to use the storage system and assets can be served directly to end users or to a CDN. - The bottom line is that a small deployment can transition smoothly into a larger deployment. - The great thing about OpenStack being open-source software is that it gives us the freedom to build and design systems however we see fit.