Architectures for open and scalable clouds

Randy Bias
Randy BiasCloud Pioneer, Founding Member of OpenStack Foundation, and Technology Disruptor
Architectures for open and scalable clouds
February 14, 2012

Randy Bias, CTO & Co-founder




                               CCA - NoDerivs 3.0 Unported License - Usage OK, no modifications, full attribution
Our Perspective on Cloud Computing
      It came from the large Internet players.




                        2
A Story of Two Clouds




          3
A Story of Two Clouds




          4
Tenets of Open & Scalable Clouds



1. Avoid vendor lock-in like bubonic plague
  • See also Open Cloud Initiative (opencloudinitiative.org)

2. Simplicity scales, complexity fails
  • 10x bigger == 100x more complex
3. TCO matters; measuring ROI is critical to success
4. Security is paramount ... but different
5. Risk acceptance over risk mitigation
6. Agility & iteration over big bang



                                  5
This is a BIG Topic



• What I am covering today is patterns in:
  • Hardware and software
  • Networking, storage, and compute
• NOT covered today:
  • Cloud operations
  • Infrastructure software engineering
  • Measuring success through operational excellence
  • Security



                              6
Open Clouds
  (briefly)




     7
A Word on ‘Open’




        8
Here we go ...



• Elements:
  • Open APIs & protocols
  • Open hardware
  • Open networking
  • Open source software (OSS)
• Combined with:
  • Architectural patterns, best
    practices, & de facto standards
  • Operational excellence


                               9
Open APIs & Protocols




          10
Open Hardware




      11
Open Networking
Published Networking
      Blueprints




                              12
Open Source Software




Open Cloud OS




                          13
Open & Scalable
Cloud Patterns



       14
Threads



•   Small failure domains are less impacting
•   Loose-coupling minimizes cascade failures
•   Scale-out over scale-up with exceptions
•   More AND cheaper
•   State synchronization is dangerous (remember CAP)
•   Everything has an API
•   Automation ONLY works w/ homogeneity & modularity
•   Lowest common denominator (LCD) services (LBaaS vs F5aaS)
•   People are the number one source of failures



                                 15
Pattern:
                   Loose coupling


Synchronous, blocking
 calls mean cascading
        failures.




                               Async, non-block calls
                                  mean failure in
                                     isolation.



                          16
Pattern:
Open source software

  Excessive software
 taxation is the past.




  You can always fork.




  Black boxes create
        lock-in.

              17
Pattern:
Uptime in software - self management

Hardware fails.
Software fails.
 People fail.

                   Only software can
                    measure itself &
                  respond to failure in
                     near real-time.

                                     Applications designed
                                      for 99.999% uptime
                                       can run anywhere



                               18
Pattern:
                              Scale-out, not UP

                                                              You name them
                                                             and when they get
 Scale Up: (Virtual*)
                                                               sick, you nurse
Servers are like pets
                                                                them back to
                                                                    health

                                  garfield.company.com




     attrib: Bill Baker, Distinguished Engineer, Microsoft
     * added by yours truly ...                       19
Pattern:
                                 Scale-out, not UP

                                                                 You name them
                                                                and when they get
   Scale Up: (Virtual*)
                                                                  sick, you nurse
  Servers are like pets
                                                                   them back to
                                                                       health

                                     garfield.company.com


                                                                You number them
 Scale Out: (Virtual*)                                          and when they get
Servers are like cattle                                           sick, you shoot
                                                                        them

                                     web001.company.com
        attrib: Bill Baker, Distinguished Engineer, Microsoft
        * added by yours truly ...                       19
Pattern:
                 Buy from ODMs

ODMs operate their
businesses on 3-10%
      margins.

                   AMZN, GOOG, and
                  Facebook buy direct
                  without a middleman.

                                     Only a few enterprise
                                    vendors are pivoting to
                                           compete.




                            20
Pattern:
Less enterprise “value” in x86 servers

 Generic servers rule. Full
 stop. Nothing is better
 because nothing else is
        *generic*.




            “... a data center full of vanity
                  free servers ... more
             efficient ... less expensive to
                build and run ... “ - OCP


                              21
Pattern:
                    Flat Networking
The largest cloud operators
 all run layer-3 routed, flat
 networks with no VLANs.



  Cloud-ready apps don’t
   need or want VLANs.


  Enterprise apps can be
supported on open clouds
 using Software-defined
    Networking (SDN)



                               22
Pattern:
          Software-defined Networking (SDN)
•   x86 server is the new Linecard
•   network switch is the new ASIC              “Network Virtualization”
•   VXLAN (or NVGRE) is the new Chassis
•   SDN Controller is the new SUP Engine




                                           23
Pattern:
                 Flat Networking + SDNs
Flat + SDN co-exist                                                  Internet
 & thrive together

              VM                                            VM

                                Availability
                                  Zone
              VM                                            VM
                                                                                           VPC
                                                                     VM
                                                                                          Gateway

                                                                     Virtual L2 Network

                       1                              2
                                                                     VM             VM




            Standard       VM                    VM         VPC      Virtual Private
            Security                                      Security       Cloud
             Group                                         Group      Networking
                                 Physical
                                  Node


                                            24
Pattern:
      RAIS instead of HA pairs/clusters


• Redundant arrays of inexpensive services (RAIS)
  • Load balanced
  • No state sharing
  • On failure, connections are lost, but failures are rare
• Ridiculously simple & scalable
• Most things retry anyway
• Hardware failures are in-frequent & impact subset of traffic
  • (N-F)/N, where N = total, F = failed
• Cascade failures are unlikely and failure domains are small


                               25
Service array (RAIS) example:


Public IP                                        Backbone Routers
 Blocks
                   OSPF Route Announcements



     RAIS (NAT, LB, VPN)


                                               Cloud Access Switches
  API


                   Return Traffic (default or source
                                NAT)
Cloud
Control
 Plane                                           AZ (Spine) Switches




                                  26
Pattern:
  Lots of inexpensive 1RU Switches
           Simple spine-and-leaf flat routed network




  Rack 1    Rack 2   Rack 3




1RU: 6K-30K VMs / AZ


                              27
Pattern:
  Lots of inexpensive 1RU Switches
           Simple spine-and-leaf flat routed network




                                        Multiple   Multiple   Multiple
  Rack 1    Rack 2   Rack 3
                                        Racks2
                                         Rack      Racks2
                                                    Rack      Racks2
                                                               Rack
                                          Rack 1     Rack 1     Rack 1




1RU: 6K-30K VMs / AZ               Modular: 40K-200K VMs / AZ


                              27
Pattern:
        Direct-attached Storage (DAS)
Cloud-ready apps               DAS is the smallest failure
manage their own                 domain possible with
data replication.               reasonable storage I/O.




SAN == massive failure        SSDs will be the great
      domain.                      equalizer.

                         28
Pattern:
           Elastic Block Device Services

                          EBS/EBD is a crutch for
                           poorly written apps.


 Bigger failure domains (AWS
outage anyone?), complex, sets
      high expectations


                        Sometimes you need a crutch.
                          When you do, overbuild the
                       network, and make sure you have
                              a smart scheduler.



                                 29
Pattern:
      More Servers == More Storage I/O
   >1M writes/second, triple-
redundancy w/ Cassandra on AWS




        Linear scale-out == linear costs for performance

                               30
Pattern:
           Hypervisors are a commodity

Cloud end-users want OS
   of choice, not HVs.




                                     Level up! Managing iron is for
                                         mainframe operators.




  Hypervisor of the future is open
    source, easily modifiable, &
            extensible.


                                31
Open Cloud System
Simply Scaled        Production Ready




     randyb@cloudscaling.com
           @randybias




                32
1 of 34

Recommended

Introduction to Docker - 2017 by
Introduction to Docker - 2017Introduction to Docker - 2017
Introduction to Docker - 2017Docker, Inc.
78.2K views41 slides
Why to Cloud Native by
Why to Cloud NativeWhy to Cloud Native
Why to Cloud NativeKarthik Gaekwad
2.6K views44 slides
An Introduction To Jenkins by
An Introduction To JenkinsAn Introduction To Jenkins
An Introduction To JenkinsKnoldus Inc.
23.3K views15 slides
What Is A Docker Container? | Docker Container Tutorial For Beginners| Docker... by
What Is A Docker Container? | Docker Container Tutorial For Beginners| Docker...What Is A Docker Container? | Docker Container Tutorial For Beginners| Docker...
What Is A Docker Container? | Docker Container Tutorial For Beginners| Docker...Simplilearn
5.2K views61 slides
Service Mesh - Observability by
Service Mesh - ObservabilityService Mesh - Observability
Service Mesh - ObservabilityAraf Karsh Hamid
376 views59 slides
Containers Docker Kind Kubernetes Istio by
Containers Docker Kind Kubernetes IstioContainers Docker Kind Kubernetes Istio
Containers Docker Kind Kubernetes IstioAraf Karsh Hamid
348 views261 slides

More Related Content

What's hot

OpenStack Introduction by
OpenStack IntroductionOpenStack Introduction
OpenStack Introductionopenstackindia
49.5K views32 slides
VMware Integrated OpenStack by
VMware Integrated OpenStackVMware Integrated OpenStack
VMware Integrated OpenStackMarketingArrowECS_CZ
2K views16 slides
Anatomy of a Continuous Integration and Delivery (CICD) Pipeline by
Anatomy of a Continuous Integration and Delivery (CICD) PipelineAnatomy of a Continuous Integration and Delivery (CICD) Pipeline
Anatomy of a Continuous Integration and Delivery (CICD) PipelineRobert McDermott
80.4K views20 slides
OpenStack Architecture by
OpenStack ArchitectureOpenStack Architecture
OpenStack ArchitectureMirantis
88.5K views88 slides
Modern big data and machine learning in the era of cloud, docker and kubernetes by
Modern big data and machine learning in the era of cloud, docker and kubernetesModern big data and machine learning in the era of cloud, docker and kubernetes
Modern big data and machine learning in the era of cloud, docker and kubernetesSlim Baltagi
3.9K views26 slides
Docker Introduction by
Docker IntroductionDocker Introduction
Docker IntroductionPeng Xiao
1.6K views48 slides

What's hot(20)

OpenStack Introduction by openstackindia
OpenStack IntroductionOpenStack Introduction
OpenStack Introduction
openstackindia49.5K views
Anatomy of a Continuous Integration and Delivery (CICD) Pipeline by Robert McDermott
Anatomy of a Continuous Integration and Delivery (CICD) PipelineAnatomy of a Continuous Integration and Delivery (CICD) Pipeline
Anatomy of a Continuous Integration and Delivery (CICD) Pipeline
Robert McDermott80.4K views
OpenStack Architecture by Mirantis
OpenStack ArchitectureOpenStack Architecture
OpenStack Architecture
Mirantis88.5K views
Modern big data and machine learning in the era of cloud, docker and kubernetes by Slim Baltagi
Modern big data and machine learning in the era of cloud, docker and kubernetesModern big data and machine learning in the era of cloud, docker and kubernetes
Modern big data and machine learning in the era of cloud, docker and kubernetes
Slim Baltagi3.9K views
Docker Introduction by Peng Xiao
Docker IntroductionDocker Introduction
Docker Introduction
Peng Xiao1.6K views
What is Cloud Computing with Amazon Web Services? by Amazon Web Services
What is Cloud Computing with Amazon Web Services?What is Cloud Computing with Amazon Web Services?
What is Cloud Computing with Amazon Web Services?
Amazon Web Services25.8K views
Application Monitoring using Datadog by Mukta Aphale
Application Monitoring using DatadogApplication Monitoring using Datadog
Application Monitoring using Datadog
Mukta Aphale3.8K views
Docker introduction by dotCloud
Docker introductionDocker introduction
Docker introduction
dotCloud455.8K views
OpenStack Architecture and Use Cases by Jalal Mostafa
OpenStack Architecture and Use CasesOpenStack Architecture and Use Cases
OpenStack Architecture and Use Cases
Jalal Mostafa7.3K views
Introduction to docker by Instruqt
Introduction to dockerIntroduction to docker
Introduction to docker
Instruqt664 views
Présentation DEVOPS.pptx by boulonvert
Présentation DEVOPS.pptxPrésentation DEVOPS.pptx
Présentation DEVOPS.pptx
boulonvert276 views
Docker 101: Introduction to Docker by Docker, Inc.
Docker 101: Introduction to DockerDocker 101: Introduction to Docker
Docker 101: Introduction to Docker
Docker, Inc.71.2K views
Docker 101 : Introduction to Docker and Containers by Yajushi Srivastava
Docker 101 : Introduction to Docker and ContainersDocker 101 : Introduction to Docker and Containers
Docker 101 : Introduction to Docker and Containers
Yajushi Srivastava1.9K views
Kubernetes introduction by DAEBUM LEE
Kubernetes introductionKubernetes introduction
Kubernetes introduction
DAEBUM LEE3.9K views

Viewers also liked

Leaders in the Cloud: Identifying Cloud Business Value for Customers by
Leaders in the Cloud: Identifying Cloud Business Value for CustomersLeaders in the Cloud: Identifying Cloud Business Value for Customers
Leaders in the Cloud: Identifying Cloud Business Value for CustomersOpSource
7.1K views20 slides
Getting Started with Amazon CloudSearch by
Getting Started with Amazon CloudSearchGetting Started with Amazon CloudSearch
Getting Started with Amazon CloudSearchAmazon Web Services
15.7K views37 slides
Cloud computing What Why How by
Cloud computing What Why HowCloud computing What Why How
Cloud computing What Why HowAsian Institute of Technology (AIT)
28.7K views41 slides
Cloud Computing and Enterprise Architecture by
Cloud Computing and Enterprise ArchitectureCloud Computing and Enterprise Architecture
Cloud Computing and Enterprise ArchitectureDavid Linthicum
11K views15 slides
Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare) by
Cloud Computing Without The Hype   An Executive Guide (1.00 Slideshare)Cloud Computing Without The Hype   An Executive Guide (1.00 Slideshare)
Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare)Lustratus REPAMA
7.6K views20 slides
Open source and standards - unleashing the potential for innovation of cloud ... by
Open source and standards - unleashing the potential for innovation of cloud ...Open source and standards - unleashing the potential for innovation of cloud ...
Open source and standards - unleashing the potential for innovation of cloud ...Ignacio M. Llorente
6.5K views15 slides

Viewers also liked(20)

Leaders in the Cloud: Identifying Cloud Business Value for Customers by OpSource
Leaders in the Cloud: Identifying Cloud Business Value for CustomersLeaders in the Cloud: Identifying Cloud Business Value for Customers
Leaders in the Cloud: Identifying Cloud Business Value for Customers
OpSource7.1K views
Cloud Computing and Enterprise Architecture by David Linthicum
Cloud Computing and Enterprise ArchitectureCloud Computing and Enterprise Architecture
Cloud Computing and Enterprise Architecture
David Linthicum11K views
Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare) by Lustratus REPAMA
Cloud Computing Without The Hype   An Executive Guide (1.00 Slideshare)Cloud Computing Without The Hype   An Executive Guide (1.00 Slideshare)
Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare)
Lustratus REPAMA7.6K views
Open source and standards - unleashing the potential for innovation of cloud ... by Ignacio M. Llorente
Open source and standards - unleashing the potential for innovation of cloud ...Open source and standards - unleashing the potential for innovation of cloud ...
Open source and standards - unleashing the potential for innovation of cloud ...
Ignacio M. Llorente6.5K views
ClientSummit2010_CloudWorkshop by Razorfish
ClientSummit2010_CloudWorkshopClientSummit2010_CloudWorkshop
ClientSummit2010_CloudWorkshop
Razorfish21.8K views
Cloud is such stuff as dreams are made on by Patrick Chanezon
Cloud is such stuff as dreams are made onCloud is such stuff as dreams are made on
Cloud is such stuff as dreams are made on
Patrick Chanezon58.6K views
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat... by Amazon Web Services
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
AWS Partner Presentation - PetaByte Scale Computing on Amazon EC2 with BigDat...
Amazon Web Services6.8K views
Running Microsoft SharePoint On AWS - Smartronix and AWS - Webinar by Amazon Web Services
Running Microsoft SharePoint On AWS - Smartronix and AWS - WebinarRunning Microsoft SharePoint On AWS - Smartronix and AWS - Webinar
Running Microsoft SharePoint On AWS - Smartronix and AWS - Webinar
Journey Through the AWS Cloud; Development and Test by Amazon Web Services
Journey Through the AWS Cloud; Development and TestJourney Through the AWS Cloud; Development and Test
Journey Through the AWS Cloud; Development and Test
Amazon Web Services8.9K views
Hybrid Customer Insight - Data Collection and Analysis from On-premise and in... by LicensingLive! - SafeNet
Hybrid Customer Insight - Data Collection and Analysis from On-premise and in...Hybrid Customer Insight - Data Collection and Analysis from On-premise and in...
Hybrid Customer Insight - Data Collection and Analysis from On-premise and in...
Google App Engine for Business 101 by Chris Schalk
Google App Engine for Business 101Google App Engine for Business 101
Google App Engine for Business 101
Chris Schalk5.5K views
AWS for Media: Content in the Cloud, Miles Ward (Amazon Web Services) and Bha... by Amazon Web Services
AWS for Media: Content in the Cloud, Miles Ward (Amazon Web Services) and Bha...AWS for Media: Content in the Cloud, Miles Ward (Amazon Web Services) and Bha...
AWS for Media: Content in the Cloud, Miles Ward (Amazon Web Services) and Bha...
Amazon Web Services10.4K views
Enterprise 2.0 Summit 2012 Closing Keynote - Next-Generation Ecosystems And i... by Dion Hinchcliffe
Enterprise 2.0 Summit 2012 Closing Keynote - Next-Generation Ecosystems And i...Enterprise 2.0 Summit 2012 Closing Keynote - Next-Generation Ecosystems And i...
Enterprise 2.0 Summit 2012 Closing Keynote - Next-Generation Ecosystems And i...
Dion Hinchcliffe11.7K views
Getting an open systems cloud strategy right the first time linthicm by David Linthicum
Getting an open systems cloud strategy right the first time linthicmGetting an open systems cloud strategy right the first time linthicm
Getting an open systems cloud strategy right the first time linthicm
David Linthicum6.3K views
Cloud Computing Integration Introduction by toryharis
Cloud Computing Integration IntroductionCloud Computing Integration Introduction
Cloud Computing Integration Introduction
toryharis13.6K views

Similar to Architectures for open and scalable clouds

Vr storm cips_03nov2010 by
Vr storm cips_03nov2010Vr storm cips_03nov2010
Vr storm cips_03nov2010National Research Council Canada
233 views27 slides
Cloud deep-dive0212 by
Cloud deep-dive0212Cloud deep-dive0212
Cloud deep-dive0212Accenture
1.7K views19 slides
Cloud Computing by
Cloud ComputingCloud Computing
Cloud ComputingMark Frydenberg
7.6K views32 slides
20090911 virtualizationandcloud by
20090911 virtualizationandcloud20090911 virtualizationandcloud
20090911 virtualizationandcloudDebabrata Debnath
189 views33 slides
Achieving scalability & speed with IaaS by
Achieving scalability & speed with IaaSAchieving scalability & speed with IaaS
Achieving scalability & speed with IaaSIBM Software India
434 views27 slides
Linux Everywhere? Matching the Workload to the Computer by
Linux Everywhere? Matching the Workload to the ComputerLinux Everywhere? Matching the Workload to the Computer
Linux Everywhere? Matching the Workload to the ComputerRobert Sutor
1.2K views24 slides

Similar to Architectures for open and scalable clouds(20)

Cloud deep-dive0212 by Accenture
Cloud deep-dive0212Cloud deep-dive0212
Cloud deep-dive0212
Accenture1.7K views
Linux Everywhere? Matching the Workload to the Computer by Robert Sutor
Linux Everywhere? Matching the Workload to the ComputerLinux Everywhere? Matching the Workload to the Computer
Linux Everywhere? Matching the Workload to the Computer
Robert Sutor1.2K views
Cloud Computing For Intelligence And Defense V6 by GovCloud Network
Cloud Computing For Intelligence And Defense V6Cloud Computing For Intelligence And Defense V6
Cloud Computing For Intelligence And Defense V6
GovCloud Network981 views
Virtual Insight Linked In Wi 2003 by bjhutt
Virtual Insight Linked In Wi 2003Virtual Insight Linked In Wi 2003
Virtual Insight Linked In Wi 2003
bjhutt245 views
Virtualizing & Mobility - Transforming the way governments communicate - Mitel by KBIZEAU
Virtualizing & Mobility - Transforming the way governments communicate - MitelVirtualizing & Mobility - Transforming the way governments communicate - Mitel
Virtualizing & Mobility - Transforming the way governments communicate - Mitel
KBIZEAU418 views
2012 open storage summit keynote by Randy Bias
2012 open storage summit   keynote2012 open storage summit   keynote
2012 open storage summit keynote
Randy Bias1.9K views
20090911 virtualizationandcloud by Meenal Joshi
20090911 virtualizationandcloud20090911 virtualizationandcloud
20090911 virtualizationandcloud
Meenal Joshi131 views
Pets vs. Cattle: The Elastic Cloud Story by Randy Bias
Pets vs. Cattle: The Elastic Cloud StoryPets vs. Cattle: The Elastic Cloud Story
Pets vs. Cattle: The Elastic Cloud Story
Randy Bias64.5K views
AbiCloud Webinar 1.0 by Abiquo, Inc.
AbiCloud Webinar 1.0AbiCloud Webinar 1.0
AbiCloud Webinar 1.0
Abiquo, Inc.950 views
eNovance Make Your Cloud by eNovance
eNovance Make Your CloudeNovance Make Your Cloud
eNovance Make Your Cloud
eNovance937 views

More from Randy Bias

Services are the New Cloud Platform (Services-as-a-Platform) by
Services are the New Cloud Platform (Services-as-a-Platform)Services are the New Cloud Platform (Services-as-a-Platform)
Services are the New Cloud Platform (Services-as-a-Platform)Randy Bias
3.4K views16 slides
Rebooting the OpenContrail Community by
Rebooting the OpenContrail CommunityRebooting the OpenContrail Community
Rebooting the OpenContrail CommunityRandy Bias
1.5K views13 slides
The History of Pets vs. Cattle ... And Using It Properly by
The History of Pets vs. Cattle ... And Using It ProperlyThe History of Pets vs. Cattle ... And Using It Properly
The History of Pets vs. Cattle ... And Using It ProperlyRandy Bias
14.9K views24 slides
State of the Stack v4 - OpenStack in All It's Glory by
State of the Stack v4 - OpenStack in All It's GloryState of the Stack v4 - OpenStack in All It's Glory
State of the Stack v4 - OpenStack in All It's GloryRandy Bias
20.3K views51 slides
Connect Expo 2015 - Australia - Bringing OpenStack into the Enterprise by
Connect Expo 2015 - Australia - Bringing OpenStack into the EnterpriseConnect Expo 2015 - Australia - Bringing OpenStack into the Enterprise
Connect Expo 2015 - Australia - Bringing OpenStack into the EnterpriseRandy Bias
2.6K views13 slides
The Cloud Revolution - Philippines Cloud Summit by
The Cloud Revolution - Philippines Cloud SummitThe Cloud Revolution - Philippines Cloud Summit
The Cloud Revolution - Philippines Cloud SummitRandy Bias
5.6K views51 slides

More from Randy Bias(20)

Services are the New Cloud Platform (Services-as-a-Platform) by Randy Bias
Services are the New Cloud Platform (Services-as-a-Platform)Services are the New Cloud Platform (Services-as-a-Platform)
Services are the New Cloud Platform (Services-as-a-Platform)
Randy Bias3.4K views
Rebooting the OpenContrail Community by Randy Bias
Rebooting the OpenContrail CommunityRebooting the OpenContrail Community
Rebooting the OpenContrail Community
Randy Bias1.5K views
The History of Pets vs. Cattle ... And Using It Properly by Randy Bias
The History of Pets vs. Cattle ... And Using It ProperlyThe History of Pets vs. Cattle ... And Using It Properly
The History of Pets vs. Cattle ... And Using It Properly
Randy Bias14.9K views
State of the Stack v4 - OpenStack in All It's Glory by Randy Bias
State of the Stack v4 - OpenStack in All It's GloryState of the Stack v4 - OpenStack in All It's Glory
State of the Stack v4 - OpenStack in All It's Glory
Randy Bias20.3K views
Connect Expo 2015 - Australia - Bringing OpenStack into the Enterprise by Randy Bias
Connect Expo 2015 - Australia - Bringing OpenStack into the EnterpriseConnect Expo 2015 - Australia - Bringing OpenStack into the Enterprise
Connect Expo 2015 - Australia - Bringing OpenStack into the Enterprise
Randy Bias2.6K views
The Cloud Revolution - Philippines Cloud Summit by Randy Bias
The Cloud Revolution - Philippines Cloud SummitThe Cloud Revolution - Philippines Cloud Summit
The Cloud Revolution - Philippines Cloud Summit
Randy Bias5.6K views
The Lie of a Benevolent Dictator; the Truth of a Working Democratic Meritocracy by Randy Bias
The Lie of a Benevolent Dictator; the Truth of a Working Democratic MeritocracyThe Lie of a Benevolent Dictator; the Truth of a Working Democratic Meritocracy
The Lie of a Benevolent Dictator; the Truth of a Working Democratic Meritocracy
Randy Bias2K views
OpenStack Architected Like AWS (and GCP) by Randy Bias
OpenStack Architected Like AWS (and GCP)OpenStack Architected Like AWS (and GCP)
OpenStack Architected Like AWS (and GCP)
Randy Bias9.6K views
OpenStack Scale-out Networking Architecture by Randy Bias
OpenStack Scale-out Networking ArchitectureOpenStack Scale-out Networking Architecture
OpenStack Scale-out Networking Architecture
Randy Bias9.7K views
SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013 by Randy Bias
SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013
SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013
Randy Bias2.4K views
AWS Repatriation: Bring Your Apps Back by Randy Bias
AWS Repatriation: Bring Your Apps BackAWS Repatriation: Bring Your Apps Back
AWS Repatriation: Bring Your Apps Back
Randy Bias6.2K views
State of the Stack v2 by Randy Bias
State of the Stack v2State of the Stack v2
State of the Stack v2
Randy Bias10.5K views
Networking is NOT Free: Lessons in Network Design by Randy Bias
Networking is NOT Free: Lessons in Network DesignNetworking is NOT Free: Lessons in Network Design
Networking is NOT Free: Lessons in Network Design
Randy Bias6.2K views
Scale-Out Block Storage by Randy Bias
Scale-Out Block StorageScale-Out Block Storage
Scale-Out Block Storage
Randy Bias5.8K views
State of the Stack April 2013 by Randy Bias
State of the Stack April 2013State of the Stack April 2013
State of the Stack April 2013
Randy Bias73.8K views
Open Cloud System Networking Vision by Randy Bias
Open Cloud System Networking VisionOpen Cloud System Networking Vision
Open Cloud System Networking Vision
Randy Bias2.4K views
OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster" by Randy Bias
OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster"OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster"
OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster"
Randy Bias3.8K views
OpenStack Summit :: Profiling the Nova Scheduler by Randy Bias
OpenStack Summit :: Profiling the Nova SchedulerOpenStack Summit :: Profiling the Nova Scheduler
OpenStack Summit :: Profiling the Nova Scheduler
Randy Bias3.7K views
OpenStack Summit :: Pimp My Cloud by Randy Bias
OpenStack Summit :: Pimp My CloudOpenStack Summit :: Pimp My Cloud
OpenStack Summit :: Pimp My Cloud
Randy Bias1.4K views
Distributed RPC in Nova with ZeroMQ by Randy Bias
Distributed RPC in Nova with ZeroMQDistributed RPC in Nova with ZeroMQ
Distributed RPC in Nova with ZeroMQ
Randy Bias5.3K views

Recently uploaded

Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava... by
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...ShapeBlue
145 views17 slides
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...ShapeBlue
119 views17 slides
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...ShapeBlue
139 views29 slides
Business Analyst Series 2023 - Week 4 Session 7 by
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7DianaGray10
139 views31 slides
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineShapeBlue
221 views19 slides
The Power of Heat Decarbonisation Plans in the Built Environment by
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built EnvironmentIES VE
79 views20 slides

Recently uploaded(20)

Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava... by ShapeBlue
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
ShapeBlue145 views
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue119 views
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue139 views
Business Analyst Series 2023 - Week 4 Session 7 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7
DianaGray10139 views
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue221 views
The Power of Heat Decarbonisation Plans in the Built Environment by IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE79 views
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O... by ShapeBlue
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
ShapeBlue132 views
DRBD Deep Dive - Philipp Reisner - LINBIT by ShapeBlue
DRBD Deep Dive - Philipp Reisner - LINBITDRBD Deep Dive - Philipp Reisner - LINBIT
DRBD Deep Dive - Philipp Reisner - LINBIT
ShapeBlue180 views
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ by ShapeBlue
Confidence in CloudStack - Aron Wagner, Nathan Gleason - AmericConfidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
ShapeBlue130 views
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker54 views
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading... by The Digital Insurer
Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading...
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
ShapeBlue138 views
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue by ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue147 views
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... by ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue126 views
NTGapps NTG LowCode Platform by Mustafa Kuğu
NTGapps NTG LowCode Platform NTGapps NTG LowCode Platform
NTGapps NTG LowCode Platform
Mustafa Kuğu423 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue198 views
The Role of Patterns in the Era of Large Language Models by Yunyao Li
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language Models
Yunyao Li85 views
Why and How CloudStack at weSystems - Stephan Bienek - weSystems by ShapeBlue
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystems
ShapeBlue238 views
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue by ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlueMigrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
ShapeBlue218 views

Architectures for open and scalable clouds

  • 1. Architectures for open and scalable clouds February 14, 2012 Randy Bias, CTO & Co-founder CCA - NoDerivs 3.0 Unported License - Usage OK, no modifications, full attribution
  • 2. Our Perspective on Cloud Computing It came from the large Internet players. 2
  • 3. A Story of Two Clouds 3
  • 4. A Story of Two Clouds 4
  • 5. Tenets of Open & Scalable Clouds 1. Avoid vendor lock-in like bubonic plague • See also Open Cloud Initiative (opencloudinitiative.org) 2. Simplicity scales, complexity fails • 10x bigger == 100x more complex 3. TCO matters; measuring ROI is critical to success 4. Security is paramount ... but different 5. Risk acceptance over risk mitigation 6. Agility & iteration over big bang 5
  • 6. This is a BIG Topic • What I am covering today is patterns in: • Hardware and software • Networking, storage, and compute • NOT covered today: • Cloud operations • Infrastructure software engineering • Measuring success through operational excellence • Security 6
  • 7. Open Clouds (briefly) 7
  • 8. A Word on ‘Open’ 8
  • 9. Here we go ... • Elements: • Open APIs & protocols • Open hardware • Open networking • Open source software (OSS) • Combined with: • Architectural patterns, best practices, & de facto standards • Operational excellence 9
  • 10. Open APIs & Protocols 10
  • 14. Open & Scalable Cloud Patterns 14
  • 15. Threads • Small failure domains are less impacting • Loose-coupling minimizes cascade failures • Scale-out over scale-up with exceptions • More AND cheaper • State synchronization is dangerous (remember CAP) • Everything has an API • Automation ONLY works w/ homogeneity & modularity • Lowest common denominator (LCD) services (LBaaS vs F5aaS) • People are the number one source of failures 15
  • 16. Pattern: Loose coupling Synchronous, blocking calls mean cascading failures. Async, non-block calls mean failure in isolation. 16
  • 17. Pattern: Open source software Excessive software taxation is the past. You can always fork. Black boxes create lock-in. 17
  • 18. Pattern: Uptime in software - self management Hardware fails. Software fails. People fail. Only software can measure itself & respond to failure in near real-time. Applications designed for 99.999% uptime can run anywhere 18
  • 19. Pattern: Scale-out, not UP You name them and when they get Scale Up: (Virtual*) sick, you nurse Servers are like pets them back to health garfield.company.com attrib: Bill Baker, Distinguished Engineer, Microsoft * added by yours truly ... 19
  • 20. Pattern: Scale-out, not UP You name them and when they get Scale Up: (Virtual*) sick, you nurse Servers are like pets them back to health garfield.company.com You number them Scale Out: (Virtual*) and when they get Servers are like cattle sick, you shoot them web001.company.com attrib: Bill Baker, Distinguished Engineer, Microsoft * added by yours truly ... 19
  • 21. Pattern: Buy from ODMs ODMs operate their businesses on 3-10% margins. AMZN, GOOG, and Facebook buy direct without a middleman. Only a few enterprise vendors are pivoting to compete. 20
  • 22. Pattern: Less enterprise “value” in x86 servers Generic servers rule. Full stop. Nothing is better because nothing else is *generic*. “... a data center full of vanity free servers ... more efficient ... less expensive to build and run ... “ - OCP 21
  • 23. Pattern: Flat Networking The largest cloud operators all run layer-3 routed, flat networks with no VLANs. Cloud-ready apps don’t need or want VLANs. Enterprise apps can be supported on open clouds using Software-defined Networking (SDN) 22
  • 24. Pattern: Software-defined Networking (SDN) • x86 server is the new Linecard • network switch is the new ASIC “Network Virtualization” • VXLAN (or NVGRE) is the new Chassis • SDN Controller is the new SUP Engine 23
  • 25. Pattern: Flat Networking + SDNs Flat + SDN co-exist Internet & thrive together VM VM Availability Zone VM VM VPC VM Gateway Virtual L2 Network 1 2 VM VM Standard VM VM VPC Virtual Private Security Security Cloud Group Group Networking Physical Node 24
  • 26. Pattern: RAIS instead of HA pairs/clusters • Redundant arrays of inexpensive services (RAIS) • Load balanced • No state sharing • On failure, connections are lost, but failures are rare • Ridiculously simple & scalable • Most things retry anyway • Hardware failures are in-frequent & impact subset of traffic • (N-F)/N, where N = total, F = failed • Cascade failures are unlikely and failure domains are small 25
  • 27. Service array (RAIS) example: Public IP Backbone Routers Blocks OSPF Route Announcements RAIS (NAT, LB, VPN) Cloud Access Switches API Return Traffic (default or source NAT) Cloud Control Plane AZ (Spine) Switches 26
  • 28. Pattern: Lots of inexpensive 1RU Switches Simple spine-and-leaf flat routed network Rack 1 Rack 2 Rack 3 1RU: 6K-30K VMs / AZ 27
  • 29. Pattern: Lots of inexpensive 1RU Switches Simple spine-and-leaf flat routed network Multiple Multiple Multiple Rack 1 Rack 2 Rack 3 Racks2 Rack Racks2 Rack Racks2 Rack Rack 1 Rack 1 Rack 1 1RU: 6K-30K VMs / AZ Modular: 40K-200K VMs / AZ 27
  • 30. Pattern: Direct-attached Storage (DAS) Cloud-ready apps DAS is the smallest failure manage their own domain possible with data replication. reasonable storage I/O. SAN == massive failure SSDs will be the great domain. equalizer. 28
  • 31. Pattern: Elastic Block Device Services EBS/EBD is a crutch for poorly written apps. Bigger failure domains (AWS outage anyone?), complex, sets high expectations Sometimes you need a crutch. When you do, overbuild the network, and make sure you have a smart scheduler. 29
  • 32. Pattern: More Servers == More Storage I/O >1M writes/second, triple- redundancy w/ Cassandra on AWS Linear scale-out == linear costs for performance 30
  • 33. Pattern: Hypervisors are a commodity Cloud end-users want OS of choice, not HVs. Level up! Managing iron is for mainframe operators. Hypervisor of the future is open source, easily modifiable, & extensible. 31
  • 34. Open Cloud System Simply Scaled Production Ready randyb@cloudscaling.com @randybias 32