SlideShare a Scribd company logo
1 of 21
Outage-Proof Your Cloud
Applications
Brian Adler, Sr. Services Architect
Roberto Monge, Cloud Solutions Engineer
RightScale

December 18, 2012


                    Watch the video of this webinar
2#



Your Panel Today
Presenting
• Brian Adler, Sr. Services Architect, RightScale
• Roberto Monge, Cloud Solutions Engineer, RightScale

Q&A
• Spencer Adams, Account Manager, RightScale
• Noel Cohen, Account Manager, RightScale



            Please use the “Questions” window
                to ask questions any time!

        Cloud Management                            #rightscale
3#



Agenda
• High Availability and Disaster Recovery
   •   Terminology/Level-Setting
   •   Designing for Failure
   •   Cloud and component definitions
   •   HA and DR configurations
• Conclusions / Q&A




             Cloud Management               #rightscale
4#



Terminology


  Ability of a system to     Fault Tolerant          The process, policies
  continue operating         systems are             and procedures
  properly (perhaps at       measured by their       related to restoring
  a degraded level) if       Availability in terms   critical systems after
  one or more                of planned and          a catastrophic event
  components fails           unplanned service
                             outages for end
                             users




          Cloud Management                                            #rightscale
5#



Designing for Failure

  1   Large scale failures in     3   Need to balance cost
      the cloud are rare but do       and complexity of HA
      happen                          efforts against risks you
                                      are willing to bear

  2   Application owners are      4   Cloud infrastructure has
      ultimately responsible          made DR and HA
      for availability and            remarkably affordable
      recoverability                     •   Multi-server
                                         •   Multi-Zone
                                         •   Multi-Region
                                         •   Multi-Cloud


         Cloud Management                                   #rightscale
6#



Cloud Isolation Definitions
                                Region               Zone
  Resources                     One or more          Datacenter with
                                geographically       separate power source
                                proximate Zones
  API endpoint, control plane   Shared               Shared
  Local Area Network            Shared               Shared
  Clouds
  Amazon Web Services                  Region              Availability Zone
  Rackspace                                       Region
  Windows Azure                                   Region
  Google Cloud Platform                Region              Availability Group
  CloudStack                           Region                    Zone
  OpenStack                             Zone               Availability Zone




           Cloud Management                                                #rightscale
7#



Multi-Zone HA
                                                                                                               Consider
                                                                                                              distributed
                                                           DNS                                                  NoSQL
                                                                                                            databases with
                                          172.168.7.31                172.168.8.62
                                                                                                               the same
                US-EAST 1a
                                                                                           1
                                                                                          US-EAST 1b
                                                                                                              distribution
                                                                                                            considerations.
                       LOAD BALANCERS                                    LOAD BALANCERS
                                                                                                            Spread primary
                                                                                                              and replica
                                                                                                             nodes across
                                                                                                            multiple zones.
                                                                                                            Place as many
                                                    APP SERVERS                                             as you need for
                                                                                                                required
                                                                                         AUTOSCALE
                                                                                                               resiliency.


                             MASTER DB                             SLAVE DB

                                    REPLICATE


                                     EBS
                                                                                           SNAPSHOTS
                                                                                                                 S3

 Snapshot data volume for backups                                                        Consider local storage for additional
                                                         Place Slave databases in one
  so the database can be readily                                                             slave database to remove
                                                           or more zones for failover.
    recovered within the region.                                                          dependency on attached volume


                Cloud Management                                                                             #rightscale
8#



Multi-Region/Cloud DR Options
Availability Downtime

  99.999%            0                                      Multi-Cloud HA
                                                           (Live/Live Config)
    99.9%     < 5 Mins                              Hot DR
                                                (Least Common)
    99.5%     < 1 Hour                 Warm DR
                                    (Recommended)

      99%     > 1 Hour        Cold DR
                           (Most Common)


                                $          $$        $$$         $$$$



            Cloud Management                                         #rightscale
9#



Multi-Region Cold DR
Staged Server Configuration and generally no staged data
• Not recommended if rapid recovery is required
• Slow to replicate data to other cloud and bring database online
                                                        DNS
                                         172.168.7.31

        CHICAGO                                                                DALLAS


                   LOAD BALANCERS                             LOAD BALANCERS




                     APP SERVERS                               APP SERVERS




             MASTER DB               SLAVE DB                    SLAVE DB

                         REPLICATE


                         CBS

                   SNAPSHOTS
                                      CLOUD
                                      FILES

                  Cloud Management                                                 #rightscale
10#



Multi-Region Warm DR
Staged Server Configuration, pre-staged data and running Slave Database Server
• Generally recommended DR solution
• Minimal additional cost and allows fairly rapid recovery
                                                      DNS
                                       172.168.7.31

       CHICAGO                                                                  DALLAS


                  LOAD BALANCERS                               LOAD BALANCERS




                    APP SERVERS                                  APP SERVERS




            MASTER DB              SLAVE DB                         SLAVE DB

                       REPLICATE                            REPLICATE



                        CBS
                                                                  SNAPSHOTS
                   SNAPSHOTS
                                     CLOUD
                                     FILES

                 Cloud Management                                                   #rightscale
11#



Multi-Region Hot DR
Parallel Deployment with all servers running but all traffic going to primary
• Not recommended
• Very high additional cost to allow rapid recovery
                                                        DNS
                                         172.168.7.31

        CHICAGO                                                                   DALLAS


                   LOAD BALANCERS                                LOAD BALANCERS




                     APP SERVERS                                   APP SERVERS




             MASTER DB               SLAVE DB                         SLAVE DB

                         REPLICATE                            REPLICATE


                          CBS

                    SNAPSHOTS                                      SNAPSHOTS
                                      CLOUD
                                      FILES

                  Cloud Management                                                    #rightscale
12#



Multi-Cloud HA
Live/Live configuration. Geo-target IP services to direct traffic to regional LBs.
• Possible, but not recommended (more to follow…)
• Max additional cost and max availability, but complex to implement and manage
                                                          DNS
                                           172.168.7.31         172.168.8.62

        US-EAST                                                                                      CHICAGO


                   LOAD BALANCERS                                          LOAD BALANCERS




                     APP SERVERS                                               APP SERVERS




              MASTER DB              SLAVE DB                                   SLAVE DB

                         REPLICATE                                      REPLICATE


                          EBS

                   SNAPSHOTS                                                   SNAPSHOTS


                                      S3                                                     SWIFT


                  Cloud Management                                                                       #rightscale
13#



Multi-Cloud HA
Looks similar to Multi-Zone… but additional problems to solve as some resources
are not shared
                           You need DNS management                       Security is an issue as security
                            or a global load balancer.           DNS      groups are Region-specific.
                                                  172.168.7.31         172.168.8.62

             US-EAST                                                                                           CHICAGO


 Machine Images           LOAD BALANCERS                                          LOAD BALANCERS
 are specific to the
   cloud/region.


                            APP SERVERS                                               APP SERVERS




                       MASTER DB            SLAVE DB                                   SLAVE DB

                                REPLICATE                                      REPLICATE


                                 EBS                                                                   VOLUME

                          SNAPSHOTS                                                   SNAPSHOTS


                                             S3                                                        SWIFT


                        Cloud Management                                                                           #rightscale
14#



In the Dashboard

                                         Cost
                                     forecasting
Multi-region                            for DR
 or cloud                           environment




Multi-region
 Warm DR
  Staged
  servers




               Cloud Management   #rightscale
15#



Automating HA and DR
• Use dynamic DNS for your database servers
   • Allow app servers to use a single FQDN.
   • Use a low TTL to allow rapid failover in the case of a change in master
     database
• Automatic connection of app servers to load balancing servers
   • App servers can connect to all load balancers automatically at launch
   • No manual intervention
   • No DNS modifications
• Automated promotion of slave to master
   • Process is automated
   • Decision to run process is manual




            Cloud Management                                          #rightscale
16#



How RightScale makes it possible
MultiCloud Images
• MultiCloud Images can be launched across regions and clouds
  without modification
            ServerTemplate contains a list
      1     of MultiCloud Images (MCIs)
                                                  When the Server is
                                             2    created, a specific MCI
                                                  is chosen.
                                                                            The appropriate
                                                                        3   RightImage is used at
          MultiCloud Images
                                                                            launch.
             Cloud A, B, Image 1
             Cloud A C, Image 2
             Cloud B, Image 1        Cloud A, B, Image 1



                                                                                Cloud B
             Stability across clouds
                                                                               Image 1

                                         RightImage


                Cloud Management                                                                    #rightscale
17#



How RightScale makes it possible
ServerTemplates, Tags, and Inputs
• Automated load balancer registration and database connections
• Autoscaling across zones
• Dynamic configuration




          Cloud Management                             #rightscale
18#



DR Cost Comparison Example
                      Multi-Region               Multi-Region               Multi-Region
                      Cold DR                    Warm DR                    Hot DR
Total                 $4480 / month              $5630 / month              $8800 / month
Running               $4470 / month              $5540 / month              $8440 / month
                      3 Load Balancers (Large)   3 Load Balancers (Large)   6 Load Balancers (Large)
                      6 App Servers (XLarge)     6 App Servers (XLarge)     12 App Servers (XLarge)
                      1 Master DB (2XLarge)      1 Master DB (2XLarge)      1 Master DB (2XLarge)
                      1 Slave DB (2XLarge)       2 Slave DB (2XLarge)       2 Slave DB (2XLarge)

Staged                $0 / month                 $0 / month
                      3 Load Balancers (Large)   3 Load Balancers (Large)
                      6 App Servers (XLarge)     6 App Servers (Xlarge)
                      1 Slave DB (2XLarge)

Replication           $10 / month                $90 / month                $360 / month
                      25GB / day cross-zone      25GB / day cross-region    100GB / day cross-region




              Cloud Management                                                             #rightscale
19#



Most Common Observed Cloud Outages
• Outage of specific services in a zone
   • Degraded performance
   • E.g. EBS, ELB, RDS
• Outage of specific services in a region
   • Control plane error or cascading problems
   • E.g. EBS
• Outage of power or network in a zone
   • No connectivity
   • E.g. EC2, Azure
• Capacity availability in a region during an outage
   • Not possible to provision instances, volumes, or other services




            Cloud Management                                           #rightscale
20#



Outage-Proofing Best Practices


  Place in >1          Replicate data        Replicate data
   zone:                  across zones          across zones
   • Load balancers      Backup across        Design stateless
   • App servers          regions & clouds      apps for
   • Databases           Monitoring, alert,    resilience to
  Maintain               and automate          reboot / relaunch
   capacity to            operations to
   absorb zone or         speed up
   region failures        failover



         Cloud Management                                    #rightscale
21#



Next Steps
• Learn: Building Scalable Applications in the Cloud Whitepaper
   • http://www.rightscale.com/info_center/white-papers/building-scalable-
     applications-in-the-cloud.php
• Analyze: Deployment review of your environment
   • http://www.rightscale.com/about_us/contact_us.php
• Try: Free Edition
   • www.rightscale.com/free




                      Contact RightScale
                          (866) 720-0208
                       sales@rightscale.com
                        www.rightscale.com
            Cloud Management                                         #rightscale

More Related Content

More from RightScale

How to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaHow to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaRightScale
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...RightScale
 
Using RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsUsing RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsRightScale
 
Best Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceBest Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceRightScale
 
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreAutomating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreRightScale
 
The 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesThe 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesRightScale
 
9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage CostsRightScale
 
Serverless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMServerless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMRightScale
 
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessBest Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessRightScale
 
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMCloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMRightScale
 
2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud ReportRightScale
 
Got a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsGot a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsRightScale
 
How to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaHow to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaRightScale
 
Top 10 Cloud Trends for 2018 and Actions You Can Take Now
Top 10 Cloud Trends for 2018 and Actions You Can Take NowTop 10 Cloud Trends for 2018 and Actions You Can Take Now
Top 10 Cloud Trends for 2018 and Actions You Can Take NowRightScale
 
AWS re:Invent 2017 Recap
AWS re:Invent 2017 RecapAWS re:Invent 2017 Recap
AWS re:Invent 2017 RecapRightScale
 
Cloud Instances Price Comparison: AWS vs Azure vs Google vs IBM
Cloud Instances Price Comparison: AWS vs Azure vs Google vs IBMCloud Instances Price Comparison: AWS vs Azure vs Google vs IBM
Cloud Instances Price Comparison: AWS vs Azure vs Google vs IBMRightScale
 
Enterprise Cloud Strategy: 7 Areas You Need to Re-Think
Enterprise Cloud Strategy: 7 Areas You Need to Re-ThinkEnterprise Cloud Strategy: 7 Areas You Need to Re-Think
Enterprise Cloud Strategy: 7 Areas You Need to Re-ThinkRightScale
 
How MSPs Can Be Successful in AWS, Azure, and Google Clouds
How MSPs Can Be Successful in AWS, Azure, and Google CloudsHow MSPs Can Be Successful in AWS, Azure, and Google Clouds
How MSPs Can Be Successful in AWS, Azure, and Google CloudsRightScale
 
Orchestrating PaaS and IaaS+ with RightScale
Orchestrating PaaS and IaaS+ with RightScaleOrchestrating PaaS and IaaS+ with RightScale
Orchestrating PaaS and IaaS+ with RightScaleRightScale
 
Managing Container-as-a-Service and Docker Clusters in the Cloud with RightScale
Managing Container-as-a-Service and Docker Clusters in the Cloud with RightScaleManaging Container-as-a-Service and Docker Clusters in the Cloud with RightScale
Managing Container-as-a-Service and Docker Clusters in the Cloud with RightScaleRightScale
 

More from RightScale (20)

How to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaHow to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale Optima
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
 
Using RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsUsing RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider Tools
 
Best Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceBest Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and Compliance
 
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreAutomating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
 
The 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesThe 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for Enterprises
 
9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs
 
Serverless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMServerless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBM
 
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessBest Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
 
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMCloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
 
2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report
 
Got a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsGot a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP Helps
 
How to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaHow to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale Optima
 
Top 10 Cloud Trends for 2018 and Actions You Can Take Now
Top 10 Cloud Trends for 2018 and Actions You Can Take NowTop 10 Cloud Trends for 2018 and Actions You Can Take Now
Top 10 Cloud Trends for 2018 and Actions You Can Take Now
 
AWS re:Invent 2017 Recap
AWS re:Invent 2017 RecapAWS re:Invent 2017 Recap
AWS re:Invent 2017 Recap
 
Cloud Instances Price Comparison: AWS vs Azure vs Google vs IBM
Cloud Instances Price Comparison: AWS vs Azure vs Google vs IBMCloud Instances Price Comparison: AWS vs Azure vs Google vs IBM
Cloud Instances Price Comparison: AWS vs Azure vs Google vs IBM
 
Enterprise Cloud Strategy: 7 Areas You Need to Re-Think
Enterprise Cloud Strategy: 7 Areas You Need to Re-ThinkEnterprise Cloud Strategy: 7 Areas You Need to Re-Think
Enterprise Cloud Strategy: 7 Areas You Need to Re-Think
 
How MSPs Can Be Successful in AWS, Azure, and Google Clouds
How MSPs Can Be Successful in AWS, Azure, and Google CloudsHow MSPs Can Be Successful in AWS, Azure, and Google Clouds
How MSPs Can Be Successful in AWS, Azure, and Google Clouds
 
Orchestrating PaaS and IaaS+ with RightScale
Orchestrating PaaS and IaaS+ with RightScaleOrchestrating PaaS and IaaS+ with RightScale
Orchestrating PaaS and IaaS+ with RightScale
 
Managing Container-as-a-Service and Docker Clusters in the Cloud with RightScale
Managing Container-as-a-Service and Docker Clusters in the Cloud with RightScaleManaging Container-as-a-Service and Docker Clusters in the Cloud with RightScale
Managing Container-as-a-Service and Docker Clusters in the Cloud with RightScale
 

Recently uploaded

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 

Recently uploaded (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

Rightscale Webinar: Outage Proof Your Cloud Applications

  • 1. Outage-Proof Your Cloud Applications Brian Adler, Sr. Services Architect Roberto Monge, Cloud Solutions Engineer RightScale December 18, 2012 Watch the video of this webinar
  • 2. 2# Your Panel Today Presenting • Brian Adler, Sr. Services Architect, RightScale • Roberto Monge, Cloud Solutions Engineer, RightScale Q&A • Spencer Adams, Account Manager, RightScale • Noel Cohen, Account Manager, RightScale Please use the “Questions” window to ask questions any time! Cloud Management #rightscale
  • 3. 3# Agenda • High Availability and Disaster Recovery • Terminology/Level-Setting • Designing for Failure • Cloud and component definitions • HA and DR configurations • Conclusions / Q&A Cloud Management #rightscale
  • 4. 4# Terminology Ability of a system to Fault Tolerant The process, policies continue operating systems are and procedures properly (perhaps at measured by their related to restoring a degraded level) if Availability in terms critical systems after one or more of planned and a catastrophic event components fails unplanned service outages for end users Cloud Management #rightscale
  • 5. 5# Designing for Failure 1 Large scale failures in 3 Need to balance cost the cloud are rare but do and complexity of HA happen efforts against risks you are willing to bear 2 Application owners are 4 Cloud infrastructure has ultimately responsible made DR and HA for availability and remarkably affordable recoverability • Multi-server • Multi-Zone • Multi-Region • Multi-Cloud Cloud Management #rightscale
  • 6. 6# Cloud Isolation Definitions Region Zone Resources One or more Datacenter with geographically separate power source proximate Zones API endpoint, control plane Shared Shared Local Area Network Shared Shared Clouds Amazon Web Services Region Availability Zone Rackspace Region Windows Azure Region Google Cloud Platform Region Availability Group CloudStack Region Zone OpenStack Zone Availability Zone Cloud Management #rightscale
  • 7. 7# Multi-Zone HA Consider distributed DNS NoSQL databases with 172.168.7.31 172.168.8.62 the same US-EAST 1a 1 US-EAST 1b distribution considerations. LOAD BALANCERS LOAD BALANCERS Spread primary and replica nodes across multiple zones. Place as many APP SERVERS as you need for required AUTOSCALE resiliency. MASTER DB SLAVE DB REPLICATE EBS SNAPSHOTS S3 Snapshot data volume for backups Consider local storage for additional Place Slave databases in one so the database can be readily slave database to remove or more zones for failover. recovered within the region. dependency on attached volume Cloud Management #rightscale
  • 8. 8# Multi-Region/Cloud DR Options Availability Downtime 99.999% 0 Multi-Cloud HA (Live/Live Config) 99.9% < 5 Mins Hot DR (Least Common) 99.5% < 1 Hour Warm DR (Recommended) 99% > 1 Hour Cold DR (Most Common) $ $$ $$$ $$$$ Cloud Management #rightscale
  • 9. 9# Multi-Region Cold DR Staged Server Configuration and generally no staged data • Not recommended if rapid recovery is required • Slow to replicate data to other cloud and bring database online DNS 172.168.7.31 CHICAGO DALLAS LOAD BALANCERS LOAD BALANCERS APP SERVERS APP SERVERS MASTER DB SLAVE DB SLAVE DB REPLICATE CBS SNAPSHOTS CLOUD FILES Cloud Management #rightscale
  • 10. 10# Multi-Region Warm DR Staged Server Configuration, pre-staged data and running Slave Database Server • Generally recommended DR solution • Minimal additional cost and allows fairly rapid recovery DNS 172.168.7.31 CHICAGO DALLAS LOAD BALANCERS LOAD BALANCERS APP SERVERS APP SERVERS MASTER DB SLAVE DB SLAVE DB REPLICATE REPLICATE CBS SNAPSHOTS SNAPSHOTS CLOUD FILES Cloud Management #rightscale
  • 11. 11# Multi-Region Hot DR Parallel Deployment with all servers running but all traffic going to primary • Not recommended • Very high additional cost to allow rapid recovery DNS 172.168.7.31 CHICAGO DALLAS LOAD BALANCERS LOAD BALANCERS APP SERVERS APP SERVERS MASTER DB SLAVE DB SLAVE DB REPLICATE REPLICATE CBS SNAPSHOTS SNAPSHOTS CLOUD FILES Cloud Management #rightscale
  • 12. 12# Multi-Cloud HA Live/Live configuration. Geo-target IP services to direct traffic to regional LBs. • Possible, but not recommended (more to follow…) • Max additional cost and max availability, but complex to implement and manage DNS 172.168.7.31 172.168.8.62 US-EAST CHICAGO LOAD BALANCERS LOAD BALANCERS APP SERVERS APP SERVERS MASTER DB SLAVE DB SLAVE DB REPLICATE REPLICATE EBS SNAPSHOTS SNAPSHOTS S3 SWIFT Cloud Management #rightscale
  • 13. 13# Multi-Cloud HA Looks similar to Multi-Zone… but additional problems to solve as some resources are not shared You need DNS management Security is an issue as security or a global load balancer. DNS groups are Region-specific. 172.168.7.31 172.168.8.62 US-EAST CHICAGO Machine Images LOAD BALANCERS LOAD BALANCERS are specific to the cloud/region. APP SERVERS APP SERVERS MASTER DB SLAVE DB SLAVE DB REPLICATE REPLICATE EBS VOLUME SNAPSHOTS SNAPSHOTS S3 SWIFT Cloud Management #rightscale
  • 14. 14# In the Dashboard Cost forecasting Multi-region for DR or cloud environment Multi-region Warm DR Staged servers Cloud Management #rightscale
  • 15. 15# Automating HA and DR • Use dynamic DNS for your database servers • Allow app servers to use a single FQDN. • Use a low TTL to allow rapid failover in the case of a change in master database • Automatic connection of app servers to load balancing servers • App servers can connect to all load balancers automatically at launch • No manual intervention • No DNS modifications • Automated promotion of slave to master • Process is automated • Decision to run process is manual Cloud Management #rightscale
  • 16. 16# How RightScale makes it possible MultiCloud Images • MultiCloud Images can be launched across regions and clouds without modification ServerTemplate contains a list 1 of MultiCloud Images (MCIs) When the Server is 2 created, a specific MCI is chosen. The appropriate 3 RightImage is used at MultiCloud Images launch. Cloud A, B, Image 1 Cloud A C, Image 2 Cloud B, Image 1 Cloud A, B, Image 1 Cloud B Stability across clouds Image 1 RightImage Cloud Management #rightscale
  • 17. 17# How RightScale makes it possible ServerTemplates, Tags, and Inputs • Automated load balancer registration and database connections • Autoscaling across zones • Dynamic configuration Cloud Management #rightscale
  • 18. 18# DR Cost Comparison Example Multi-Region Multi-Region Multi-Region Cold DR Warm DR Hot DR Total $4480 / month $5630 / month $8800 / month Running $4470 / month $5540 / month $8440 / month 3 Load Balancers (Large) 3 Load Balancers (Large) 6 Load Balancers (Large) 6 App Servers (XLarge) 6 App Servers (XLarge) 12 App Servers (XLarge) 1 Master DB (2XLarge) 1 Master DB (2XLarge) 1 Master DB (2XLarge) 1 Slave DB (2XLarge) 2 Slave DB (2XLarge) 2 Slave DB (2XLarge) Staged $0 / month $0 / month 3 Load Balancers (Large) 3 Load Balancers (Large) 6 App Servers (XLarge) 6 App Servers (Xlarge) 1 Slave DB (2XLarge) Replication $10 / month $90 / month $360 / month 25GB / day cross-zone 25GB / day cross-region 100GB / day cross-region Cloud Management #rightscale
  • 19. 19# Most Common Observed Cloud Outages • Outage of specific services in a zone • Degraded performance • E.g. EBS, ELB, RDS • Outage of specific services in a region • Control plane error or cascading problems • E.g. EBS • Outage of power or network in a zone • No connectivity • E.g. EC2, Azure • Capacity availability in a region during an outage • Not possible to provision instances, volumes, or other services Cloud Management #rightscale
  • 20. 20# Outage-Proofing Best Practices Place in >1 Replicate data Replicate data zone: across zones across zones • Load balancers  Backup across Design stateless • App servers regions & clouds apps for • Databases  Monitoring, alert, resilience to Maintain and automate reboot / relaunch capacity to operations to absorb zone or speed up region failures failover Cloud Management #rightscale
  • 21. 21# Next Steps • Learn: Building Scalable Applications in the Cloud Whitepaper • http://www.rightscale.com/info_center/white-papers/building-scalable- applications-in-the-cloud.php • Analyze: Deployment review of your environment • http://www.rightscale.com/about_us/contact_us.php • Try: Free Edition • www.rightscale.com/free Contact RightScale (866) 720-0208 sales@rightscale.com www.rightscale.com Cloud Management #rightscale

Editor's Notes

  1. Cold DR(Most common... hours) Staged Server Configuration and generally no staged data. Bring up the servers and load the data to failover. Cold DR failover is typically manual.Warm DR(Recommended... &gt;hour) Staged Server Configuration, pre-staged data and running Database Slave Server. Warm DR failover is typically manual but can be automated.Hot DR(Least common... but needed if &lt;5 min) Parallel Deployment with all servers running but all traffic going to primary. Hot DR failover is normally automated.Hot HALive/Live configuration. May use Geo-target IP services to direct traffic to regional load balancers. Failover to other region if one has problems. Hot HA is normally seamlessly automated.
  2. Note: Other costs such as IOPS, volumes, other bandwidth, object storage, and snapshot storage is additional