Your SlideShare is downloading. ×
Constructing a Fault-Tolerant, Highly Available Cloud Infrastructure for your Drupal Site
Upcoming SlideShare
Loading in...5

Thanks for flagging this SlideShare!

Oops! An error has occurred.

Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Constructing a Fault-Tolerant, Highly Available Cloud Infrastructure for your Drupal Site


Published on

Published in: Technology

  • Be the first to comment

  • Be the first to like this

No Downloads
Total Views
On Slideshare
From Embeds
Number of Embeds
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

No notes for slide


  • 1. Webinar Audio Options• Audio will remain quiet until we begin at the top of the hour• Streaming Audio • Appears automatically in pop-up window • Or click Communicate : Join Audio Broadcast • Remember to unmute your computer• No Streaming Audio? • Request phone access Thank you for joining!• Technical Support The webinar will begin • US & Canada 866.229.3239 shortly. • International Support 408.435.7088
  • 2. Housekeeping• Slides and recording will be posted in next 48 hours• Submit questions via the Q&A Tab in WebEx, we’ll answer as many as we can• Try it now: tell us where you are joining from• Hashtags: #acquia #drupal
  • 3. Upcoming Webinars• Building a Common Drupal Platform for Your Organization Using Drupa• Accessible Theming in Drupal• Integrating a CDN with Acquia Cloud• Ensuring Success When Migrating Your Content to Drupal• OpenPublic & Drupal: Taking the Guesswork Out of Open Source For Government• Community Box 2.0, mehrsprachige Communities mit Commons
  • 4. Acquia is Hiring• Do you love working with Drupal?• Acquia is hiring in North America, Europe, and Australia! • Engineering / DevOps • Design • Support • Operations • Client Advisors • Sales and Marketing
  • 5. Constructing a Fault-Tolerant, HighlyAvailable cloud Infrastructure for yourDrupal siteAndrew Kenney Jess IandiorioVP of Platform Sr. Director, CloudEngineering Product MarketingDecember 12, 2012
  • 6. Creating killerwebsites is hard …
  • 7. Hosting them shouldn’tbe.
  • 8. For business-criticalsites,How do you avoid acrisis?
  • 9. Agenda• Drupal Hosting Challenges• Cloud Failure Scenarios• HA & Resiliency• Resource Challenges• Designing for Failure• Architecting & Automating failover• Testing Failure
  • 10. Drupal HostingChallenges filesystem• Drupal expects a POSIX• Drupal is not optimized for high-latency MySQL operations• Drupal is not built with partition tolerance in mind• Shortage of talent or expertise for operating Drupal in the Cloud or at scale
  • 11. Cloud Failure Scenarios• Machine loss• Service outage• Network disruption• Inaccessible/unreliable storage system• Traffic spike• Control Plane failure• Corrupt/Partial Backups
  • 12. High Availablity &Resiliency• Plan for Failure • Automate deployment & configurations• Eliminate SPOFs • Two (at least) of everything• Monitor everything • Monitor the monitors• Back up all data • Periodically test all backups• Test emergency procedures • Never assume any procedure works unless it’s periodically tested
  • 13. Resource Challenges• Cloud Hype – the cloud frees developers from needing operations staff to do their job• Cloud Reality – the cloud introduces even more instability unless you plan for failure
  • 14. Designing for Failure1. Multiple AZ hosting
  • 15. Designing for Failure1. Multiple AZ hosting2. Multiple region hosting
  • 16. Designing for Failure1. Multiple AZ hosting2. Multiple region hosting3. Shared security model
  • 17. Designing for Failure Infrastructure & Security Scanning Application Health1. Multiple AZ hosting2. Multiple region hosting3. Shared security model Acquia Security Team Acquia Operations Team4. Monitoring
  • 18. Monitoring US-West US-East Rackspace Rackspace Pingdom Pingdom Web servers External Monitoring Mon servers
  • 19. Designing for Failure1. Multiple AZ hosting2. Multiple region hosting3. Shared security model4. Monitoring5. Recovering from failure
  • 20. Failover in the Cloud• Amazon Elastic Load Balancers (ELBs) allow for failover from one Availability Zone (AZ) to another• Acquia load balancers allow for unhealthy web nodes in any given AZ to be removed from service• DNS switch allows for failover or promotion of database servers• Manual DNS switch allows for (one way) failover of a site from one region to another
  • 21. Testing failover• Failover and failback should be a scriptable process able to be routinely handled by automated systems or be operations personnel• Failover scenarios may be useful in events such application deployment or database schema changes
  • 22. Why not DIY?• Your core competency is not HA • Let your precious engineering/IT ops staff focus on what’s key to your organizations success• Most organization are not 24x7x365 • The Internet doesn’t sleep and failure can strike at any time• Don’t get stuck in the blame game • If your site goes down and you are called upon at an inconvenient time, you’ll be between the hosting provider or team, and the Drupal application team
  • 23. Why Acquia?• White glove service• 24x7 operations• Drupal expertise • Operations • Scalability • Performance• HA Offerings • Multi-zone • Multi-region
  • 24. Dev Cloud Acquia’s Continuous Integration Platform for Developers.• Intuitive development workflow• Power tools for power users• Drupal-tuned hosting infrastructure
  • 25. Managed Cloud Never let your best day become your worst.• White-glove managed service for mission- critical Drupal websites• Drupal-tuned hosting infrastructure• HA, elastic resources with multi-region failover
  • 26. Questions?• For more information visit:• Contact us: or 888.9.ACQUIA• Follow us: @acquia• Comments welcome:•• Today’s webinar recording will be posted to: