Availability, the Cloud and Everything
Joe Williams




Saturday, October 2, 2010
Me

                            • Joe Williams
                             • Infrastructure Engineer
                             • Cloudant
                             • @williamsjoe
                             • joeandmotorboat.com



• Distributed database built on CouchDB
                     • Real-time Search and Analytics
                     • Sign Up! (Free to 256MB)
                     • cloudant.com
                     • http://github.com/cloudant/bigcouch


Bias


                     • Distributed Databases (CouchDB)
                     • Amazon EC2
                     • Chef
                     • Erlang



Availability




Availability




                     • What is Availability?




Availability




Availability

        “System availability refers to the accessibility of
      system services to users. A system is available if it is
     operational for an overwhelming fraction of the time.
        Unlike reliability, availability is instantaneous.”




Availability


    “System reliability refers to the property of tolerating
    constituent component failures, for the longest time. A
          system is perfectly reliable if it never fails.”




Availability



                     • Reliability * Availability = Dependability




Availability

                     • Availability & Reliability
                            • Mean time to failure
                            • Mean time to repair
                            • Durability
                            • Fault isolation
                            • Fault tolerance
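
The first two items combine into a single availability figure. A minimal sketch, assuming the standard steady-state definition availability = MTTF / (MTTF + MTTR), which the slide itself does not state; the function names and sample numbers are illustrative only.

```python
def availability(mttf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability: the long-run fraction of time the system is up."""
    if mttf_hours < 0 or mttr_hours < 0 or mttf_hours + mttr_hours == 0:
        raise ValueError("MTTF and MTTR must be non-negative and not both zero")
    return mttf_hours / (mttf_hours + mttr_hours)


def downtime_hours_per_year(avail: float) -> float:
    """Expected downtime over a 365-day year for a given availability."""
    return (1.0 - avail) * 365 * 24


if __name__ == "__main__":
    # Hypothetical system: fails on average every 999 hours, takes 1 hour to repair.
    a = availability(mttf_hours=999.0, mttr_hours=1.0)
    print(f"availability: {a:.4f}")
    print(f"downtime per year: {downtime_hours_per_year(a):.2f} hours")
```

Under this definition, halving the repair time improves availability about as much as doubling the time between failures, which is why both appear on the list.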


Availability



                     • Uptime / Downtime
                            • Perceived
                            • Actual




Availability



                     • Probabilistic Risk Assessment
                            • Event Tree Analysis
                            • Fault Tree Analysis



                                   Apthorpe (http://www.usenix.org/events/lisa01/tech/apthorpe/apthorpe.ps)
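
Fault Tree Analysis can be sketched in a few lines. This is an illustration only, assuming independent basic events and the usual AND/OR gate formulas; the tree, the component names, and the failure probabilities are hypothetical and are not taken from the talk or from Apthorpe's paper.

```python
from math import prod


def and_gate(probs):
    """Output event occurs only if ALL independent input events occur."""
    return prod(probs)


def or_gate(probs):
    """Output event occurs if AT LEAST ONE independent input event occurs."""
    return 1.0 - prod(1.0 - p for p in probs)


# Hypothetical top event: "service is down".
# It happens if the load balancer fails OR both database replicas fail.
p_load_balancer = 0.001
p_replica = 0.01

p_both_replicas = and_gate([p_replica, p_replica])
p_service_down = or_gate([p_load_balancer, p_both_replicas])

if __name__ == "__main__":
    print(f"P(both replicas fail) = {p_both_replicas:.6f}")
    print(f"P(service down)       = {p_service_down:.6f}")
```

In this made-up tree the single load balancer dominates the top-event probability, which is the kind of result the analysis is meant to surface.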



The Cloud




The Cloud


                      “It never gets easier, you just go faster.”
                                   - Greg LeMond




The Cloud


                     • Abstraction
                     • Commoditization
                     • Homogeneous
                     • Ephemeral



The Cloud

                     • Costs
                            • Loss of Control
                            • Single Points of Failure
                            • Network Partitions / Data Locality
                            • Unreliable
                            • Performance

The Cloud


                     • Benefits
                            • API to everything
                            • Fast and Flexible Resource Mgmt
                            • “Unlimited” Resources



The Cloud



                     • Bootstrapping
                            • Time and Effort


           Adam Jacob and Ezra Zygmuntowicz (http://blip.tv/file/2285124/)




The Cloud




                     • Nodes are stateless and disposable.




The Cloud


           "Clouds are systems ... and with systems, you have to think hard and know how to deal with issues in that
         environment. The scale is so much bigger, and you don't have the physical control. But we think people should
           be optimistic about what we can do here. If we are clever about deploying cloud computing with a clear-eyed
                notion of what the risk models are, maybe we can actually save the economy through technology."

                            - "Security in the Ether" by David Talbot, MIT Technology Review, Jan/Feb 2010




What’s Next



                     • Distributed Systems
                     • Automation
                     • Data Driven Operations




Distributed Systems




                                Baran (http://www.rand.org/pubs/research_memoranda/RM3420/)




Distributed Systems




                     • RAID ain’t as redundant as it used to be.




                                  Leventhal (http://queue.acm.org/detail.cfm?id=1670144)




Distributed Systems



                     • Redundancy
                            • Duplication
                            • Distribution
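
Why duplication helps can be shown with a back-of-the-envelope calculation. A minimal sketch, assuming replicas fail independently and that one surviving replica is enough to serve requests; both are simplifying assumptions, since shared infrastructure in a cloud can correlate failures. The function name and figures are illustrative.

```python
def replicated_availability(single_availability: float, replicas: int) -> float:
    """Availability of a service that is up while at least one replica is up.

    Assumes replica failures are independent (an idealization).
    """
    if not 0.0 <= single_availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    if replicas < 1:
        raise ValueError("need at least one replica")
    return 1.0 - (1.0 - single_availability) ** replicas


if __name__ == "__main__":
    # Hypothetical node that is up 99% of the time.
    for n in (1, 2, 3):
        print(n, f"{replicated_availability(0.99, n):.6f}")
```

Distribution matters because the independence assumption only holds when the copies do not share a rack, a zone, or another single point of failure.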




Distributed Systems


                     • Alphabet Soup
                            • ACID, CAP, BASE, 2PC, MVCC
                            • Vector Clocks, Eventual Consistency
                            • Dynamo, Paxos, Chandra, Byzantine



Distributed Systems




                     • CAP == Availability




Distributed Systems


                     • Erlang
                            • Distributed
                            • Concurrent
                            • Fault Tolerant



Distributed Systems



                     • Erlang
                            • Supervision Trees




Distributed Systems



                     • Erlang
                            • Hot Code Upgrades
                            • Distributed Upgrades are HARD




Distributed Systems
                     • Future Work
                            • Erlang Supervision Trees
                            • PRA / FTA / ETA




                                    Apthorpe (http://www.usenix.org/events/lisa01/tech/apthorpe/apthorpe.ps)

Automation




Automation




                     • Optimal use of the cloud.




Automation




                     • Frequent deployment.




Automation

                     • Tools
                            • Chef
                            • Puppet
                            • Cfengine
                            • Bcfg2


Automation

                    • Erlang + Chef (as of v0.8)
                            • erl_call Provider




Data Driven Operations




Data Driven Operations


                  “What gets measured, gets managed.”
                                - Peter Drucker




Data Driven Operations




                     • Instrumentation




Data Driven Operations




                     • Logging




Data Driven Operations




                     • Visualization




Data Driven Operations




                     • Demo!




Data Driven Operations


                     • Modeling
                     • Analysis
                     • Universal Law of Computational Scalability
                     • Amdahl’s Law
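
Both laws named above fit in a few lines. A minimal sketch, assuming the commonly published forms: Amdahl's Law as speedup = 1 / ((1 - p) + p / n), and the Universal Scalability Law as C(N) = N / (1 + sigma (N - 1) + kappa N (N - 1)). The parameter names and sample values are illustrative and do not come from the talk.

```python
from math import sqrt


def amdahl_speedup(parallel_fraction: float, n: int) -> float:
    """Amdahl's Law: speedup on n processors when only part of the work parallelizes."""
    serial = 1.0 - parallel_fraction
    return 1.0 / (serial + parallel_fraction / n)


def usl_capacity(n: int, sigma: float, kappa: float) -> float:
    """Universal Scalability Law: relative capacity at load or concurrency n.

    sigma models contention (serialization), kappa models coherency (crosstalk) cost.
    """
    return n / (1.0 + sigma * (n - 1) + kappa * n * (n - 1))


def usl_peak(sigma: float, kappa: float) -> float:
    """Value of n at which USL capacity peaks (requires kappa > 0)."""
    return sqrt((1.0 - sigma) / kappa)


if __name__ == "__main__":
    # Hypothetical workload: 95% parallelizable, small coherency penalty.
    print(f"Amdahl, 8 nodes: {amdahl_speedup(0.95, 8):.2f}x")
    print(f"USL, 8 nodes:    {usl_capacity(8, sigma=0.05, kappa=0.001):.2f}x")
    print(f"USL peak near n = {usl_peak(0.05, 0.001):.1f}")
```

With kappa set to zero the USL reduces to Amdahl's Law; a non-zero kappa is what makes capacity fall again past the peak rather than merely level off.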




Data Driven Operations




                     • Modeling isn’t just for capacity planning.




                                   Montagne (http://queue.acm.org/detail.cfm?id=1862187)


The End




Questions?



                            Joe Williams - @williamsjoe





Availability, The Cloud and Everything (version 2, Surge2010)
