
Oops-Less Operation

Building, deploying and operating application systems for high scale and failure tolerance is the supreme discipline of software engineering. While Continuous Integration (CI), and often Continuous Delivery (CD), have become part of commonly used build pipelines, monitoring and observability are still often an afterthought or configured manually. To keep up with containers being started and stopped for version upgrades, for scaling up and down, or to mitigate failures, monitoring needs to react automatically to infrastructure changes and find issues before users are impacted. People today expect “Oops-Less Operation”, or do you want your bank to be offline?

Published in: Software

  1. Oops-Less Operation
  2. @noctarius2k @instanahq Software Development Today
  3. Software Development Today • Modular Applications / Microservices?
  4. Software Development Today • Modular Applications / Microservices? • Continuous Integration?
  5. Software Development Today • Modular Applications / Microservices? • Continuous Integration? • Continuous Deployment?
  6. Who’s That Dude? • Chris Engelbert • Senior Developer Advocate @ Instana • Java-Passionate (10+ years) • Go (2 years) • Performance • Garbage Collection • Benchmark Fairytales
  7. Who’s That Dude?
  8. Who’s That Dude? • Codes Java • Prefers Kotlin • Adores TypeScript
  9. Who’s That Dude? • Codes Java • Prefers Kotlin • Kinda likes Go • Adores TypeScript
  10. Who’s That Dude? • Codes Java • Prefers Kotlin • Forced to use JS • Kinda likes Go • Adores TypeScript • Totally hates Python
  11. Who’s That Dude? • Codes Java • Prefers Kotlin • Forced to use JS • Kinda likes Go • Adores TypeScript • Totally hates Python • LOVES BEER! (Dah, German)
  12. Resiliency
  13. Resiliency
  14. Resiliency
  15. Resiliency
  16. Resiliency
  17. Resiliency
  18. Resiliency
  19. Resiliency
  20. Resilient? To What?
  21. Resilient? To What? • Power Outage
  22. Resilient? To What? • Hardware Failure
  23. Resilient? To What? • Network Failure
  24. Resilient? To What? • Human Error
  25. Resilient? To What? • Software Bugs
  26. Resilient? To What? • What Could Possibly Go Wrong™?
  27. The Stages of Resiliency
  28. The Stages of Resiliency • Cross-Cutting Concern
  29. The Stages of Resiliency • Cross-Cutting Concern
  30. The Stages of Resiliency • Cross-Cutting Concern
  31. The Stages of Resiliency • Cross-Cutting Concern
  32. Some Basic Rules
  33. Some Basic Rules • Rule #1
  34. Some Basic Rules • Rule #2: No Single Point of Failure
  35. Some Basic Rules • Rule #3: Embrace The Failure
  36. Some Basic Rules • Rule #4: Have a Beer the second it works
  37. Some Basic Rules • Rule #4: Have a Beer the second it works • You Never Know How Long It’ll Work
  38. Developers, Developers, Developers
  39. Developers, Developers, Developers
  40. Developers, Developers, Developers • Time Budget (250 ms) • Backend #1 • Backend #2 • Timeout (100 ms) • Timeout (100 ms) • Failure Response
  41. Developers, Developers, Developers • Back-off Algorithm
  42. Developers, Developers, Developers • Immutability
  43. Developers, Developers, Developers • Idempotency
  44. DevOps and Operations
  45. DevOps and Operations • https://avinetworks.com/glossary/load-balancer/
  46. DevOps and Operations
  47. DevOps and Operations
  48. DevOps and Operations
  49. Infrastructure and Cloud
  50. Infrastructure and Cloud
  51. Infrastructure and Cloud
  52. Infrastructure and Cloud • https://cloudacademy.com/blog/aws-global-infrastructure/
  53. Infrastructure and Cloud • https://avinetworks.com/glossary/multi-cloud/
  54. Infrastructure and Cloud
  55. Infrastructure and Cloud • Abstraction Layers bring Complexity • Physical Host Hypervisor Kubernetes Docker SDN Sidecar Container (label stack shown three times)
  56. Sooooooo much more!
     • https://medium.com/@adhorn/patterns-for-resilient-architecture-part-1-d3b60cd8d2b6
     • https://blog.serverdensity.com/service-resilience/
     • https://dzone.com/articles/libraries-for-microservices-development
     • https://blog.giantswarm.io/reliability-not-enough-resilient-applications-containerized-microservices/
     • https://www.pushtechnology.com/blog/building-resilient-applications/
     • https://cabforward.com/the-difference-between-reliable-and-resilient-software/
     • https://developers.redhat.com/blog/2017/05/16/it-takes-more-than-a-circuit-breaker-to-create-a-resilient-application/
     • https://www.cio.com/article/2388239/ensure-cloud-application-resilience-the-netflix-way.html
     • https://computing.llnl.gov/projects/application-level-resilience
     • https://jaxenter.com/need-resilient-software-design-115055.html
  57. Questions?
  58. 😊 Thank You 😊
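
Slide 40 pairs an overall time budget of 250 ms with two backend calls that each get a 100 ms timeout, plus a failure response. The transcript only preserves the diagram labels, so the following is a minimal sketch of one plausible reading, not the speaker's implementation. It assumes the two backends are called one after the other; `call_backend`, `handle_request` and the string responses are hypothetical names made up for illustration.

```python
import asyncio

TIME_BUDGET_S = 0.250   # overall budget from the slide (250 ms)
CALL_TIMEOUT_S = 0.100  # per-backend timeout from the slide (100 ms)


async def call_backend(name: str, latency_s: float) -> str:
    """Hypothetical backend stub that answers after `latency_s` seconds."""
    await asyncio.sleep(latency_s)
    return f"{name}: ok"


async def handle_request(latency_1_s: float, latency_2_s: float) -> str:
    """Call two backends in sequence (an assumption), each capped at
    CALL_TIMEOUT_S, with the whole request capped at TIME_BUDGET_S."""

    async def work() -> str:
        first = await asyncio.wait_for(
            call_backend("backend-1", latency_1_s), CALL_TIMEOUT_S)
        second = await asyncio.wait_for(
            call_backend("backend-2", latency_2_s), CALL_TIMEOUT_S)
        return f"{first}, {second}"

    try:
        # The outer budget is a safety net on top of the per-call timeouts.
        return await asyncio.wait_for(work(), TIME_BUDGET_S)
    except asyncio.TimeoutError:
        # Answer with a defined failure response instead of hanging.
        return "failure response"


if __name__ == "__main__":
    print(asyncio.run(handle_request(0.01, 0.01)))  # both backends are fast
    print(asyncio.run(handle_request(0.50, 0.01)))  # backend 1 is too slow
```

With these numbers the two timeouts add up to at most 200 ms, which leaves roughly 50 ms of the budget for the service's own work; the caller gets either a full answer or a failure response within the budget.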
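
Slide 41 names a back-off algorithm without saying which one. A common choice, assumed here, is exponential back-off with full jitter: the upper bound of the wait doubles after every failed attempt up to a cap, and the actual wait is drawn at random below that bound so that many clients do not retry in lockstep. The function names and default values below are hypothetical, chosen only for this sketch.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def backoff_delay(attempt: int, base_s: float = 0.1, cap_s: float = 5.0) -> float:
    """Upper bound of the wait after failed attempt number `attempt` (0-based)."""
    return min(cap_s, base_s * (2 ** attempt))


def retry_with_backoff(
    op: Callable[[], T],
    max_attempts: int = 5,
    base_s: float = 0.1,
    cap_s: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,  # injectable for tests
) -> T:
    """Run `op`, retrying on any exception with jittered exponential back-off."""
    for attempt in range(max_attempts):
        try:
            return op()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            # Full jitter: wait a random time between 0 and the current bound.
            sleep(random.uniform(0, backoff_delay(attempt, base_s, cap_s)))
    raise ValueError("max_attempts must be at least 1")
```

Retrying like this is only safe when the retried operation is idempotent (slide 43), because a request that timed out may still have been processed by the backend.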
