In this deck we go discuss about the increased complexity of microservice deployments by means of containers and walk through container failures and their impact to the entire environment.
Running microservice environments is no free lunchAlois Mayr
Since the beginning of the hype around containers and microservices, several organizations have actually transformed their IT to support both legacy and new applications in a more modern environment, moving away from physical data centers toward public, private, or hybrid clouds, using new technologies that pop up every day. While certain things have gotten easier, you still need to do your homework when it comes to proper application architecture, scalability, orchestration, performance, and monitoring.
In this presentation we cover the dark side of microservices so you don’t have to learn it the hard way.
Flink Forward Berlin 2018: Lasse Nedergaard - "Our successful journey with Fl...Flink Forward
At Trackunit we have based our telematic IoT processing pipeline on Flink. We started out on version 1.2 and are now on 1.5. In this session I will share the lessons learned going from one giant Flink job to many smalls and some of the problems we have seen operating Flink on AWS EMR cluster, including topics such as:
• Why external enrichment can be challenging with Flink Async operator.
• Pattern to change external enrichment into streaming join.
• Building your own source
• Why Flink restart is great but should be avoided as it will terminate your cluster.
• Why iteration can cause deadlocking when backpressure occurs.
• Kinesis rate exceeded exception
• Why throttling Flink source read during catchup is needed.
• Why we moved from EMR/Kinesis and into DC/OS and kafka.
• And much more.
Slides from my talk at QCon New York on how Netflix increases resiliency through failure, covering the Chaos Monkey, Chaos Gorilla, Latency Monkey, and others from the Simian Army.
De presentatie zal ingaan op wallboards/information radiators in het algemeen en de wedstrijd die door Atlassian in oktober - november 2010 georganiseerd is. Daarnaast zal er gesproken worden over onze gerealiseerde wallboard, zoals; gebruikte technologien en integratie met andere producten (waaronder Atlassian producten).
Flink Forward Berlin 2018: Wei-Che (Tony) Wei - "Lessons learned from Migrati...Flink Forward
In modern applications of streaming frameworks, stateful streaming is arguably one of the most important usage cases. Flink, as a well-supported streaming framework for stateful streaming, readily helps developers spend less efforts on system deployment and focus more on the business logic. Nevertheless, upgrading from an existing production system to a new one with stateful streaming can still be a challenging task for any development team. In this talk, we will share our experience in migrating an existing system at Appier (an AI-based startup specialized with B2B solutions) to stateful streaming with Flink. We will first discuss how stateful streaming matches our business logic and its potential benefits. Then, we review the obstacles that we have encountered during migration, and present our solutions to conquer them. We hope that our experience and tips shared in this talk hints future users to prepare themselves towards applying Flink in their production systems more painlessly.
(DVO205) Monitoring Evolution: Flying Blind to Flying by InstrumentAmazon Web Services
Today, AdRoll runs its infrastructure by instrumentation: constantly asking empirical questions, analyzing data for answers, and designing new features with instrumentation in mind to understand how functionality will work upon release. AdRoll’s development methodology did not start out this way, however. It took a cultural shift and many new tools and processes to adopt this approach. In this session, AdRoll and Datadog will discuss how to evolve your organization from a state of “flying blind” to a culture focused on monitoring and data-based decisions. Session sponsored by Datadog.
BOSH deploys distributed systems, and Diego runs any containersBenjamin Gandon
Learn how BOSH deploys distributed systems. Plus, Discover Diego, the flexible container engine, inside Cloud Foundry… or as the standalone Lattice engine!
Running microservice environments is no free lunchAlois Mayr
Since the beginning of the hype around containers and microservices, several organizations have actually transformed their IT to support both legacy and new applications in a more modern environment, moving away from physical data centers toward public, private, or hybrid clouds, using new technologies that pop up every day. While certain things have gotten easier, you still need to do your homework when it comes to proper application architecture, scalability, orchestration, performance, and monitoring.
In this presentation we cover the dark side of microservices so you don’t have to learn it the hard way.
Flink Forward Berlin 2018: Lasse Nedergaard - "Our successful journey with Fl...Flink Forward
At Trackunit we have based our telematic IoT processing pipeline on Flink. We started out on version 1.2 and are now on 1.5. In this session I will share the lessons learned going from one giant Flink job to many smalls and some of the problems we have seen operating Flink on AWS EMR cluster, including topics such as:
• Why external enrichment can be challenging with Flink Async operator.
• Pattern to change external enrichment into streaming join.
• Building your own source
• Why Flink restart is great but should be avoided as it will terminate your cluster.
• Why iteration can cause deadlocking when backpressure occurs.
• Kinesis rate exceeded exception
• Why throttling Flink source read during catchup is needed.
• Why we moved from EMR/Kinesis and into DC/OS and kafka.
• And much more.
Slides from my talk at QCon New York on how Netflix increases resiliency through failure, covering the Chaos Monkey, Chaos Gorilla, Latency Monkey, and others from the Simian Army.
De presentatie zal ingaan op wallboards/information radiators in het algemeen en de wedstrijd die door Atlassian in oktober - november 2010 georganiseerd is. Daarnaast zal er gesproken worden over onze gerealiseerde wallboard, zoals; gebruikte technologien en integratie met andere producten (waaronder Atlassian producten).
Flink Forward Berlin 2018: Wei-Che (Tony) Wei - "Lessons learned from Migrati...Flink Forward
In modern applications of streaming frameworks, stateful streaming is arguably one of the most important usage cases. Flink, as a well-supported streaming framework for stateful streaming, readily helps developers spend less efforts on system deployment and focus more on the business logic. Nevertheless, upgrading from an existing production system to a new one with stateful streaming can still be a challenging task for any development team. In this talk, we will share our experience in migrating an existing system at Appier (an AI-based startup specialized with B2B solutions) to stateful streaming with Flink. We will first discuss how stateful streaming matches our business logic and its potential benefits. Then, we review the obstacles that we have encountered during migration, and present our solutions to conquer them. We hope that our experience and tips shared in this talk hints future users to prepare themselves towards applying Flink in their production systems more painlessly.
(DVO205) Monitoring Evolution: Flying Blind to Flying by InstrumentAmazon Web Services
Today, AdRoll runs its infrastructure by instrumentation: constantly asking empirical questions, analyzing data for answers, and designing new features with instrumentation in mind to understand how functionality will work upon release. AdRoll’s development methodology did not start out this way, however. It took a cultural shift and many new tools and processes to adopt this approach. In this session, AdRoll and Datadog will discuss how to evolve your organization from a state of “flying blind” to a culture focused on monitoring and data-based decisions. Session sponsored by Datadog.
BOSH deploys distributed systems, and Diego runs any containersBenjamin Gandon
Learn how BOSH deploys distributed systems. Plus, Discover Diego, the flexible container engine, inside Cloud Foundry… or as the standalone Lattice engine!
Best Practices for Monitoring Your Cloud Environment and ApplicationsProlifics
Abstract: You have completed the heavy lifting of migrating applications to the cloud. But you are not done yet. What is your monitoring strategy for the cloud? What are the best practices to monitor the cloud infrastructure, deployed applications and end user experience? In this session, we will be answering these questions and explore the various IBM APM and Analytics offerings that will help you in your decision making process. Having a comprehensive monitoring strategy is critical as most customers use a combination of public and private cloud environments and being able to monitor these using a fully integrated and customizable solution is essential to the health, availability and performance of the cloud deployed applications and services.
Tracxn FinTech SEA Startup Landscape, July 2016Tracxn
Our FinTech SouthEast Asia Report covers FinTech trends and investments in Singapore, Indonesia, Malaysia, Thailand, Philippines, and Vietnam, with exhaustive Q&A’s with the leadership team at East Ventures and Lenddo.
View these slides if you're you new to cloud computing and would like to learn more about Amazon Web Services (AWS), if you intend to implement a project and would like to discover the basics of the AWS cloud or if you are a business looking to evaluate cloud computing.
In the webinar based on these slides, we answered the following questions:
• What is Cloud Computing with AWS and what benefits can it deliver?
• Who is using AWS and what are they using it for?
• How can I use AWS Services to run my workloads?
View the webinar recording on YouTube here: http://youtu.be/QROD20r6-sQ
Datapipe, an AWS Premier Consulting Partner, has built and customized a global monitoring platform specifically for AWS. This presentation discusses the challenges encountered when architecting this solution and provides a live demonstration of the platform and its specific monitoring capabilities.
Philippe Gelis, CEO & Co-Founder of Kantox, talking about the next 10 years in Fintech; A new co-petitive eco-system starts emerging within the financial sector
Overview of industry trends and insights of Fortune 500 companies and startups' activities in the FinTech space. We cover banking tech (security, crm, analytics), payments (pos, money transfer, commerce), cyber currency (blockchain, bitcoin, wallets, cryptocurrency exchanges), business finance (lending, crowdfunding), personal finance (lending, wealth management, mortgage, credit), and alternative cores (banking, insurance).
The Mushroom Cloud Effect - What happens when containers fail?Alois Mayr
Micro service architectures result in up to 20 times larger environments than their monolithic counterparts. In such big and interconnected environments container metrics will tell you about infrastructure health but not service health. Even if you have implemented service health checks to quickly react on service failures, in a resilient system you will see intermediary mushroom cloud effects of a large number of services being affected temporarily. How do you find out what really caused the problem and how to distinguish effect vs. cause?
The Mushroom Cloud Effect or What Happens When Containers Fail? by Alois Mayr...Docker, Inc.
Micro service architectures result in up to 20 times larger environments than their monolithic counterparts. In such big and interconnected environments container metrics will tell you about infrastructure health but not service health. Even if you have implemented service health checks to quickly react on service failures, in a resilient system you will see intermediary mushroom cloud effects of a large number of services being affected temporarily. How do you find out what really caused the problem and how to distinguish effect vs. cause?
In this session we will do post-mortem analysis by walking through different cases of failures we've observed in a real-world large e-commerce production environment and show you how to figure out what actually caused the failures.
Best Practices for Monitoring Your Cloud Environment and ApplicationsProlifics
Abstract: You have completed the heavy lifting of migrating applications to the cloud. But you are not done yet. What is your monitoring strategy for the cloud? What are the best practices to monitor the cloud infrastructure, deployed applications and end user experience? In this session, we will be answering these questions and explore the various IBM APM and Analytics offerings that will help you in your decision making process. Having a comprehensive monitoring strategy is critical as most customers use a combination of public and private cloud environments and being able to monitor these using a fully integrated and customizable solution is essential to the health, availability and performance of the cloud deployed applications and services.
Tracxn FinTech SEA Startup Landscape, July 2016Tracxn
Our FinTech SouthEast Asia Report covers FinTech trends and investments in Singapore, Indonesia, Malaysia, Thailand, Philippines, and Vietnam, with exhaustive Q&A’s with the leadership team at East Ventures and Lenddo.
View these slides if you're you new to cloud computing and would like to learn more about Amazon Web Services (AWS), if you intend to implement a project and would like to discover the basics of the AWS cloud or if you are a business looking to evaluate cloud computing.
In the webinar based on these slides, we answered the following questions:
• What is Cloud Computing with AWS and what benefits can it deliver?
• Who is using AWS and what are they using it for?
• How can I use AWS Services to run my workloads?
View the webinar recording on YouTube here: http://youtu.be/QROD20r6-sQ
Datapipe, an AWS Premier Consulting Partner, has built and customized a global monitoring platform specifically for AWS. This presentation discusses the challenges encountered when architecting this solution and provides a live demonstration of the platform and its specific monitoring capabilities.
Philippe Gelis, CEO & Co-Founder of Kantox, talking about the next 10 years in Fintech; A new co-petitive eco-system starts emerging within the financial sector
Overview of industry trends and insights of Fortune 500 companies and startups' activities in the FinTech space. We cover banking tech (security, crm, analytics), payments (pos, money transfer, commerce), cyber currency (blockchain, bitcoin, wallets, cryptocurrency exchanges), business finance (lending, crowdfunding), personal finance (lending, wealth management, mortgage, credit), and alternative cores (banking, insurance).
The Mushroom Cloud Effect - What happens when containers fail?Alois Mayr
Micro service architectures result in up to 20 times larger environments than their monolithic counterparts. In such big and interconnected environments container metrics will tell you about infrastructure health but not service health. Even if you have implemented service health checks to quickly react on service failures, in a resilient system you will see intermediary mushroom cloud effects of a large number of services being affected temporarily. How do you find out what really caused the problem and how to distinguish effect vs. cause?
The Mushroom Cloud Effect or What Happens When Containers Fail? by Alois Mayr...Docker, Inc.
Micro service architectures result in up to 20 times larger environments than their monolithic counterparts. In such big and interconnected environments container metrics will tell you about infrastructure health but not service health. Even if you have implemented service health checks to quickly react on service failures, in a resilient system you will see intermediary mushroom cloud effects of a large number of services being affected temporarily. How do you find out what really caused the problem and how to distinguish effect vs. cause?
In this session we will do post-mortem analysis by walking through different cases of failures we've observed in a real-world large e-commerce production environment and show you how to figure out what actually caused the failures.
My 6th. revision of my Stackato presentation given at the German Perl Workshop 2013 in Berlin, Germany,
More information available at: https://logiclab.jira.com/wiki/display/OPEN/Stackato
JavaOne 2016 "Java, Microservices, Cloud and Containers"Daniel Bryant
Everyone is talking about building “cloud native” Java applications—and taking advantage of microservice architecture, containers, and orchestration/PaaS platforms—but there is surprisingly little discussion of migrating existing legacy (moneymaking) applications. This session aims to address this, and, using lessons learned from several real-world examples, it covers topics such when to rewrite applications (if at all), modeling/extracting business domains, applying the “application strangler” pattern, common misconceptions with “12-factor” application design, and the benefits/drawbacks of container technology.
Spotinst 'AWS Cost Optimization' Webinar - Jan 20th, 2016Spotinst
This webinar will assist you optimize your spendings by running spot instances over AWS EC2.
In addition, you will find case studies about Spotinst product of the Israeli unicorn ironSource and the hot Ad-Tech company Inneractive and techniques how to run containers with Spotinst and Rancher.
Site reliability in the Serverless age - Serverless Boston 2019Erik Peterson
Is SRE, DevOps and serverless a match made in heaven or is something missing? What about cost when building reliable Serverless systems? To answer this, lets explore SRE and Serverless principals, a new concept called FinDevOps, and along the way make a few predictions about our serverless future
This is my presentation of ActiveStates stackato given to the Copenhagen Perl Mongers
More information available at: https://logiclab.jira.com/wiki/display/OPEN/Stackato
.Net Microservices with Event Sourcing, CQRS, Docker and... Windows Server 20...Javier García Magna
Good technical practices you can follow with (micro)services but can be applied to almost anything: discovery (microphone/consul), security, resilience (polly), composition, ssecurity (jwt/oauth2)... And then an example with a CQRS application, and how docker can be used in Windows 2016. Lastly a brief summary of what Service Fabric is and its programming models.
The slides for my UBC Alumni talk on programming for the Cloud. I show Cloud Foundry as an example of an open cloud platform and how easy it is to create modular, scalable applications using it.
Do you need Ops in your new startup? If not now, then when? And...what is Ops?
Learn how to scale ruby-based distributed software infrastructure in the cloud to serve 4,000 requests per second, handle 400 updates per second, and achieve 99.97% uptime – all while building the product at the speed of light.
Unimpressed? Now try doing the above altogether without the Ops team, while growing your traffic 100x in 6 months and deploying 5-6 times a day!
It could be a dream, but luckily it's a reality that could be yours.
My Stackato presentation given to the CopenhagenJS user group. Basic examples were implemented in Node.
More information available at: https://logiclab.jira.com/wiki/display/OPEN/Stackato
GraphSummit Paris - The art of the possible with Graph TechnologyNeo4j
Sudhir Hasbe, Chief Product Officer, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...Juraj Vysvader
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I didn't get rich from it but it did have 63K downloads (powered possible tens of thousands of websites).
Software Engineering, Software Consulting, Tech Lead, Spring Boot, Spring Cloud, Spring Core, Spring JDBC, Spring Transaction, Spring MVC, OpenShift Cloud Platform, Kafka, REST, SOAP, LLD & HLD.
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamtakuyayamamoto1800
In this slide, we show the simulation example and the way to compile this solver.
In this solver, the Helmholtz equation can be solved by helmholtzFoam. Also, the Helmholtz equation with uniformly dispersed bubbles can be simulated by helmholtzBubbleFoam.
Unleash Unlimited Potential with One-Time Purchase
BoxLang is more than just a language; it's a community. By choosing a Visionary License, you're not just investing in your success, you're actively contributing to the ongoing development and support of BoxLang.
Check out the webinar slides to learn more about how XfilesPro transforms Salesforce document management by leveraging its world-class applications. For more details, please connect with sales@xfilespro.com
If you want to watch the on-demand webinar, please click here: https://www.xfilespro.com/webinars/salesforce-document-management-2-0-smarter-faster-better/
May Marketo Masterclass, London MUG May 22 2024.pdfAdele Miller
Can't make Adobe Summit in Vegas? No sweat because the EMEA Marketo Engage Champions are coming to London to share their Summit sessions, insights and more!
This is a MUG with a twist you don't want to miss.
Quarkus Hidden and Forbidden ExtensionsMax Andersen
Quarkus has a vast extension ecosystem and is known for its subsonic and subatomic feature set. Some of these features are not as well known, and some extensions are less talked about, but that does not make them less interesting - quite the opposite.
Come join this talk to see some tips and tricks for using Quarkus and some of the lesser known features, extensions and development techniques.
How to Position Your Globus Data Portal for Success Ten Good PracticesGlobus
Science gateways allow science and engineering communities to access shared data, software, computing services, and instruments. Science gateways have gained a lot of traction in the last twenty years, as evidenced by projects such as the Science Gateways Community Institute (SGCI) and the Center of Excellence on Science Gateways (SGX3) in the US, The Australian Research Data Commons (ARDC) and its platforms in Australia, and the projects around Virtual Research Environments in Europe. A few mature frameworks have evolved with their different strengths and foci and have been taken up by a larger community such as the Globus Data Portal, Hubzero, Tapis, and Galaxy. However, even when gateways are built on successful frameworks, they continue to face the challenges of ongoing maintenance costs and how to meet the ever-expanding needs of the community they serve with enhanced features. It is not uncommon that gateways with compelling use cases are nonetheless unable to get past the prototype phase and become a full production service, or if they do, they don't survive more than a couple of years. While there is no guaranteed pathway to success, it seems likely that for any gateway there is a need for a strong community and/or solid funding streams to create and sustain its success. With over twenty years of examples to draw from, this presentation goes into detail for ten factors common to successful and enduring gateways that effectively serve as best practices for any new or developing gateway.
We describe the deployment and use of Globus Compute for remote computation. This content is aimed at researchers who wish to compute on remote resources using a unified programming interface, as well as system administrators who will deploy and operate Globus Compute services on their research computing infrastructure.
Atelier - Innover avec l’IA Générative et les graphes de connaissancesNeo4j
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Allez au-delà du battage médiatique autour de l’IA et découvrez des techniques pratiques pour utiliser l’IA de manière responsable à travers les données de votre organisation. Explorez comment utiliser les graphes de connaissances pour augmenter la précision, la transparence et la capacité d’explication dans les systèmes d’IA générative. Vous partirez avec une expérience pratique combinant les relations entre les données et les LLM pour apporter du contexte spécifique à votre domaine et améliorer votre raisonnement.
Amenez votre ordinateur portable et nous vous guiderons sur la mise en place de votre propre pile d’IA générative, en vous fournissant des exemples pratiques et codés pour démarrer en quelques minutes.
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Globus
Large Language Models (LLMs) are currently the center of attention in the tech world, particularly for their potential to advance research. In this presentation, we'll explore a straightforward and effective method for quickly initiating inference runs on supercomputers using the vLLM tool with Globus Compute, specifically on the Polaris system at ALCF. We'll begin by briefly discussing the popularity and applications of LLMs in various fields. Following this, we will introduce the vLLM tool, and explain how it integrates with Globus Compute to efficiently manage LLM operations on Polaris. Attendees will learn the practical aspects of setting up and remotely triggering LLMs from local machines, focusing on ease of use and efficiency. This talk is ideal for researchers and practitioners looking to leverage the power of LLMs in their work, offering a clear guide to harnessing supercomputing resources for quick and effective LLM inference.
Prosigns: Transforming Business with Tailored Technology SolutionsProsigns
Unlocking Business Potential: Tailored Technology Solutions by Prosigns
Discover how Prosigns, a leading technology solutions provider, partners with businesses to drive innovation and success. Our presentation showcases our comprehensive range of services, including custom software development, web and mobile app development, AI & ML solutions, blockchain integration, DevOps services, and Microsoft Dynamics 365 support.
Custom Software Development: Prosigns specializes in creating bespoke software solutions that cater to your unique business needs. Our team of experts works closely with you to understand your requirements and deliver tailor-made software that enhances efficiency and drives growth.
Web and Mobile App Development: From responsive websites to intuitive mobile applications, Prosigns develops cutting-edge solutions that engage users and deliver seamless experiences across devices.
AI & ML Solutions: Harnessing the power of Artificial Intelligence and Machine Learning, Prosigns provides smart solutions that automate processes, provide valuable insights, and drive informed decision-making.
Blockchain Integration: Prosigns offers comprehensive blockchain solutions, including development, integration, and consulting services, enabling businesses to leverage blockchain technology for enhanced security, transparency, and efficiency.
DevOps Services: Prosigns' DevOps services streamline development and operations processes, ensuring faster and more reliable software delivery through automation and continuous integration.
Microsoft Dynamics 365 Support: Prosigns provides comprehensive support and maintenance services for Microsoft Dynamics 365, ensuring your system is always up-to-date, secure, and running smoothly.
Learn how our collaborative approach and dedication to excellence help businesses achieve their goals and stay ahead in today's digital landscape. From concept to deployment, Prosigns is your trusted partner for transforming ideas into reality and unlocking the full potential of your business.
Join us on a journey of innovation and growth. Let's partner for success with Prosigns.
Prosigns: Transforming Business with Tailored Technology Solutions
When containers fail
1. @mayraloisAugust, 2016
The Mushroom Cloud Effect
or
What Happens When Containers Fail?
Alois Mayr
Technology Lead Cloud & Containers
Microservices and Containers Meetup Austin
2. @mayralois
about:me
• Austrian
• Never seen
Sound of Music
• Often seen much more modern
technology stuff
• Seen even more technology stuff
now with Dynatrace
• Technology Lead for Cloud & Containers
CloudFoundry, Docker, AWS, etc.
3. @mayralois
about:dynatrace
• APM market leader who helps companies in
Digital transformation
• Founded in Austria back in 2005
• ~ 1600 employees worldwide
• > 8000 customers across all industries
• Seen many performance and stability
problems and patterns out there
4. @mayralois
about:you
• Who of you run/manage containers in
production?
• Whose life has become easier since then?
• What’s needed to make it easy?
• Thanks!
10. @mayralois
Important Aspects…
• Lots of (micro-)services
• Lots of communication between services
• Service dependencies
• Versioning and API compatibilities
• Zero downtime
13. @mayralois
Deployments are no Longer Static
7:00 a.m.
Low load, service running
with minimum redundancy
12:00 p.m.
Scaled up service during peak load
with failover of problematic node
7:00 p.m.
Scaled back down to lower load,
move to different geolocation
19. @mayralois
The Hungry Container Breakdown
• Shared /logs partition on host
• No log rotation, no archiving for app logs
• No proper log management used for Docker environment
• Shared /logs partition ran out of space
What was the problem?
20. @mayralois
The Hungry Container Breakdown
• Container health checks failed
• Orchestration killed container and rescheduled new one
• Still no free space on /logs
• Termination and rescheduling
• /var/lib/docker ran out of space
• Cluster nodes were no longer able to run any containers
How the problem has evolved over time?
21. @mayralois
The Hungry Container Breakdown
• Services at the top of the graph
• Increased failure rates
• Lots of depending Tomcat and DB services affected
How the problem affected services?
23. @mayralois
The Hungry Container Breakdown
Log management tools for app logs
--log-driver=none|syslog
Remove container / clean-up jobs
--rm=true
/var/lib/docker deserves its own partition
How the problem could have been avoided?
26. @mayralois
Break Your Clusters Early
Massive load testing!
Survive three days of pain
Include everything
Services, Containers,
Orchestration, EC2 instances