2. Agenda
• Introduction to AWS Autoscaling
• Key Terms and Components
• How it works in AWS
• Features and Benefits
• Types of Scaling
• Use cases and Troubleshooting
• Resources
3. Introduction to AWS AutoScaling
• Monitors your applications and automatically adjusts capacity to maintain
steady, predictable performance at the lowest possible cost
• With AWS Auto Scaling, your applications always have the right resources
at the right time.
• Enables dynamic scaling of EC2 instances in order to automatically increase
the number of Amazon EC2 instances during demand spikes to maintain
performance and decrease capacity during lulls to reduce costs
• Auto Scaling ensures a steady minimum specified count of Instances will
always be running. If an instance is found unhealthy, Auto Scaling will
terminate the Instance and launch a new one
4. Key Terms and Components
• Scaling Groups
• Core of Auto Scaling and contains a collection of EC2
instances that share similar characteristics
• Specify minimum, maximum, and, desired number of EC2
capacity
• Metrics & Health Checks – metrics to determine when it
should launch or terminate instances and health checks
to determine if the instance is healthy or not
• Launch configurations
• Scaling group uses a launch configuration as a template
for its EC2 instances
• Specify information such as the AMI ID, instance type,
key pair, security groups, and block device mapping for
your instances.
• Scaling plans
• A scaling plan tells Auto Scaling when and how to scale
• you can base a scaling plan on the occurrence of
specified conditions (dynamic scaling) or on a schedule
6. Features and Benefits
• Improve Fault Tolerance
• Amazon EC2 Auto Scaling can detect when an instance is unhealthy, terminate
it, and replace it with a new one.
• Increase Application Availability
• Amazon EC2 Auto Scaling ensures that your application always has the right
amount of compute capacity.
• Lower Costs
• Amazon EC2 Auto Scaling adds new instances only when necessary, and
terminates them when no longer needed.
7. Types of Scaling
• Horizontal Scaling
• To increase the capacity by adding a
parallel unit/server
• Can be achieved automatically
• No downtime required
• Vertical Scaling
• To increase the capacity by increasing
resources in same logical unit/server
• Have to perform manually
• Server Reboot is required
8. Use Cases and Troubleshooting
• If your EC2 instances fail to launch due to issues with the capacity limits of your
Auto Scaling group, you might get one or more of the following error messages.
• We currently do not have sufficient <instance type> capacity in the Availability Zone you
requested (<requested Availability Zone>)....
• Cause:
• At this time, Auto Scaling cannot support your instance type in your requested Availability
Zone.
• Solution:
• Create a new launch configuration by following the recommendations in the error message.
• Update your Auto Scaling group with the new launch configuration
• <number of instances> instance(s) are already running. Launching EC2 instance failed.
• Cause: The Auto Scaling group has reached the limit set by the DesiredCapacity parameter.
• Solution:
• Update your Auto Scaling group by providing a new value for the desiredcapacity parameter
• If you've reached your limit for number of EC2 instances, you can request an increase
9. Resources
• Web Book
• https://docs.aws.amazon.com/autoscaling/ec2/userguide/as-dg.pdf#as-scaling-simple-step
• FAQs
• https://aws.amazon.com/autoscaling/faqs/
• Videos:
• https://www.youtube.com/watch?v=7SfVZqOVcCI
• https://www.youtube.com/watch?v=-hFAWk6hyZA
• https://www.youtube.com/watch?v=5swEiz0i-kE