This document discusses scaling in cloud computing. Scaling lets a system expand or contract its resources to match its workload; in the cloud, this adjustment is dynamic and often automatic. There are two main types of scaling: vertical scaling, which adds resources such as CPU or memory to an existing server, and horizontal scaling, which adds more servers. Horizontal scaling is the more cloud-native of the two because, in principle, capacity can keep growing as servers are added, whereas vertical scaling is capped by the largest server available. Auto-scaling can be proactive, adjusting capacity on a schedule, or reactive, adjusting capacity in response to measured demand. Combining proactive and reactive auto-scaling provides flexible scaling in the cloud.
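As a rough illustration of how reactive and proactive auto-scaling can work together for horizontal scaling, the sketch below computes a desired server count from a measured CPU figure and a schedule. It is a minimal sketch under stated assumptions, not any cloud provider's API: the names `ScalingPolicy`, `reactive_target`, `proactive_floor`, and `desired_servers`, the proportional sizing rule, and the `(start_hour, end_hour, servers)` schedule format are all hypothetical choices made for this example.

```python
import math
from dataclasses import dataclass


@dataclass
class ScalingPolicy:
    # All values are illustrative assumptions, not provider defaults.
    min_servers: int = 2
    max_servers: int = 10
    target_cpu_pct: float = 60  # desired average CPU utilization per server


def clamp(value: int, policy: ScalingPolicy) -> int:
    """Keep the server count inside the policy's allowed range."""
    return max(policy.min_servers, min(policy.max_servers, value))


def reactive_target(current_servers: int, avg_cpu_pct: float,
                    policy: ScalingPolicy) -> int:
    """Reactive scaling: resize in proportion to measured demand.

    If average CPU is above the target, the fleet grows; if it is
    below the target, the fleet shrinks.
    """
    desired = math.ceil(current_servers * avg_cpu_pct / policy.target_cpu_pct)
    return clamp(desired, policy)


def proactive_floor(hour: int, schedule: list[tuple[int, int, int]]) -> int:
    """Proactive scaling: scheduled minimum for this hour, or 0 if none.

    Each schedule entry is (start_hour, end_hour, servers), with the
    end hour exclusive.
    """
    return max((n for start, end, n in schedule if start <= hour < end),
               default=0)


def desired_servers(current_servers: int, avg_cpu_pct: float, hour: int,
                    policy: ScalingPolicy,
                    schedule: list[tuple[int, int, int]]) -> int:
    """Combine both approaches: take whichever asks for more capacity."""
    reactive = reactive_target(current_servers, avg_cpu_pct, policy)
    scheduled = proactive_floor(hour, schedule)
    return clamp(max(reactive, scheduled), policy)
```

With the default policy above and a schedule of `[(9, 17, 5)]`, a call such as `desired_servers(4, 15, 10, ScalingPolicy(), [(9, 17, 5)])` returns 5: measured demand alone would shrink the fleet to the minimum of 2, but the schedule keeps 5 servers running during business hours. Taking the larger of the two values is one possible way to combine the approaches; a real system might weigh them differently.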