This document discusses the challenges cloud providers face in managing the performance of enterprise applications deployed in the cloud. It outlines how queuing models can be used to analyze application performance, identify bottlenecks, determine optimal resource allocation, and ensure performance meets SLAs. The key points are:
1) Cloud providers must monitor application workloads, characterize transactions and usage patterns, and plan capacity based on changing demands.
2) Queuing models can simulate application behavior under different workloads and help size resources needed to meet performance targets.
3) Both hardware and software bottlenecks must be identified and addressed, as insufficient tuning parameters can impact performance more than hardware capacity.