This document covers virtual CPUs and CPU architecture. It begins with hypervisor resource queues, explaining how requests for CPU, memory, storage, and networking are queued. It then describes physical CPU architecture, including cores, sockets, NUMA, and memory locality. Next, it explains how the hypervisor schedules virtual CPUs and how to measure scheduling pressure. Finally, it recommends ways to right-size virtual machines and balance workloads to reduce scheduling delays.