- Layer 7 load balancing (L7 LB) is needed for modern applications as architectures and users have changed from monolithic to microservices and APIs, and from only humans to humans, systems, and things.
- Scaling modern applications requires choosing an architecture like horizontal duplication, functional decomposition, or data partitioning along the X, Y, and Z axes rather than just choosing a load balancing algorithm.
- Architecting for scalability involves variations of L7 LB techniques like data partitioning, URL/HTTP dispatch, dynamic routing, functional decomposition, and API metering and versioning. Key considerations for scaling include API design, user/entity identification, API versioning, and monitoring.