Scaling APIs: Predict, Prepare for, Overcome the Challenges


Published on

Published in: Technology
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide
  • Creative Commons Attribution-Share Alike 3.0 United States License
  • Scaling APIs: Predict, Prepare for, Overcome the Challenges

    1. 1. Scaling APIsFeeding your SpeedsApril 5, 2012Greg Brail @gbrailBrian Pagano @brianpagano
    2. 2. @gbrail @brianpagano
    3. 3.
    4. 4.
    5. 5. New! IRC Channel #api-craft on freenode
    6. 6. What is scale to an API?Developers: Lots of developers building appsApps: Lots of apps for end users to useEnd users: Millions and millions of app usersVersions: Lots of API versions to manageAPI calls: All of these things result in API calls…
    7. 7. Today’s Topic: API CallsToday we are going to focus on handling huge numbersof API calls in an API infrastructure Ever been in a meeting where someone said,“let’s not talk about ‘speeds and feeds’ today?” This is not that meeting.
    8. 8. Why do APIs need to scale?Thousands of app developers… Each one must be managed, signed up, etc.Thousands of apps… Each one has credentials that need to be validatedMillions of end users… Each one has one or more OAuth access tokensResult in lots of API calls. For example: One million end users, Making 1000 API calls a day, Results in one billion API calls a day, Or about 11,000 API calls per second on average
    9. 9. Tracking API callsToday we’ll mainly talk about throughputMeasured in API calls or transactions per second (tps) For an API, usually as the number of users increases, throughput increasesAs throughput increases, latency often increases too It’s not enough just to handle lots of throughput – it’s important to handle it with a reasonable amount of latency
    10. 10. What Limits Scale?Disk DatabaseNetwork App serverCPU API ProxyMemory Load balancer Cache servers
    11. 11. What are some limits?Seek time Database design & tuningRotational speed App server coding & configTransfer speed Proxy configurationClock speed Load balancer policiesNumber of cores Cache configurationAmount of RAM And many more…
    12. 12. Some examplesWe’re going to talk about things to look at asthroughput grows from one level of traffic to another… 1 tps 10 tps 100 tps 1000 tps 10,000 tps 100,000 tps and beyond…..
    13. 13. Image from
    14. 14. At 1 transaction per second86,400 per day / 2.5 million per monthAlmost everything can handle this.What about the database? If each API call makes several big SQL queries, it may not!
    15. 15. Strategies for 1 tpsTest (always)Tune the database installationTune the database designMonitor query performanceTest again!
    16. 16. image from
    17. 17. At 10 transactions per second864,000 per day / 25 million per monthMost infrastructure can still handle this.What about the application server? Is the app well-designed enough? Does it make an excessive number of database calls?
    18. 18. Strategies for 10 tpsEnsure that the app server is properly optimized Do API calls make the minimum number of database calls? Do API calls depend on large numbers of external services?
    19. 19. image from
    20. 20. At 100 transactions per second8.6 million per day / 259 million per month(Now we are starting to get somewhere)RDBMS systems may struggleLess-efficient app servers may struggle“Free” tiers on hosting platforms aren’t an option
    21. 21. Strategies for 100 tpsDatabase optimization and tuning is critical here Allocate fast storage, and lots of it Allocate lots of memory Tune the database to use it! Find bad queries and fix them or optimize themApp server tuning is critical here Are there enough threads in the thread pool? Are there enough processes?
    22. 22. Image from
    23. 23. At 1000 transactions per second86 million per day / 2.5 billion per monthNow everything may start to break…What is the mix between reads and writes?
    24. 24. Strategies for 1000 tpsUnderstand the mix between reads and writes Cache the reads as much as you can If you can cache them closer to the client, betterUnderstand your app server performance Faster app servers should still be able to handle (like Java) RoR, Python, PHP, etc will require much bigger clusters Stateless app servers are your friend!
    25. 25. More strategies for 1000 tpsCan the database handle the load? It can if most transactions are reads And you cache as much as you canOtherwise it’s time to scale the database layer Sharded RDBMSes Or a scalable NoSQL database works here
    26. 26. At 10,000 transactions per second864 million per day / 25 billion per monthIf most transactions are reads, caching is your friendOtherwise, this is serious business No single database can handle this Few single app servers can handle this If API calls are large, what will the bandwidth be?
    27. 27. Strategies for 10,000 tpsCaching is even more essential Even a simple cache can handle this load on one or two boxesDatabase writes are problematic No single database server can write 10,000 times per second Scalable, eventually-consistent databases can scale this big, (like Cassandra)
    28. 28. More for 10,000App servers You’ll need a cluster of app servers no matter what! What about session management? What about load balancing?
    29. 29. 100,000 API calls per second8.6 billion per day!Now your API is truly impressive (either that or it is very poorly designed!)You will need racks of infrastructure no matter what!
    30. 30. Some other considerationsAPI designClient designLatencyBandwidth
    31. 31. What about API design?Every API call has overhead: TCP connection / SSL handshake / load balancer CPU / API proxy CPU / App server CPU and thread pool / database connections / disk I/O…Do you need to make so many? Can you design your APIs to support fewer high-value API calls? Can you have “batch” calls in your API?
    32. 32. What about the client?Can client apps use the API more efficiently? Don’t make the same API calls over and over Utilize compression Utilize conditional requests in HTTP Which means that the API server should support them! Request only the data that’s needed Which means that the API server should trim responses Or paginate them
    33. 33. What about latency?Latency kills user experience!How can the API server reduce it? Remove steps in the processing flow through caching Cache closer to the API clients
    34. 34. What about the network?What kind of network connection to you have? For instance, in EC2 you get 1Gbps, Or about 122 megabytes / second At 10,000 tps, for instance, that’s 12K per API call
    35. 35. THANK YOUQuestions and ideas to:@gbrail@brianpagano