This document discusses latency and strategies for improving latency when building APIs and services. It describes using percentiles instead of averages to measure latency, and tools like HdrHistogram and Finagle for tracking latencies. It also discusses experiences using Redis, Cassandra and Aerospike for low latency, and strategies for connecting to Cassandra and handling timeouts. Key takeaways are around not relying on averages, challenges of scaling in .NET, choosing the right NoSQL solution, and that blocking is not always bad if it reduces timeouts.