This document discusses Apache Kafka, a distributed publish-subscribe messaging system, and describes how LinkedIn uses it to handle high volumes of real-time event data across multiple data centers. Key points: at LinkedIn, Kafka handles 20 billion events per day and 3 terabytes of data; it provides high throughput, persistence, and elastic scalability; and it integrates with Hadoop for offline data processing.
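
To put the quoted volume in perspective, the short sketch below converts the figures above into average rates. It is a back-of-the-envelope calculation, not a measurement: the summary does not say what period the 3 terabytes covers, so the code assumes it is a daily figure in decimal units (10^12 bytes), and the function names are illustrative only.

```python
# Back-of-the-envelope rates derived from the figures quoted in the summary.
# Assumption: the 3 TB figure is per day and uses decimal terabytes;
# the summary itself does not state either.

EVENTS_PER_DAY = 20_000_000_000      # 20 billion events per day (from the text)
BYTES_PER_DAY = 3 * 10**12           # 3 terabytes, assumed daily and decimal
SECONDS_PER_DAY = 24 * 60 * 60       # 86,400 seconds


def average_events_per_second(events_per_day: int) -> float:
    """Average event rate if the load were spread evenly over a day."""
    return events_per_day / SECONDS_PER_DAY


def average_bytes_per_event(bytes_per_day: int, events_per_day: int) -> float:
    """Average payload size implied by the two daily totals."""
    return bytes_per_day / events_per_day


if __name__ == "__main__":
    rate = average_events_per_second(EVENTS_PER_DAY)
    size = average_bytes_per_event(BYTES_PER_DAY, EVENTS_PER_DAY)
    print(f"average rate: {rate:,.0f} events/s")
    print(f"average size: {size:.0f} bytes/event")
```

Under these assumptions the average works out to roughly 231,000 events per second at about 150 bytes per event. Real traffic is bursty, so peak rates would be higher than this even-spread average.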