SITE RELIABILITY ENGINEERING ©2016 LinkedIn Corporation. All Rights Reserved.
4. What for?
• Messaging
• Website Activity Tracking
• Metrics
• Log Aggregation
• Stream Processing
• For fun ;)
5. So how does it work?
▪ Components
  – Producer
  – Broker
    ▪ Topic
    ▪ Partition
  – Consumer
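
To make the relationship between these components concrete, here is a toy in-memory sketch. It is not the Kafka client API: the `Broker`, `Producer`, and `Consumer` classes, the `page-views` topic name, and the use of CRC32 to pick a partition are all assumptions made for illustration. It only shows the general shape: a topic is split into partitions, each partition is an append-only log, a producer picks a partition from the message key, and a consumer tracks its own read position per partition.

```python
import zlib
from collections import defaultdict


class Broker:
    """Toy broker: each topic is a list of append-only partition logs."""

    def __init__(self):
        self.topics = {}

    def create_topic(self, name, num_partitions):
        self.topics[name] = [[] for _ in range(num_partitions)]

    def append(self, topic, partition, message):
        log = self.topics[topic][partition]
        log.append(message)
        return len(log) - 1  # offset of the message just written

    def read(self, topic, partition, offset):
        # Return everything from the given offset to the end of the log.
        return self.topics[topic][partition][offset:]


class Producer:
    def __init__(self, broker):
        self.broker = broker

    def send(self, topic, key, value):
        num_partitions = len(self.broker.topics[topic])
        # Assumption: CRC32 stands in for whatever key hash a real client uses.
        partition = zlib.crc32(key.encode("utf-8")) % num_partitions
        offset = self.broker.append(topic, partition, (key, value))
        return partition, offset


class Consumer:
    def __init__(self, broker, topic):
        self.broker = broker
        self.topic = topic
        self.positions = defaultdict(int)  # partition -> next offset to read

    def poll(self):
        records = []
        for partition in range(len(self.broker.topics[self.topic])):
            batch = self.broker.read(self.topic, partition, self.positions[partition])
            self.positions[partition] += len(batch)
            records.extend(batch)
        return records


broker = Broker()
broker.create_topic("page-views", num_partitions=3)
producer = Producer(broker)
consumer = Consumer(broker, "page-views")

for i in range(6):
    producer.send("page-views", key=f"user-{i % 2}", value=f"view-{i}")

first_poll = consumer.poll()   # all six messages, grouped by partition
second_poll = consumer.poll()  # empty: the consumer's positions have advanced
```

Because messages with the same key always land in the same partition, their relative order is preserved for the consumer; ordering across partitions is not guaranteed in this sketch.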
9. Attributes of a Kafka Cluster
▪ Durable
▪ Scalable
▪ Low Latency
▪ Finite Retention
▪ No single point of failure
10. Kafka At LinkedIn
▪ Multiple Datacenters, Multiple Clusters
▪ Mirroring between clusters
▪ Message Types
  – Metrics
  – Tracking
  – Queuing
▪ Data transport from applications to Hadoop, and back
11. Some numbers!
▪ 1800+ Broker machines
▪ 79K+ Topics
▪ 1.1M+ Partitions
▪ 1.3 Trillion messages per day
▪ 330 Terabytes in/day
▪ 1.2 Petabytes out/day
▪ Peak load for a single cluster
  – 2 million messages/sec
  – 4.7 Gigabits/sec inbound
  – 15 Gigabits/sec outbound
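
For a sense of scale, the figures above can be turned into per-second and per-message averages. The sketch below is back-of-the-envelope arithmetic only. It assumes decimal units (1 TB = 10**12 bytes, 1 PB = 10**15 bytes), that the daily totals cover all clusters combined, and that the single-cluster peak message rate and peak bandwidth occurred at the same moment, none of which the slide states.

```python
SECONDS_PER_DAY = 24 * 60 * 60

# Figures quoted on the slide.
brokers = 1800
topics = 79_000
partitions = 1_100_000
messages_per_day = 1.3e12
bytes_in_per_day = 330e12     # assumption: decimal terabytes
bytes_out_per_day = 1.2e15    # assumption: decimal petabytes
peak_messages_per_sec = 2e6
peak_in_bits_per_sec = 4.7e9
peak_out_bits_per_sec = 15e9

# Daily totals expressed as averages.
avg_messages_per_sec = messages_per_day / SECONDS_PER_DAY
avg_message_bytes = bytes_in_per_day / messages_per_day

# How many bytes are read for every byte written.
fan_out = bytes_out_per_day / bytes_in_per_day

# Rough density figures (lower bounds, since the slide gives "N+" counts).
partitions_per_topic = partitions / topics
partitions_per_broker = partitions / brokers

# Single-cluster peak, assuming the message and bandwidth peaks coincide.
peak_message_bytes = peak_in_bits_per_sec / 8 / peak_messages_per_sec
peak_fan_out = peak_out_bits_per_sec / peak_in_bits_per_sec

print(f"average messages/sec:  {avg_messages_per_sec:,.0f}")
print(f"average message size:  {avg_message_bytes:.0f} bytes")
print(f"out/in ratio (daily):  {fan_out:.1f}")
print(f"partitions per topic:  {partitions_per_topic:.1f}")
print(f"partitions per broker: {partitions_per_broker:.0f}")
print(f"peak message size:     {peak_message_bytes:.0f} bytes")
print(f"out/in ratio (peak):   {peak_fan_out:.1f}")
```

Under these assumptions the daily totals work out to roughly 15 million messages per second at about 250 bytes each, with between three and four bytes read for every byte written.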