7. Ka<a ——
a high-throughput distributed messaging system source from LinkedIn
Producer
KaLa LinkedIn Apache
KaLa Producer
&
"
'
(
Model
Two models: queuing
and publish-subscribe.
the la<er is more
commonly used.
high-level abstracPon
Each Topic corresponds
to one or more log files.
Each Topic has one or
more par@@ons.
Feature
Scalability & Durability
& Reliability &
Performance & Fault-
tolerant
Ecosystem
Stream processing
systems, Hadoop
integraPon, monitoring,
and deployment tools.
8. Ka<a
should be no7ced when using Ka<a
)
KaLa offset
Consumer
Buffer
KaLa gzip snappy lz4
CPU
KaLa KaLa
KaLa
Topic Par@@on
Par@@on
Topic Par@@on
Zookeeper
KaLa Zookeeper
ZK ZK
10. BackType nathanmarz Storm BackType
TwiXer Storm TwiXer TwiXer
Apache
' Feature
Storm ——
a distributed real-7me computa7on system source from TwiGer
Highly scalable
Fault-tolerant
Guarantees processing
Language agnosPc
* Topology of one Spout
* Topology of three Spout