Apache Kafka as Message Queue for your microservices and other occasions

Michael Reinsch

michael@movingfast.io

at Rug::B Feb 2018

Message Queue
• Queue acts as buffer

• Indirect communication

• Multiple consumer processes

• Multiple producers
[Diagram: Producer -> Message queue ("Hello!") -> Consumer]

With Exchange
• Queue is strongly coupled to consumer

• Exchange needs to know system topology

• Adding additional consumers is relatively expensive
[Diagram: Producer -> Exchange -> Message queue ("Hello!") -> Consumer X, Consumer Y]

Apache Kafka
• Topics are always multi-publisher and multi-subscriber

• Adding / removing consumers is very cheap
[Diagram: Producer -> Topic ("Hello!", "Greets") -> Consumer X, Consumer Y]

Apache Kafka: Distributed Streaming Platform
Key capabilities:

• Publish and subscribe to streams of records

• Store streams of records in a fault-tolerant way

• Process streams of records as they occur

Topics
• ‘Category’ for a stream of records

• Producers only append

• All published records are retained for a configurable retention period

• Consumers use offset pointers to store their last processed event
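
The append-only log and per-consumer offsets can be sketched with a small in-memory model. This is an illustration only, not how Kafka or ruby-kafka works internally; the class name `ToyTopic` and the consumer names are made up for the example, and retention is left out.

```ruby
# In-memory model of a topic: an append-only log plus one offset per consumer.
class ToyTopic
  def initialize
    @records = []            # append-only list of records
    @offsets = Hash.new(0)   # consumer name => next offset to read
  end

  # Producers only append; existing records are never modified.
  # Returns the offset of the record just written.
  def append(record)
    @records << record
    @records.size - 1
  end

  # Return the records this consumer has not seen yet and advance its offset.
  def poll(consumer)
    batch = @records.drop(@offsets[consumer])
    @offsets[consumer] = @records.size
    batch
  end
end

topic = ToyTopic.new
topic.append("Hello!")
topic.append("Greets")

x_first = topic.poll("consumer-x")    # both records
topic.append("Hello again!")
x_second = topic.poll("consumer-x")   # only the new record
y_first = topic.poll("consumer-y")    # all three, consumer-y has its own offset
```

Because each consumer only owns an offset, adding a consumer does not change anything for the producer or for other consumers.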

Topic Partitions
• Topic can have many partitions

• Partitions are an ordered,
immutable sequence of records

• Partition size is limited by disk space

• Partitions are replicated for fault tolerance

• Unit of parallelism
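
One common way to spread records over partitions is to hash a record key, so that all records with the same key land in the same partition and keep their order. The slides do not mention keys; the sketch below assumes keyed records, and the CRC32 hash and helper names `partition_for` and `route` are arbitrary choices for illustration (real clients pick their own hash function).

```ruby
require "zlib"

# Map a record key to a partition index.
def partition_for(key, partition_count)
  Zlib.crc32(key) % partition_count
end

# Append [key, value] records to per-partition arrays,
# preserving the order of records within each partition.
def route(records, partition_count)
  partitions = Array.new(partition_count) { [] }
  records.each do |key, value|
    partitions[partition_for(key, partition_count)] << [key, value]
  end
  partitions
end

events = [
  ["user-1", "signed up"],
  ["user-2", "signed up"],
  ["user-1", "placed order"],
  ["user-1", "paid"],
]
partitions = route(events, 3)

# All events for user-1 sit in one partition, in the order they were produced.
user_1_events = partitions[partition_for("user-1", 3)]
                  .select { |key, _| key == "user-1" }
                  .map(&:last)
```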

Consumer Groups
• Consumers can form consumer groups

• Each consumer in a group is the exclusive consumer of a “fair share” of partitions

• Strong ordering guarantee within a topic partition
[Diagram: Producer -> Topic ("Hello!", "greets") -> Consumer Group X, Consumer Group Y]
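
The “fair share” idea can be sketched as a partition assignment within one group. Round-robin is used here only because it is simple; it is an assumption for the example and not necessarily the strategy a given Kafka client uses. The helper name `assign_partitions` and the consumer names are hypothetical.

```ruby
# Assign partitions to the consumers of one group round-robin, so every
# partition has exactly one owner within that group.
def assign_partitions(partitions, consumers)
  assignment = consumers.map { |consumer| [consumer, []] }.to_h
  partitions.each_with_index do |partition, index|
    assignment[consumers[index % consumers.size]] << partition
  end
  assignment
end

# Two groups read the same five partitions independently of each other.
group_x = assign_partitions([0, 1, 2, 3, 4], ["x-1", "x-2"])
group_y = assign_partitions([0, 1, 2, 3, 4], ["y-1"])
```

Each group sees every partition, but inside a group a partition is never shared, which is what keeps ordering per partition intact.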

Demo!
Scripts at github.com/mreinsch/kafka_demo

Kafka vs. REST API
• Asynchronous, indirect communication

• Less coupling in producer -> easier to extend

• Fewer critical paths
[Diagram: Producer -> Topic (e.g. new users, new orders) -> Consumer Group X, Consumer Group Y]

Kafka vs. job queue
• Similar, but different

• Less coupling in producer
[Diagram: Producer -> Topic (e.g. new users, new orders) -> Consumer Group X, Consumer Group Y]

Challenges when using Kafka (or other message queues) for your microservices

Asynchronicity
• Example: existing clients use REST API, but processing is done by
some microservice

• Possible solution:

• Return `202 Accepted` pointing to a job resource

• Job resource returns status / actual resource location when finished

• Use another Kafka topic to communicate job status changes

• http://restcookbook.com/Resources/asynchroneous-operations/
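
The job-resource flow above might look roughly like this. It is a minimal in-memory sketch without a web framework or Kafka: the `JobStore` class, the `/jobs/...` and `/orders/42` paths, and the use of `303 See Other` for a finished job are assumptions made for the example.

```ruby
require "securerandom"

# Hypothetical in-memory job registry sitting behind a REST API.
class JobStore
  Response = Struct.new(:status, :headers, :body)

  def initialize
    @jobs = {}
  end

  # Client submits work: answer immediately with 202 and a job URL.
  def accept
    id = SecureRandom.uuid
    @jobs[id] = { state: "pending", location: nil }
    Response.new(202, { "Location" => "/jobs/#{id}" }, nil)
  end

  # Called when a job status change arrives, e.g. from a status topic.
  def complete(id, resource_location)
    job = @jobs.fetch(id)
    job[:state] = "finished"
    job[:location] = resource_location
  end

  # Client polls the job: report status, or point to the result once finished.
  def show(id)
    job = @jobs.fetch(id)
    if job[:state] == "finished"
      Response.new(303, { "Location" => job[:location] }, nil)
    else
      Response.new(200, {}, { "state" => job[:state] })
    end
  end
end

store = JobStore.new
accepted = store.accept
job_id = accepted.headers["Location"].split("/").last
pending = store.show(job_id)
store.complete(job_id, "/orders/42")
finished = store.show(job_id)
```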

More Asynchronicity
• Example: need data from another service

• Pragmatic solution:

• Just keep using a REST API

• Scalable solution:

• Combine REST API with local cache which gets
invalidated by an asynchronous event
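
A sketch of the cache-plus-invalidation idea, assuming the other service publishes an event carrying the id of the changed record. The class name `InvalidatingCache`, the event shape, and the block standing in for the REST call are all hypothetical.

```ruby
# Read-through cache in front of a remote lookup. The block passed to `new`
# stands in for a REST call to the other service.
class InvalidatingCache
  def initialize(&fetcher)
    @fetcher = fetcher
    @entries = {}
  end

  # Serve from the cache, falling back to the remote lookup on a miss.
  def fetch(id)
    @entries.fetch(id) { @entries[id] = @fetcher.call(id) }
  end

  # Called by the consumer for every change event: drop the stale entry.
  def on_event(event)
    @entries.delete(event.fetch("id"))
  end
end

remote_calls = 0
users = { 1 => "Alice" }
cache = InvalidatingCache.new do |id|
  remote_calls += 1
  users.fetch(id)
end

first = cache.fetch(1)    # remote lookup
second = cache.fetch(1)   # served from the cache
users[1] = "Alicia"
cache.on_event({ "type" => "user-updated", "id" => 1 })
third = cache.fetch(1)    # remote lookup again after invalidation
```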

Error handling
• You need a strategy for handling errors to avoid consumer processes getting stuck

• ruby-kafka doesn’t provide this; higher-level libraries exist

• kafka_worker, a minimalist worker abstraction

• Pushes message + metadata into an error topic on hard errors

• Work in progress…
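
The error-topic idea can be sketched as follows. Topics are plain arrays here, and the helper name `process_all` and the metadata fields are made up for the example; this is not the kafka_worker API.

```ruby
# Process messages one by one. On a hard error, record the message plus some
# metadata in an error topic and move on, so the consumer does not get stuck.
def process_all(messages, error_topic)
  messages.each do |message|
    begin
      yield message
    rescue StandardError => e
      error_topic << {
        "payload" => message,
        "error_class" => e.class.name,
        "error_message" => e.message,
      }
    end
  end
end

error_topic = []
handled = []
process_all(["1", "oops", "3"], error_topic) do |message|
  handled << Integer(message)   # raises ArgumentError for "oops"
end
```

The failing message ends up in `error_topic` for later inspection, while the messages after it are still processed.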

Event Loops
• Example:

• Consumer A: consumes topic-a, publishes to topic-b

• Consumer B: consumes topic-b, publishes to topic-a

• Usually much more complex…

• We haven’t had one yet, but with more services it
becomes more likely

Some Tips
• Add some common metadata to each event, such as
Origin-UUID (pass on when triggering other events),
Seen-By

• Document which service consumes / produces which
events

• Only include data relevant to the event, other data should
be fetched as needed
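
A sketch of the metadata tip, assuming Origin-UUID identifies the first event of a chain and Seen-By lists the services that have handled the chain so far. That reading of the two fields, the field names, and the helpers `build_event` and `loop?` are assumptions for the example; the same trail could also be used to spot the event loops described earlier.

```ruby
require "securerandom"

# Build a new event. If it is triggered by another event, pass on that
# event's Origin-UUID and extend its Seen-By trail; otherwise start fresh.
def build_event(type, data, service:, cause: nil)
  {
    "type" => type,
    "data" => data,
    "origin_uuid" => cause ? cause["origin_uuid"] : SecureRandom.uuid,
    "seen_by" => (cause ? cause["seen_by"] : []) + [service],
  }
end

# A service that already appears in the trail is about to close a loop.
def loop?(event, service)
  event["seen_by"].include?(service)
end

first_event = build_event("user-created", { "id" => 1 }, service: "service-a")
follow_up = build_event("welcome-mail-sent", { "id" => 1 },
                        service: "service-b", cause: first_event)
```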

Get in touch
Michael Reinsch

michael@movingfast.io

GitHub: mreinsch
Looking for new interesting projects / opportunities!
