Have you ever imagined what it would be like to build a massively scalable streaming application on Kafka: the challenges, the patterns and the thought process involved? How much of the application can be reused? What patterns will you discover? How does it all fit together? Depending upon your use case and business, this can mean many things. Starting out with a data pipeline is one thing, but evolving into a company-wide, business-critical real-time application that is entirely dependent upon a streaming platform is a giant leap. Large-scale streaming applications are also called event streaming applications. They differ fundamentally from other data systems: an event streaming application is viewed as a series of interconnected streams, topologically defined using stream processors, holding state that models your use case as events - almost like a deconstructed real-time database.
In this talk, I step through the origins of event streaming systems, understanding how they are developed from raw events into something that can be adopted at an organizational scale. I start with event-first thinking and Domain-Driven Design to build data models that work with the fundamentals of Streams, Kafka Streams, KSQL and Serverless (FaaS).
Building upon this, I explain how to build common business functionality by stepping through the patterns for:
- Scalable payment processing
- Run it on rails: instrumentation and monitoring
- Control flow patterns (start, stop, pause)
Finally, all of these concepts are combined in a solution architecture that can be used at enterprise scale. I will introduce enterprise patterns such as events-as-a-backbone, events as APIs, and methods for governance and self-service. You will leave this talk with an understanding of how to model events with event-first thinking, how to work towards reusable streaming patterns and, most importantly, how it all fits together at scale.
4.
“We believe that the major contributor to this complexity in many systems is the handling of state and the burden that this adds when trying to analyse and reason about the system.”
Out of the Tar Pit, 2006
6. What are microservices?
Microservices are a software development technique - a variant of the service-oriented architecture (SOA) architectural style that structures an application as a collection of loosely coupled services.
https://en.wikipedia.org/wiki/Microservices
11. What have we learned about microservices?
● Scaling is hard
● Handling state is hard
● Sharing and coordinating are hard
● Running a database in each microservice is hard
22. Events
Why do you care?
Loose coupling, autonomy, evolvability, scalability, resilience, traceability, replayability
EVENT-FIRST CHANGES HOW YOU THINK ABOUT WHAT YOU ARE BUILDING
...more importantly...
42. Payments system: bounded context
[1] How much is being processed?
Expressed as:
- Count of payments inflight
- Total $ value processed
[2 & 3] Update the account balance
Expressed as:
- Debit
- Credit
[4] Confirm successful payment
Expressed as:
- Total volume today
- Total $ amount today
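Steps [2 & 3] above amount to folding debit/credit effects over a stream of payment events. The following is a minimal Python sketch of that idea, not the real Kafka Streams API; the `Payment` fields and account names are invented for illustration.

```python
from dataclasses import dataclass

# Hypothetical payment event; field names are illustrative, not from the talk.
@dataclass(frozen=True)
class Payment:
    payment_id: str
    from_account: str
    to_account: str
    amount: int  # cents, to avoid float rounding

def apply_payment(balances: dict, p: Payment) -> dict:
    """Steps [2 & 3]: debit the source account, credit the destination."""
    new = dict(balances)
    new[p.from_account] = new.get(p.from_account, 0) - p.amount
    new[p.to_account] = new.get(p.to_account, 0) + p.amount
    return new

balances = {}
for p in [Payment("p1", "alice", "bob", 500),
          Payment("p2", "bob", "carol", 200)]:
    balances = apply_payment(balances, p)
# balances now reflects every payment event applied in order
```

In a real deployment this fold would be a stream processor maintaining a state store keyed by account, with Kafka handling partitioning and fault tolerance.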
52. Instrumentation Plane (trust)
Goal: Prove the application is meeting business requirements
Metrics:
- Payments inflight: count and dollar value
- Payments complete: count and dollar value
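These metrics can be derived purely from the event stream: inflight is everything started but not yet completed. A small sketch of that derivation, with invented event names ("started"/"completed") rather than anything from the talk:

```python
# Events are (kind, payment_id, amount) tuples; the kind names are assumptions.
def instrument(events):
    inflight = {}            # payment_id -> amount still being processed
    completed_count = 0
    completed_value = 0
    for kind, pid, amount in events:
        if kind == "started":
            inflight[pid] = amount
        elif kind == "completed":
            inflight.pop(pid, None)
            completed_count += 1
            completed_value += amount
    return {
        "payments_inflight_count": len(inflight),
        "payments_inflight_value": sum(inflight.values()),
        "payments_complete_count": completed_count,
        "payments_complete_value": completed_value,
    }

metrics = instrument([
    ("started", "p1", 500),
    ("started", "p2", 200),
    ("completed", "p1", 500),
])
# p2 is still inflight; p1 has completed
```

Because the metrics are computed from the same events the business logic consumes, the instrumentation plane proves what the application actually did, not what a separate monitoring agent sampled.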
67. Pattern: Events as a backbone
[diagram: FaaS functions and apps in the Payments department and Departments 2-4, all connected through a shared event backbone]
68. Patterns: Topic naming
[diagram: FaaS functions and apps in the Payments department and Department 2 - what is going on here?]
bikeshedding (uncountable)
1. Futile investment of time and energy in discussion of marginal technical issues.
2. Procrastination.
https://en.wiktionary.org/wiki/bikeshedding
Parkinson observed that a committee whose job is to approve plans for a nuclear power plant may spend the majority of its time on relatively unimportant but easy-to-grasp issues, such as what materials to use for the staff bikeshed, while neglecting the design of the power plant itself, which is far more important but also far more difficult to criticize constructively.
69. Patterns: Topic conventions
Don’t
1. Use fields that change
2. Use fields if the data is available elsewhere
3. Tie topic names to consumers or producers
Do
<message type>.<dataset name>.<data name>
<app-context>.<message type>.<dataset name>.<data name>
Source: Chris Riccomini
https://riccomini.name/how-paint-bike-shed-kafka-topic-naming-conventions
Message types:
● logging
● queuing
● tracking
● etl/db
● streaming
● push
● user
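A convention is only useful if it is enforced. Here is a small sketch of a topic-name builder that validates against the scheme above; the validation rules (lowercase segments, splitting "etl/db" into two types since "/" is not usable in a topic name) are my assumptions layered on Riccomini's scheme, not part of it.

```python
import re

# "etl/db" from the convention is split into two types here - an assumption,
# since "/" cannot appear in a Kafka topic name.
MESSAGE_TYPES = {"logging", "queuing", "tracking", "etl", "db",
                 "streaming", "push", "user"}

def topic_name(message_type, dataset, data, app_context=None):
    """Build <message type>.<dataset name>.<data name>, optionally
    prefixed with <app-context>."""
    if message_type not in MESSAGE_TYPES:
        raise ValueError(f"unknown message type: {message_type}")
    parts = [app_context] if app_context else []
    parts += [message_type, dataset, data]
    if any(not re.fullmatch(r"[a-z0-9-]+", p) for p in parts):
        raise ValueError("segments must be lowercase alphanumeric/hyphen")
    return ".".join(parts)

print(topic_name("streaming", "payments", "confirmed"))
# streaming.payments.confirmed
```

Centralising this in one shared helper (or a topic-provisioning service) is one way to make the convention self-service rather than a per-team argument.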
70. What about that software crisis that started in 1968?
“We believe that the major contributor to this complexity in many systems is the handling of state and the burden that this adds when trying to analyse and reason about the system.”
Out of the Tar Pit, 2006
71. Our mental model: Abstraction as an Art
Layers (bottom-up): Event → Stream → Stream processor → Bounded context → Chained/orchestrated bounded contexts
Pillars: Business function, Control plane, Instrumentation, Operations
72. Key takeaway (state)
Event-streaming microservices are the new atomic unit:
1. Provide simplicity (and time travel)
2. Handle state (via Kafka Streams)
3. Provide a new paradigm: convergent data and logic processing
73. Key takeaway (complexity)
● Event-streaming apps: model as bounded-context dataflows, handle state and scaling
● Patterns: build reusable dataflow patterns (instrumentation)
● Composition: bounded-context chaining and layering
● Composition: choreography and orchestration
The art of the event streaming application: streams, stream processors and scale
The world has changed, so have we, but we still have many of the same problems
Complexity crisis since 1968
Large apps are hard; state is hard; doing stuff fast is hard
Unless you go back to first principles - events
Why events? Model the world, they are real… https://en.wikipedia.org/wiki/The_Million_Dollar_Homepage
Events first - not event driven - event-streaming not event-driven
Event-driven microservices? - kafka events in between, time-travel, scale….
What about state…. KTables
KTable scaling - projections for CQRS or Interactive queries
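A KTable is, in essence, the latest value per key materialized from a changelog stream, queryable in place (interactive queries). This toy Python version illustrates only the semantics - it ignores partitioning, changelog replication and fault tolerance, which real Kafka Streams handles.

```python
# Toy KTable: latest-value-per-key view over a changelog stream.
class ToyKTable:
    def __init__(self):
        self._store = {}

    def apply(self, key, value):
        if value is None:          # tombstone record: delete the key
            self._store.pop(key, None)
        else:
            self._store[key] = value

    def get(self, key):            # "interactive query" against local state
        return self._store.get(key)

table = ToyKTable()
for key, value in [("alice", 100), ("alice", 250), ("bob", 75), ("bob", None)]:
    table.apply(key, value)
print(table.get("alice"), table.get("bob"))  # 250 None
```

The same mechanism underlies CQRS-style projections: the changelog is the write side, the materialized store is a read-optimized view you can scale independently.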
Let’s build something… a simple dataflow: a series of processors for payment processing
Lets make this a picture: https://en.wikipedia.org/wiki/The_Million_Dollar_Homepage
Except…. We only have part of the picture. What about failure? Upgrades? How fast is it going? What is happening - is it working?
We need pillars to support the runtime: Instrumentation, Control, Operations
But how - remember events give us decoupling and SoC - separation of concerns
Instrumentation Pillar: what and why!
Control Pillar : what and why!
Operations Pillar : what and why!
Payment system (all 3 together)
Autoscaling based upon business metrics - instrumentation → control → business scaling up
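The instrumentation → control → scaling loop can be reduced to a small policy function: the control plane reads a business metric and emits a desired instance count. The thresholds, capacity figure and function name below are invented for the example.

```python
import math

# Illustrative scaling policy: instrumentation supplies the inflight count,
# the control plane turns it into a desired number of processor instances.
def desired_instances(inflight_count, per_instance_capacity=100,
                      min_instances=1, max_instances=10):
    needed = math.ceil(inflight_count / per_instance_capacity)
    return max(min_instances, min(max_instances, needed))

print(desired_instances(0), desired_instances(250), desired_instances(5000))
# 1 3 10
```

The important point is that the trigger is a business metric (payments inflight), not a machine metric like CPU, so scaling tracks what the business actually experiences.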
Supporting A/B paths, logical evolution
(choreography versus orchestration)
Have we handled the complexity crisis? Have we handled scale? We only painted 1 part of the picture, a pixel on a screen.
We have provided a pattern to support architectural composition, multiple teams, decentralisation
Let’s paint more pixels - how do we do this?
Bounded context decomposition to an application, central nervous system
[microservices] 10 mins
Short history of microservices
Any language
Domain driven design
Event sourcing, domain driven design
So, what do I mean by an event?
Events are facts, immutable, accrue, many related events form a stream
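Because events are immutable facts that accrue, state is just a fold (left reduce) over the stream, and replaying a prefix of the log gives "time travel" to any earlier state. A minimal sketch, with invented event names:

```python
from functools import reduce

# Immutable facts accrue in order; they are never updated in place.
events = [("deposit", 100), ("withdraw", 30), ("deposit", 50)]

def apply(balance, event):
    kind, amount = event
    return balance + amount if kind == "deposit" else balance - amount

now = reduce(apply, events, 0)                 # fold over the whole log
as_of_event_2 = reduce(apply, events[:2], 0)   # replay a prefix: time travel
print(now, as_of_event_2)  # 120 70
```

This is the core of event sourcing: the log is the source of truth, and any state is a derived, reproducible view.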
Interconnected streams via stream processors
Events -> streams -> behaviour -> what is the usecase we are modelling? What is the domain?
Determinism
Repeatability
Observability
Persona: Business, support, developers
Code format http://hilite.me/
Operations aren’t a new concept; however, the new age of NoOps, configuration management, automation and the like encompasses operational concerns under the banner of DevOps. I will limit this section to a few operational dataflows that can be used to support this pillar. At the risk of pointing out the obvious, the Operational plane is entirely dependent on the instrumentation plane for observability and on the control plane for coordination.
Application logs: Each microservice will use a log appender to selectively send log messages to a specific topic that is relevant to the application context. The log message will also contain metadata about the runtime, perhaps including the input topic, output topic and consumer group, as well as the source event information. These logs are relevant to business events and should be tied to the bounded context.
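As a sketch of such an appender, here is a custom Python `logging.Handler` that wraps each record with event-context metadata before it would be produced to a per-context log topic. The context field names (`input_topic`, `consumer_group`) and the in-memory list standing in for a Kafka producer are assumptions for illustration.

```python
import json
import logging

class TopicLogHandler(logging.Handler):
    """Wraps log records with runtime metadata; in a real service the
    emitted JSON would be produced to a context-specific log topic."""
    def __init__(self, context):
        super().__init__()
        self.context = context
        self.emitted = []            # stand-in for a Kafka producer

    def emit(self, record):
        self.emitted.append(json.dumps({
            "message": record.getMessage(),
            "level": record.levelname,
            **self.context,          # input topic, consumer group, etc.
        }))

log = logging.getLogger("payments")
handler = TopicLogHandler({"input_topic": "payments.inbound",
                           "consumer_group": "payments-processor"})
log.addHandler(handler)
log.warning("balance check failed")
print(handler.emitted[0])
```

Routing logs through topics keeps the logging dataflow consistent with every other dataflow in the system: same transport, same retention and replay semantics.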
Error/Warning logs: Similar to above, but focus on Error/Warning log events and send to relevant topic. It is also useful to build a log-context that accumulates relevant information to enrich the log message if an error/warning does occur.
Audit logs: Each microservice will capture a security context (user, group, credential, reason) as well as event-context (i.e. event-id, event-origin, source and destination topic) and emit to a relevant, contextually determined audit-log topic.
Lineage: Similar to above, except the event will contain trace information about which processors, and in which context, accessed or modified the event along its dataflow.
Dead-letter-queues: provide a place to store raw events that cannot be processed. Processing failure could be due to any number of reasons, from deserialization failure to invalid values or invalid references (join failure etc). In any case, the purpose is to provide the opportunity to become aware of the error and then fix it. The dead-letter-queue is the last failure scenario and indicative of a failing CI/CD process where a scenario has occurred that is not catered for. The dead-letter-queue should always be empty. In some cases a saga pattern will be needed to unwind the preceding state that caused the error; remember, events must be processed idempotently.
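A minimal sketch of the dead-letter-queue pattern: any event that cannot be deserialized or validated is captured with its failure reason rather than silently dropped. The validation rule and the list standing in for a DLQ topic are invented for the example.

```python
import json

dead_letter_queue = []   # stand-in for a dead-letter topic

def process(raw: bytes):
    """Deserialize and validate; unprocessable events go to the DLQ
    with the failure reason, never silently dropped."""
    try:
        event = json.loads(raw)
        if event.get("amount", -1) < 0:
            raise ValueError("invalid amount")
        return event
    except (json.JSONDecodeError, ValueError) as err:
        dead_letter_queue.append({"raw": raw.decode(errors="replace"),
                                  "error": str(err)})
        return None

process(b'{"amount": 10}')        # ok
process(b'not json')              # deserialization failure -> DLQ
process(b'{"amount": -5}')        # validation failure -> DLQ
print(len(dead_letter_queue))     # 2
```

Monitoring should alert whenever this queue is non-empty, since - as noted above - a populated DLQ indicates a scenario the CI/CD process never catered for.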
● logging - For logging data (slf4j, syslog, etc.)
● queuing - For classical queuing use cases.
● tracking - For tracking events such as user clicks, page views, ad views, etc.
● etl/db - For ETL and CDC use cases such as database feeds.
● streaming - For intermediate topics created by stream processing pipelines.
● push - For data that’s being pushed from offline (batch computation) environments into online environments.
● user - For user-specific data such as scratch and test topics.