One sink to rule them all: Introducing the new Async Sink

ONE SINK TO RULE THEM ALL: INTRODUCING THE NEW ASYNC SINK
© 2022, Amazon Web Services, Inc. or its affiliates.
One sink to rule them all
Introducing the new Async Sink
Danny Cranmer
Sr. Software Development Engineer, AWS
Steffen Hausmann
Principal Streaming Architect, AWS

Apache Flink’s connector ecosystem
2
Amazon Kinesis
Data Streams
Elasticsearch
RabbitMQ
Google PubSub

How to build a high quality connector
Buffering Send batch
requests
Retry failed
requests
Checkpointing Rate limiting and
Backpressure

Async Sink internals
internal buffer
endpoint
message request entries
batch request
response
requeue
failed requests
checkpoint
sink to
AsyncSinkWriter
request entries -> batch request
ElementConverter
message -> request entry
AsyncSinkWriter
response -> failed requests
AsyncSink
WriterStateSerializer
4

How the Async Sink manages throughput
6
endpoint
sink subtask3
sink subtask2
sink subtask1
1. Limit the number of in-flight request entries
2. If a batch request was successful, increase
the limit by a constant factor
3. If a batch request failed, cut the limit in half
Additive Increase/Multiplicative Decrease

Let’s go build!
7

Async Sink configuration
endpoint
message request entries
batch request
sink to
maxRecordSizeInBytes
8
maxInFlightRequests
maxBatchSize
maxBatchSizeInBytes
maxBufferedRequests
maxTimeInBufferMs
internal buffer

Current limitations
At least
once
Ordering Thread pool
management

What’s Next?
FLIP-242
Configurable
Rate Limiting
FLIP-252
Amazon DynamoDB
Sink
Wider
Community
Adoption

Thank you!
Danny Cranmer Steffen Hausmann

One sink to rule them all: Introducing the new Async Sink

In this document

More Related Content

What's hot

More from Flink Forward

Recently uploaded

One sink to rule them all: Introducing the new Async Sink