"Kafka Connect is an ideal tool for building data pipelines. It is both reliable and scalable, with a pluggable interface that lets you flow data between Kafka and any system you need. A Connect pipeline is made up of many different components, and understanding how each of these interact together is essential, even for the simplest setup.
In this talk we will introduce the Connect components, from connectors, to transformations to the runtime itself. We will also share some of the new capabilities and best practices that you should be aware of to help you run and manage connectors effectively.
Finally we will talk about some different open source projects that have been built on top of Connect that can help you get the most out of the framework."
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Similar to Getting up to speed with Kafka Connect: from the basics to the latest features with Kate Stanley and Mickael Maison | Kafka Summit London 2022
Similar to Getting up to speed with Kafka Connect: from the basics to the latest features with Kate Stanley and Mickael Maison | Kafka Summit London 2022 (20)
7. What is a Connector?
7
Connector
debezium-connector-mongodb-1.9.0.Final.jar
8. What is a Connector?
8
Connector
Connect
plugin.path
debezium-connector-mongodb-1.9.0.Final.jar
debezium-connector-mongodb-1.9.0.Final.jar
9. What is a Connector?
9
Connector
Connect
plugin.path
Connector
debezium-connector-mongodb-1.9.0.Final.jar
debezium-connector-mongodb-1.9.0.Final.jar
10. What is a Connector?
10
Connector
Connect
Connector
plugin
Connector Connector
debezium-connector-mongodb-1.9.0.Final.jar
plugin.path
27. Run in Distributed mode
27
Configuration
Worker
▸ group.id
▸ offset.storage.topic
▸ config.storage.topic
▸ status.storage.topic
28. Use automation for managing
clusters
Optional
section
marker
or
title
28
29. Use automation for managing clusters
29
Responsibilities
▸ Restarting failed connectors and tasks
▸ Scaling up and down
▸ Rolling out upgrades
▸ Adding new connector plugins
30. Use automation for managing clusters
30
Solutions
▸ Operators
▸ Pre-built images
▸ Tools
39. Consider using a Schema
Registry
Optional
section
marker
or
title
39
40. Consider using a schema registry
40
{
"title": "Getting up to Speed with Kafka Connect"
"speaker": "Mickael and Kate",
"room": "Arora West"
}
My Events
42. Consider using a schema registry
42
Using Avro Schemas with Apicurio
key.converter=io.apicurio.registry.utils.converter.AvroConverter
key.converter.apicurio.registry.url=http://registry:8080/api
...
52. Master the REST API
52
Manage logger level
GET /admin/loggers
PUT /admin/loggers/org.apache.kafka.connect -d '{"level": "DEBUG"}'
53. Master the REST API
53
Restart connectors and tasks
POST /connectors/file-sink/restart
POST /connectors/file-sink/restart?includeTasks=true&onlyFailed=true
Task
Connector
POST /connectors/file-sink/tasks/0/restart
67. KIP-618
67
▸ Source connectors use a transactional producers for
writing records and offsets
▸ Connectors can declare whether they support exactly
once
▸ Workers have exactly once disabled by default