Getting Started with MirrorMaker 2
Mickael Maison - IBM
Ryanne Dolan - Twitter
Kafka Summit EU 2021
Summary
- Pain points of MM1
- Overview of MM2 Connectors
- Deployment modes
- Use cases and Scenarios
- Tips and Tricks to get started
Why MM2?
• Address problems with legacy MirrorMaker (MM1)
• Take advantage of Connect ecosystem
• Enable new replication use-cases
MirrorMaker 1 Pain Point #1
Lack of consumer group offset mirroring
• Data replicated, but not consumer offsets
• No offset translation
• Timestamp-based recovery
MM2:
• Offset translation
• Consumer group checkpoints
MirrorMaker 1 Pain Point #2
Hard to deploy and monitor
• No centralized "control plane"
• Each individual consumer and producer configured separately
• No high-level metrics
MM2:
• High-level "driver" manages replication between many clusters
• High-level configuration file defines global replication topology
• Cross-cluster metrics like Replication Latency
MirrorMaker 1 Pain Point #3
Unable to keep topics synchronized
• Configuration changes not sync'd
• Partitions not sync'd
• ACLs not sync'd
MM2:
• Topic configuration sync'd
• Partitions sync'd
• ACLs sync'd
The Connectors
MirrorSourceConnector
• Replicates "remote topics"
• Sync topic configuration
• Sync topic ACLs
• Emit offset sync
[Diagram: a MirrorSourceConnector running against us-east reads topic1, configs and ACLs from us-west, produces the records to us-west.topic1 on us-east, and emits offset syncs to mm2-offset-syncs.us-east.internal on us-west]
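Remote topic naming follows the ReplicationPolicy in use. A minimal sketch (the class name RemoteTopicNaming is made up; the policy class comes from the connect-mirror-client library) of how DefaultReplicationPolicy maps names:

import org.apache.kafka.connect.mirror.DefaultReplicationPolicy;

public class RemoteTopicNaming {
    public static void main(String[] args) {
        // DefaultReplicationPolicy prefixes replicated topics with the source cluster alias.
        DefaultReplicationPolicy policy = new DefaultReplicationPolicy();
        System.out.println(policy.formatRemoteTopic("us-west", "topic1")); // us-west.topic1

        // The source cluster and the original name can be recovered from a remote topic name.
        System.out.println(policy.topicSource("us-west.topic1"));   // us-west
        System.out.println(policy.upstreamTopic("us-west.topic1")); // topic1
    }
}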
MirrorCheckpointConnector
• Consumes offset syncs
• Emits checkpoints: consumer group state
• Enables failover:
• Automatically: __consumer_offsets (since 2.7.0)
• Programmatically: mirror-client's translateOffsets()
[Diagram: the MirrorCheckpointConnector running against us-east consumes offset syncs from mm2-offset-syncs.us-east.internal and consumer group state from __consumer_offsets on us-west, then emits checkpoints to mm2-checkpoints.us-west.internal and translated offsets to __consumer_offsets on us-east]
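A rough sketch of the programmatic failover path (the class name, group id and broker addresses are placeholders): fetch translated offsets with mirror-client's RemoteClusterUtils.translateOffsets() and seek a consumer on the target cluster.

import java.time.Duration;
import java.util.Map;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.connect.mirror.RemoteClusterUtils;

public class FailoverToUsEast {
    public static void main(String[] args) throws Exception {
        // Connection details for the target cluster (us-east), where the checkpoints live.
        Map<String, Object> target = Map.of("bootstrap.servers", "us-east-broker:9092");

        // Translate my-group's committed offsets from us-west into us-east offsets.
        Map<TopicPartition, OffsetAndMetadata> offsets = RemoteClusterUtils.translateOffsets(
                target, "us-west", "my-group", Duration.ofSeconds(30));

        // Position a consumer on us-east at the translated offsets before resuming.
        Map<String, Object> consumerProps = Map.of(
                "bootstrap.servers", "us-east-broker:9092",
                "group.id", "my-group",
                "key.deserializer", "org.apache.kafka.common.serialization.ByteArrayDeserializer",
                "value.deserializer", "org.apache.kafka.common.serialization.ByteArrayDeserializer");
        try (Consumer<byte[], byte[]> consumer = new KafkaConsumer<>(consumerProps)) {
            consumer.assign(offsets.keySet());
            offsets.forEach((tp, offset) -> consumer.seek(tp, offset.offset()));
            consumer.commitSync(offsets); // or simply start polling from here
        }
    }
}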
MirrorHeartbeatConnector
• Sends heartbeats to remote clusters
• Useful for monitoring replication flows
• Enables clients to discover replication topology
• mirror-client's upstreamClusters()
[Diagram: the MirrorHeartbeatConnector produces to the heartbeats topic on us-west; the MirrorSourceConnector then replicates it to us-west.heartbeats on us-east]
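A small sketch (class name and broker address are placeholders) of discovering the replication topology from the replicated heartbeats with mirror-client:

import java.util.Map;
import java.util.Set;
import org.apache.kafka.connect.mirror.RemoteClusterUtils;

public class DiscoverTopology {
    public static void main(String[] args) throws Exception {
        // Connection details for the cluster being inspected.
        Map<String, Object> props = Map.of("bootstrap.servers", "us-east-broker:9092");

        // Which clusters replicate into this one?
        Set<String> upstreams = RemoteClusterUtils.upstreamClusters(props);
        System.out.println("Upstream clusters: " + upstreams);

        // Distance (number of replication hops) to each upstream cluster.
        for (String upstream : upstreams) {
            System.out.println(upstream + ": " + RemoteClusterUtils.replicationHops(props, upstream) + " hop(s)");
        }
    }
}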
Deployment Modes
Dedicated aka Driver mode
• connect-mirror-maker.sh
• Easy configuration
• Runs all connectors
Dedicated aka driver mode
[Diagram: a single MirrorMaker 2 process embeds two Connect runtimes: one for the target cluster, running the Source and Checkpoint connectors, and one for the source cluster, running the Heartbeat connector]
Connect Distributed
• Reuse existing Connect cluster
• Full control
• More configuration
Use cases and scenarios
Active/Standby
[Diagram: MM2 mirrors us-west to us-east; topic1 on us-west is replicated as us-west.topic1 on us-east, while topic2 stays local to us-east]
Active/Standby - Dedicated
mm2.properties
clusters=us-west,us-east
us-west.bootstrap.servers=…
us-east.bootstrap.servers=…
us-west->us-east.enabled=true
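As a sketch of getting this running (broker addresses are placeholders; per the speaker notes, any SSL/SASL settings go in the same file, prefixed with the cluster alias), the filled-in file and the launch command:

clusters = us-west, us-east
us-west.bootstrap.servers = us-west-broker:9092
us-east.bootstrap.servers = us-east-broker:9092
us-west->us-east.enabled = true

# Start the dedicated driver with the single configuration file:
# ./bin/connect-mirror-maker.sh mm2.properties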
Active/Standby - Connect
connect-distributed.properties
https://github.com/apache/kafka/blob/trunk/config/connect-distributed.properties
source-connector.json
{
  "name": "MirrorSourceConnector",
  "config": {
    "connector.class": "org.apache.kafka.connect.mirror.MirrorSourceConnector",
    "name": "MirrorSourceConnector",
    "topics": ".*",
    "tasks.max": "30",
    "source.cluster.alias": "us-west",
    "target.cluster.alias": "us-east"
  }
}
checkpoint-connector.json
{
  "name": "MirrorCheckpointConnector",
  "config": {
    "connector.class": "org.apache.kafka.connect.mirror.MirrorCheckpointConnector",
    "name": "MirrorCheckpointConnector",
    "groups": ".*",
    "tasks.max": "15",
    "source.cluster.alias": "us-west",
    "target.cluster.alias": "us-east"
  }
}
Active/Active
[Diagram: MM2 mirrors in both directions; topic1 on us-west appears as us-west.topic1 on us-east, and topic2 on us-east appears as us-east.topic2 on us-west]
Active/Active - Dedicated
mm2.properties
clusters=us-west,us-east
us-west.bootstrap.servers=…
us-east.bootstrap.servers=…
us-west->us-east.enabled=true
us-east->us-west.enabled=true
Active/Active - Connect
[Diagram: the same bidirectional topology, with a separate MM2 Connect cluster next to each Kafka cluster]
Active/Active - Connect
One set of files per Connect cluster:
connect-distributed.properties
source-connector.json
checkpoint-connector.json
heartbeat-connector.json
connect-distributed.properties
source-connector.json
checkpoint-connector.json
heartbeat-connector.json
Going into Production
Monitoring
• Throughput/latency per partition
• kafka.connect.mirror:type=MirrorSourceConnector - byte-rate|record-age-ms|replication-latency-ms
• Offset Checkpoint latency
• kafka.connect.mirror:type=MirrorCheckpointConnector - checkpoint-latency-ms
• Connect task/Connector health
• http://kafka.apache.org/documentation/#connect_monitoring
• Connect task configurations
• GET /connectors/<connector>/tasks-config (since Kafka 2.8)
• Duplicated tasks: Connect JIRA KAFKA-9849
• Fixed in 2.4.2, 2.5.1, 2.6.0 and above
Controls
• Scale Connect
  - tasks.max
  - Number of workers
• Select mirroring workload
  - topics and groups settings
• Offset reset policy
  - consumer.auto.offset.reset=latest (since Kafka 2.8)
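A sketch of these knobs in mm2.properties for the dedicated mode (the topic and group patterns below are made-up examples):

# Scale: number of tasks each connector may spread across the Connect workers
tasks.max = 30

# Workload: mirror only what you need (both settings default to .*)
us-west->us-east.topics = orders.*
us-west->us-east.groups = orders-.*

# Offset reset policy for newly started mirroring (as on the slide, since Kafka 2.8)
consumer.auto.offset.reset = latest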
Kafka Improvement Proposals
• KIP-310: Add a Kafka Source Connector to Kafka Connect ✅ (withdrawn in favor of MM2)
• KIP-382: MirrorMaker 2.0 ✅
• KIP-597: MirrorMaker2 internal topics Formatters ✅
• KIP-605: Expand Connect Worker Internal Topic Settings ✅
• KIP-618: Atomic commit of source connector records and offsets
• KIP-661: Expose task configurations in Connect REST API ✅
• KIP-656: MirrorMaker2 Exactly-once Semantics
• KIP-690: Add additional configuration to control MirrorMaker 2 internal topics naming convention
• KIP-710: Full support for distributed mode in dedicated MirrorMaker 2.0 clusters
• KIP-712: Shallow Mirroring
• KIP-716: Allow configuring the location of the offset-syncs topic with MirrorMaker2
• KIP-720: Deprecate MirrorMaker 1 ✅
Notable Progress
• KAFKA-8930: MirrorMaker v2 documentation
• KAFKA-9175 MirrorMaker 2 emits invalid topic partition metrics
• KAFKA-9352 unbalanced assignment of topic-partition to tasks
• KAFKA-9849 Fix issue with worker.unsync.backoff.ms creating zombie workers when incremental cooperative rebalancing is used
• KAFKA-10710 MirrorMaker 2 creates all combinations of herders
• KAFKA-12254 MirrorMaker 2.0 creates destination topic with default configs
Ongoing:
• KAFKA-10339 and KAFKA-10483: MirrorSinkConnectors and EOS
• KAFKA-9726 LegacyReplicationPolicy
Thank You!
Mickael Maison - @MickaelMaison
Ryanne Dolan - @DolanRyanne
https://kafka.apache.org/documentation/#georeplication
https://github.com/apache/kafka/tree/trunk/connect/mirror
https://cwiki.apache.org/confluence/display/KAFKA/KIP-382%3A+MirrorMaker+2.0


Editor's Notes

  • #4 Replication use cases including: disaster recovery, backup, failover/failback, cloud migration, and so on.
  • #5 The basic problem here is that, in Kafka, offsets are never guaranteed to be consistent between clusters, even if the same records are sent in the exact same order. (Actually, you can observe this even within one cluster if you try sending the same records, in the same order, to two different topics.) This is problematic if we want one cluster to be a mirror of another cluster. The data might be the same, but the offsets will definitely be different. So unless we solve this problem, we can't really have a so-called "backup cluster". Not a very good backup. MM2: we'll talk about how MM2 solves this problem, but basically we need to keep a mapping of offsets between clusters so we can translate offsets between them. Timestamp-based recovery has been available since KIP-33: basically, rewind to a previous point in time and use this as a basis for disaster recovery. Very problematic in practice. For example, you have to assume each consumer is caught up to real time; if there is a lagging consumer, you might end up fast-forwarding it accidentally. Consumer group offset mirroring is the biggest feature of MM2. Each consumer group is checkpointed automatically between clusters, so you know how to recover each individual consumer.
  • #6 Configuring each consumer and producer separately: bad UX. High-level driver: think of it as a bunch of replication workers running together under one consistent control plane. Much better than configuring a bunch of individual producers and consumers. The driver spins up a whole bunch of producers and consumers.
  • #7 Key word here is "synchronized" (not just replicated). "Topics" is more than just "records": topics have metadata, e.g. the number of partitions, ACLs, etc. So again, MM1 didn't create a very good "mirror".
  • #8 In the second part of the session, I want to give you tips and practical knowledge about running MM2. By the end of this session, you should be able to get it running yourself. The first decision to make is the deployment mode: how are you going to run MM2? As said, MM2 is a set of connectors for Kafka Connect, but there are 2 options: Dedicated mode, or explicitly on Connect.
  • #10 Within the MM2 process, you get 2 Connect runtimes: 1 runtime for the target cluster, where the source and checkpoint connectors run, and 1 runtime for the source cluster, as the heartbeat connector produces records to the source cluster.
  • #13 In the second part of the session, I want to give you tips and practical knowledge about running MM2. By the end of this session, you should be able to get it running yourself. The first decision to make is the deployment mode: how are you going to run MM2? As said, MM2 is a set of connectors for Kafka Connect, but there are 2 options: Dedicated mode, or explicitly on Connect.
  • #14 Dedicated, also known as driver mode. This is the mode first encountered by many people, as it's what happens when you run the connect-mirror-maker.sh tool. A lot happens behind the scenes: you don't interact with Connect explicitly and the REST API is not available. This mode offers a very expressive way to configure it and is set up via a single file. It runs all connectors directly. It's great to get started, or if you have a small to medium use case without specific requirements.
  • #15 Within the MM2 process, you get 2 Connect runtimes: 1 runtime for the target cluster, where the source and checkpoint connectors run, and 1 runtime for the source cluster, as the heartbeat connector produces records to the source cluster.
  • #16 You can also run the connectors directly in Connect, like any other connectors, using the Connect Distributed mode we know and already use (I'm not going to cover Connect Standalone). Great if you already have Connect clusters. This provides full control: you can start exactly the connectors you want, and keep Connect runtimes near the clusters with their topics. The trade-off: it's configured via JSON files, 1 per connector, so it's more complex.
  • #17 Hopefully you now understand the deployment options and have picked your preferred solution. Let's now look at the use cases MM2 enables. It covers a lot of scenarios and pretty much any cluster topology can be built. In the interest of time, I'll cover the 2 most common ones. Ryanne, in his talk at the last Kafka Summit in London, demonstrated a few more advanced scenarios.
  • #18 Active/Standby is a slightly misleading name, as you can still use the target cluster; it's just that mirroring is unidirectional. Any topics/groups on us-west will be mirrored to us-east. Naming is fully configurable.
  • #19 List your clusters, then their connection information (+ SSL + SASL). Use the fancy arrow notation to describe the mirroring direction. Very simple.
  • #20 A bit more configuration with Connect. Slides will be available later; the point is not to look at the exact payloads but to see that it's not a lot of JSON in the end. No heartbeat connector, as it would require a second Connect runtime.
  • #21 Very similar to Active/Standby. MM2 prevents loops. Both runtimes run all 3 connectors.
  • #22 Basically just add an extra line enabling mirroring in the other direction. That's it, done! Note that here you'll be running source connectors on both runtimes; one of them is distant from its Kafka cluster.
  • #23 In order to do active/active you need 2 Connect clusters. You could deploy Dedicated this way too.
  • #24 More configuration files. Hopefully at this point you are not doing curl to start connectors; you should have a system to deploy connectors, so in the end this should not be a lot of work/overhead.
  • #25 Now that we've learned how to run MM2, let's look at some tips for going into production.
  • #26 Obviously, like any production system, you want to monitor MM2 closely. Fortunately, the MM2 connectors provide many metrics. Source connector: check throughput and latency. Also consider record-age if mirroring existing topics with old records. Record-age is the difference between the record timestamp and the time MM2 consumes the record. Latency is the difference between the record timestamp and the time Connect successfully produced the record to the target cluster. Checkpoint connector: checkpoint latency. Overall Connect health: task count and state. Are all tasks running? How many tasks? Have we reached the max? Be sure to run one of the latest releases to have the fix for KAFKA-9849: Connect could duplicate tasks when rebalancing, so data could be mirrored twice with a significant load increase!
  • #27 It's also important to be aware of the controls you have as an operator. In terms of performance, you can scale connectors via 2 mechanisms: the number of tasks (how many tasks can be packed onto a worker depends on many factors, so monitor your worker system resources) and the number of workers running tasks. You can also adjust the workload and make sure what is being mirrored is what you want. MM2 prevents creating loops, but still be careful as the default setting is .*! In many scenarios you typically don't want to mirror all your topics/groups, and be careful with regexes as it's easy to make a mistake. Since 2.5 (KIP-558), you can use the Connect REST API to see the active list of topics and check it's what you expect. From Kafka 2.8 (KIP-661), you can use /connectors/<connector>/tasks-config to see the partitions assigned to each task. Finally, you can adjust where your connectors start mirroring with the offset reset policy, especially if mirroring large topics.
  • #28 MM2 leverages Connect, so improvements to Connect help MM2! (e.g. EOS) Lots of MM2-related KIPs recently. Real momentum! Sorry if I missed some!
  • #29 New georeplication section replaces old MM1 documentation.
  • #30 We hope we've given you the tools to get started with MM2 and that you'll be able to run it successfully. Thank you for attending our session. Feel free to reach out to us on Twitter if you have any questions.