Unlock Your TikTok Potential: Free TikTok Likes with InstBlast
Sergii Bielskyi "Using Kafka and Azure Event hub together for streaming Big data"
1. Using Kafka and Azure Event hub
together for streaming Big data
Sergii Bielskyi
Cloud Architect at Eleks
https://medium.com/@sergiibielskyi
https://www.facebook.com/groups/iot.ua/
9. • Integration to Azure Stream Analytics, Functions, LogicApps, Azure Monitor, etc
• Cloud native service
• Using core features like RBAC, Vnet, MSI
• IP Filtering
• Capture
• Auto - scale
• NiFi
• Spark
• More …
Features of using Azure Event Hubs
10. • Automatically send Event Hubs data to your storage
• Better serve batching and archival scenarios
• Easy to configure, no code
• Enable batch & real time on the same stream
Event Hubs Capture – easy way to archive
19. Difference between Kafka and Event Hubs
• Event Hubs is multi tenant managed service, kafka is not
• On premises support for kafka
• Protocols support. Kafka has HTTP rest clients, event hubs supports AMQP, HTTPS, Kafka clients
• Languages. Kafka (Java), Event hubs (C#, .Net) and also Java
• Disaster recovery. Event Hubs supports Geo Redundant Storage, Kafka supports replication only into
the cluster
• Scalability. Event hubs auto scale the storage but kafka is IaaS
• Integration with Stream Analytics is only possible with Event Hubs
• Message size. Event hubs limit is 256Kb, kafka – any limits
24. Kafka on HDInsight
• SLA 99.9%
• Azure Managed Disks as the backing store for Kafka. Managed Disks can provide up to
16 TB of storage per Kafka broker
• Azure Log Analytics can be used to monitor Kafka on HDInsight
27. Use cases
• Messaging. Message broker
• Activity tracking. Provides in-order logging of records, it can be used to track and re-
create activities
• Aggregation. Using stream processing, aggregates information from different streams
to combine and centralize the information into operational data.
• Transformation. Using stream processing, combines the data from multiple input
topics into one or more output topics