Apache Flink: Building a Company-wide Self-service Streaming Data Platform

•

0 likes•114 views

"In my talk, we will examine all the stages of building our self-service Streaming Data Platform based on Apache Flink and Kafka Connect, from the selection of a solution for stateful streaming data processing, right up to the successful design of a robust self-service platform, covering the challenges that we’ve met. I will share our experience in providing non-Java developers with a company-wide self-service solution, which allows them to quickly and easily develop their streaming data pipelines. Additionally, I will highlight specific business use cases that would not have been implemented without our platform.0 characters0 characters"

Technology

20/03/2024
Apache Flink: Building
a company-wide
self-service Streaming
Data Platform
Gleb Shipilov (Data Integration Team Leader, Exness)

20/03/2024
Data as Bedrock
20/03/2024
Kafka Summit 2024
● Exness is the largest CFD broker by
trading volume and active clients
● Every millisecond counts
● As Exness delved deeper into event-driven
architecture, the need for processing
streaming data became paramount
● Each team had to deal with processing
streaming data on their own, solving all
the problems with:
○ Scalability
○ Fault tolerance
○ State management
○ Security
2

20/03/2024
Streaming Data Platform components
20/03/2024
Kafka Summit 2024 3

20/03/2024
Why Apache Flink?
● Support for several Kafka instances
● Performance
● Fault tolerance
● Support of very large state
● Java based framework
20/03/2024
Kafka Summit 2024 4

20/03/2024
What challenges have we faced
How to provide
Python and Go
developers with a
self-service platform
based on a Java
framework?
01
How to provide
developers with a
unified deployment
process for all the
components and
make it simple?
02
How to ensure
security?
03
How to flexibly
manage and isolate
resources between
teams?
04
20/03/2024
Kafka Summit 2024 5

20/03/2024
Flink SQL usage
20/03/2024
Kafka Summit 2024 6

20/03/2024
Flink SQL challenges
● Perfect for simple cases:
○ Aggregate;
○ Union data;
○ Flat data.
● Doesn’t work so perfect
with complex cases:
○ A lot of enrichments;
○ Complex business logic;
○ No tracing support.
20/03/2024
Kafka Summit 2024 7
Go developer

20/03/2024
PyFlink usage
20/03/2024
Kafka Summit 2024 8

20/03/2024
PyFlink challenges
● Lack of Apple silicon support (M1 / M2)
● No tracing support out of the box
● Necessity of both Table API and Data
Stream API usage in one PyFlink job
20/03/2024
Kafka Summit 2024 9
● At least the 1.17 version of Flink
● Tracing using:
○ OpenTelemetry;
○ Jaeger.
● Stream table environment to work with
both Table and Data Stream APIs

20/03/2024
Unified deployment process
● Main components of the deployment
process:
○ Terraform;
○ GitLab pipeline;
○ K8S operators.
20/03/2024
Kafka Summit 2024 10

20/03/2024
Unified deployment process
20/03/2024
Kafka Summit 2024 11

20/03/2024
Streaming Data replication using
Terraform over Flink
● Templated Terraform modules
instead of multiple and similar
SQL artefacts
● One module defines
configuration of the whole
pipeline from Kafka topic to S3
20/03/2024
Kafka Summit 2024 12

20/03/2024
One team–one Flink Cluster
● Security
● Resource management
● Own development environment
● Observability
20/03/2024
Kafka Summit 2024 13

20/03/2024
Monitoring and alerting
● Separate monitoring
for each team
● Slack channels with alerts
for each Flink cluster
● One Slack channel for technical
support of all the users
20/03/2024
Kafka Summit 2024 14

20/03/2024
The most important projects delivered
on our self-service platform
● Trading data processing lag
decrease from 2 hours to 2
minutes during peak times
● 1 MLN+ bots’ activity events
are prevented in real-time
● Fraud and abuse prevention
based on real-time data
● Marketing campaigns based
on real-time data
20/03/2024
Kafka Summit 2024 15

20/03/2024
Special thanks to:
Kafka Summit 2024 16
https://medium.com/exness-blog
● Alexey Perminov
● Ilya Soin
● Yury Smirnov
● Igor Matcko

Similar to Apache Flink: Building a Company-wide Self-service Streaming Data Platform

Updates from Hungary (Jozsef Kovacs)EOSC-hub project

Bandwidth: Use Cases for Elastic Cloud on Kubernetes Elasticsearch

Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...Miguel Pérez Colino

Day 13 - Creating Data Processing Services | Train the Trainers ProgramFIWARE

What_s_New_in_OpenShift_Container_Platform_4.6.pdfchalermpany

Au delà des brokers, un tour de l’environnement Kafka | Florent Ramièreconfluent

State of ARM-based HPCinside-BigData.com

28March2024-Codeless-Generative-AI-PipelinesTimothy Spann

Flink September 2015 Community UpdateRobert Metzger

Unconference Round Table NotesTimothy Spann

big data fest building modern data streaming appsTimothy Spann

BigDataFest_ Building Modern Data Streaming Appsssuser73434e

Edge Computing: A Unified Infrastructure for all the Different PiecesCloudify Community

Opnfv & odl case study slidesChristopher Price

OpenStack and Kubernetes - A match made for Telco HeavenTrinath Somanchi

Serverless Kafka PatternsTaras Slipets

Using Kubernetes to make cellular data plans cheaper for 50M usersMirantis

data Artisans Product AnnouncementFlink Forward

Introduction to Anypoint Runtime Fabric on Amazon Elastic Kubernetes Service ...Anoop Ramachandran

Stream Processing with Flink and Stream Sharingconfluent

Similar to Apache Flink: Building a Company-wide Self-service Streaming Data Platform (20)

Updates from Hungary (Jozsef Kovacs)

Bandwidth: Use Cases for Elastic Cloud on Kubernetes

Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...

Day 13 - Creating Data Processing Services | Train the Trainers Program

What_s_New_in_OpenShift_Container_Platform_4.6.pdf

Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière

State of ARM-based HPC

28March2024-Codeless-Generative-AI-Pipelines

Flink September 2015 Community Update

Unconference Round Table Notes

big data fest building modern data streaming apps

BigDataFest_ Building Modern Data Streaming Apps

Edge Computing: A Unified Infrastructure for all the Different Pieces

Opnfv & odl case study slides

OpenStack and Kubernetes - A match made for Telco Heaven

Serverless Kafka Patterns

Using Kubernetes to make cellular data plans cheaper for 50M users

data Artisans Product Announcement

Introduction to Anypoint Runtime Fabric on Amazon Elastic Kubernetes Service ...

Stream Processing with Flink and Stream Sharing

Recently uploaded

Vulnerability_Management_GRC_by Sohang Sengupta.pptxnull - The Open Security Community

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang

Bluetooth Controlled Car with Arduino.pdfngoud9212

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

APIForce Zurich 5 April Automation LPDGMarianaLemus7

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

AI as an Interface for Commercial BuildingsMemoori

"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays

Build your next Gen AI Breakthrough - April 2024Neo4j

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

CloudStudio User manual (basic edition):comworks

Pigging Solutions Piggable Sweeping ElbowsPigging Solutions

My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer

Understanding the Laravel MVC ArchitecturePixlogix Infotech

Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

costume and set research powerpoint presentationphoebematthew05

Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group

Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely

Recently uploaded (20)

Vulnerability_Management_GRC_by Sohang Sengupta.pptx

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)

Bluetooth Controlled Car with Arduino.pdf

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

Streamlining Python Development: A Guide to a Modern Project Setup

APIForce Zurich 5 April Automation LPDG

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners

AI as an Interface for Commercial Buildings

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn

Build your next Gen AI Breakthrough - April 2024

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

CloudStudio User manual (basic edition):

Pigging Solutions Piggable Sweeping Elbows

My INSURER PTE LTD - Insurtech Innovation Award 2024

Understanding the Laravel MVC Architecture

Science&tech:THE INFORMATION AGE STS.pdf

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

costume and set research powerpoint presentation

Snow Chain-Integrated Tire for a Safe Drive on Winter Roads

Unlocking the Potential of the Cloud for IBM Power Systems

Apache Flink: Building a Company-wide Self-service Streaming Data Platform

1. 20/03/2024 Apache Flink: Building a company-wide self-service Streaming Data Platform Gleb Shipilov (Data Integration Team Leader, Exness)

2. 20/03/2024 Data as Bedrock 20/03/2024 Kafka Summit 2024 ● Exness is the largest CFD broker by trading volume and active clients ● Every millisecond counts ● As Exness delved deeper into event-driven architecture, the need for processing streaming data became paramount ● Each team had to deal with processing streaming data on their own, solving all the problems with: ○ Scalability ○ Fault tolerance ○ State management ○ Security 2

3. 20/03/2024 Streaming Data Platform components 20/03/2024 Kafka Summit 2024 3

4. 20/03/2024 Why Apache Flink? ● Support for several Kafka instances ● Performance ● Fault tolerance ● Support of very large state ● Java based framework 20/03/2024 Kafka Summit 2024 4

5. 20/03/2024 What challenges have we faced How to provide Python and Go developers with a self-service platform based on a Java framework? 01 How to provide developers with a unified deployment process for all the components and make it simple? 02 How to ensure security? 03 How to flexibly manage and isolate resources between teams? 04 20/03/2024 Kafka Summit 2024 5

6. 20/03/2024 Flink SQL usage 20/03/2024 Kafka Summit 2024 6

7. 20/03/2024 Flink SQL challenges ● Perfect for simple cases: ○ Aggregate; ○ Union data; ○ Flat data. ● Doesn’t work so perfect with complex cases: ○ A lot of enrichments; ○ Complex business logic; ○ No tracing support. 20/03/2024 Kafka Summit 2024 7 Go developer

8. 20/03/2024 PyFlink usage 20/03/2024 Kafka Summit 2024 8

9. 20/03/2024 PyFlink challenges ● Lack of Apple silicon support (M1 / M2) ● No tracing support out of the box ● Necessity of both Table API and Data Stream API usage in one PyFlink job 20/03/2024 Kafka Summit 2024 9 ● At least the 1.17 version of Flink ● Tracing using: ○ OpenTelemetry; ○ Jaeger. ● Stream table environment to work with both Table and Data Stream APIs

10. 20/03/2024 Unified deployment process ● Main components of the deployment process: ○ Terraform; ○ GitLab pipeline; ○ K8S operators. 20/03/2024 Kafka Summit 2024 10

11. 20/03/2024 Unified deployment process 20/03/2024 Kafka Summit 2024 11

12. 20/03/2024 Streaming Data replication using Terraform over Flink ● Templated Terraform modules instead of multiple and similar SQL artefacts ● One module defines configuration of the whole pipeline from Kafka topic to S3 20/03/2024 Kafka Summit 2024 12

13. 20/03/2024 One team–one Flink Cluster ● Security ● Resource management ● Own development environment ● Observability 20/03/2024 Kafka Summit 2024 13

14. 20/03/2024 Monitoring and alerting ● Separate monitoring for each team ● Slack channels with alerts for each Flink cluster ● One Slack channel for technical support of all the users 20/03/2024 Kafka Summit 2024 14

15. 20/03/2024 The most important projects delivered on our self-service platform ● Trading data processing lag decrease from 2 hours to 2 minutes during peak times ● 1 MLN+ bots’ activity events are prevented in real-time ● Fraud and abuse prevention based on real-time data ● Marketing campaigns based on real-time data 20/03/2024 Kafka Summit 2024 15

16. 20/03/2024 Special thanks to: Kafka Summit 2024 16 https://medium.com/exness-blog ● Alexey Perminov ● Ilya Soin ● Yury Smirnov ● Igor Matcko

Apache Flink: Building a Company-wide Self-service Streaming Data Platform

Recommended

Recommended

More Related Content

Similar to Apache Flink: Building a Company-wide Self-service Streaming Data Platform

Similar to Apache Flink: Building a Company-wide Self-service Streaming Data Platform (20)

More from HostedbyConfluent

More from HostedbyConfluent (20)

Recently uploaded

Recently uploaded (20)

Apache Flink: Building a Company-wide Self-service Streaming Data Platform