SlideShare a Scribd company logo
1 of 51
1
Kai Waehner | Technology Evangelist, Confluent
contact@kai-waehner.de | LinkedIn | @KaiWaehner | www.confluent.io | www.kai-waehner.de
Streaming Machine Learning with
Apache Kafka and Confluent
in Pharma and Life Sciences
3
Use Cases in Pharma and Life Sciences
for Event Streaming
R&D
Engineering
Sales &
Marketing
Manufacturing &
Quality
Assurance
Supply
Chain
Product
Monitoring &
After Sales
Support
VoC (Voice of
Customer)
Single View
Customer
Yield/Quality
Optimization
Improved Drug Yield
Proactive Service
Scheduling
Testing &
Simulation
Drug Diversion
Process/Quality
Monitoring
Inventory & Supply
Chain Optimization
Proactive Service
Offers
Patent Research and
Analytics
Personalized
Offers / Ads
EDW Offload
Supply Chain
Network Design/Risk
Management
Product Predictive
Maintenance
Clinical Trials
Customer
Segmentation
Smart Products
Serialization &
e-Pedigree
Product Usage
Tracking
GTM Global Facilities
Inventory and
Logistics Visibility
Warranty & Recall
Management
www.kai-waehner.de | @KaiWaehner
4
Event Streaming in Pharma and Life Sciences
Use Cases Supporting Business Value
IoT Sensor
Ingestion
Digital
Replatformi
ng/
Mainframe
Offload
Customer
360
Faster
Transactional
Processing /
Analysis
Incl. Machine
Learning / AI
Microservices
Architecture
Online Fraud
Detection
Online Security
(Syslog, Lg
Aggregation,
Splunk
Replacement)
Middleware
Replacement
Website / Core
Operations
(Central
Nervous
System)
Real-time
App
Updates
New Cloud App /
Services + T2M
Develop & Market
New Drugs
Connected Health /
Remote Monitoring
Global Shortage in
Health Care Workers
Rise in New &
Chronic Health
Issues
Cybersecurity
Threads
T2M: Generic
Competition
Increase Revenue
(Make Money)
Decrease Costs
(Save Money)
Mitigate Risk
(Protect Money)
Business Value
1 | Business
Use Case
Strategic
Driver
2 | Business
Use Case
Data Eng. /
Infrastructure
Use Case
Web Click
Streams
Data
Pipelines
Messaging
Microservice /
Event
Sourcing
Stream
Processing
Data
Ingestion
Streaming
ETL
Log
Aggregation
www.kai-waehner.de | @KaiWaehner
555
Invitae
Data Science
+
24/7 Production
https://www.confluent.io/kafka-summit-
san-francisco-2019/from-zero-to-
streaming-healthcare-in-production
Bring comprehensive genetic information into mainstream medical practice to improve the
quality of healthcare for billions of people.
Our team's goal is enabling others to join in. Building an application entirely of Streams is a
significant and in many ways liberating paradigm shift; building self-healing streams with
aggregation, and deciding when to crash.
666
Babylon Health
Connectivity
+
Agile Microservice
Architecture
https://www.confluent.io/kafka-summit-
lon19/one-key-to-rule-them-all
At Babylon, we believe it is possible to put an accessible and affordable health service
in the hands of every person on earth; with GDPR and PII compliant security.
777
Bayer AG
On Premise and Cloud
+
Hybrid Real Time
Replication at Scale
https://www.confluent.io/kafka-summit-
sf18/bringing-streaming-data-to-the-
masses
Bayer Crop Science (formerly Monsanto) adopted a cloud first strategy and started a multi-year
transition to the cloud. A Kafka-based cross-datacenter DataHub was created to facilitate this
migration and to drive the shift to real-time stream processing. The DataHub has seen strong
enterprise adoption and supports a myriad of use cases.
888
celmatix
Real Time Aggregation
of Heterogeneous Data
+
Governance / Security
https://www.confluent.io/customers/
celmatix/
Through the development of digital tools and genetic insights focused on fertility, Celmatix is
disrupting how women approach their lifelong reproductive health journey by empowering them
and their physicians with more personalized information.
9Machine Learning and Event Streaming
Improve Traditional and to Build New Use Cases
in Pharma and Life Sciences
www.kai-waehner.de | @KaiWaehner
Streams Processing / AI / ML
Clinical Trials
Patents,
Text etc
Structured &
unstructured
Data
IoT & Business
Applications
Multi-Hybrid-
Cloud
10
Use Case: Drug Discovery
“On average, it takes at least ten
years for a new medicine to
complete the journey from initial
discovery to the marketplace”
PhRMA
http://phrma-docs.phrma.org/sites/default/files/pdf/rd_brochure_022307.pdf
www.kai-waehner.de | @KaiWaehner
121212
Recursion
Pharmaceutical
Discovering Drugs in
Real Time
+
Machine Learning
https://www.confluent.io/customers/recursion
https://www.confluent.io/kafka-summit-san-
francisco-2019/discovering-drugs-with-kafka-
streams
Massively parallel system that combines experimental biology, artificial intelligence,
automation and real-time event streaming to accelerate drug discovery.
www.kai-waehner.de | @KaiWaehner
13
Image and Video Processing
… (on high level) is “just” pixel (arrays of 0s and 1s) and matrix multiplication
www.kai-waehner.de | @KaiWaehner
14
Drug Discovery in manual and slow, bursty batch
mode, not scalable
www.kai-waehner.de | @KaiWaehner
15
Drug Discovery in automated, scalable, reliable
real time Mode
www.kai-waehner.de | @KaiWaehner
16
Digital Image
Processing
(e.g. noise
reduction)
Streaming Analytics for
Drug Discovery in Real Time at Scale
Real Time
Integration
Layer
Batch
Reporting
Platform
BI
DashboardEvent
Streaming
Platform
Real Time
Integration
Layer
Laboratory
Streaming Platform
Other Components
Automated
Drug
Analysis
All
Data
Processed
Images
Ingest
Images
Human
Intelligence
www.kai-waehner.de | @KaiWaehner
Data
Processing
(e.g. filtering)
Stateful
Workflow
Orchestration
18
Digital Image Processing
for Drug Discovery
Find drug treatments:
• ML models can be trained to decide between healthy cells and disease
cells with problematic genes
• Grow healthy cells and disease cells in labs
• Apply different drugs à Make disease cells look healthy again
www.kai-waehner.de | @KaiWaehner
20
The First
Analytic Models
How to deploy the models
in production?
…real-time processing?
…at scale?
…24/7 zero uptime?
www.kai-waehner.de | @KaiWaehner
21
Hidden Technical Debt
in Machine Learning Systems
https://papers.nips.cc/paper/5656-hidden-technical-debt-in-machine-learning-systems.pdf
www.kai-waehner.de | @KaiWaehner
22
How does
event streaming
help in
machine learning
use cases?
www.kai-waehner.de | @KaiWaehner
23
Scalable, Technology-Agnostic
Machine Learning Infrastructures
https://www.infoq.com/presentations/netflix-ml-meson
https://eng.uber.com/michelangelo
https://www.infoq.com/presentations/paypal-data-service-fraudwww.kai-waehner.de | @KaiWaehner
26
A Streaming Platform
is the Underpinning of an Event-driven Architecture
Microservices
DBs
SaaS apps
Mobile
Customer 360
Real-time fraud
detection
Data warehouse
Producers
Consumers
Database
change
Microservices
events
SaaS
data
Customer
experiences
Streams of real time events
Stream processing apps
Connectors
Connectors
Stream processing apps
www.kai-waehner.de | @KaiWaehner
27
Apache Kafka at Scale
at Tech Giants
> 7 trillion messages / day > 6 Petabytes / day
“You name it”
* Kafka Is not just used by tech giants
** Kafka is not just used for big datawww.kai-waehner.de | @KaiWaehner
28
Apache Kafka’s
Open Ecosystem as Infrastructure for ML
www.kai-waehner.de | @KaiWaehner
29
Apache Kafka’s
Open Ecosystem as Infrastructure for ML
Kafka
Streams /
ksqlDB
Kafka
Connect
Rest Proxy
Schema Registry
Go/ .NET
Kafka Producer
ksqlDB
Python
Consumer
www.kai-waehner.de | @KaiWaehner
30
Digital Image
Processing
(External SaaS
Service + REST)
Kafka, ksqlDB and TensorFlow for
Drug Discovery in Real Time at Scale
Kafka Client
(.NET C++)
Batch
Reporting
Platform
BI
Dashboard
Confluent
Server
Kafka
Connect
Laboratory
(Windows Machines)
Confluent Platform
Other Components
Model Training
and Scoring
(Confluent Python
Client + TensorFlow)
All Data
Processed
Images
Images
Human
Intelligence
www.kai-waehner.de | @KaiWaehner
Streaming
ETL
(ksqlDB)
Stateful
Workflow
Orchestration
(Kafka Streams)
Database
(MySQL) Kafka Connect
(Debezium CDC)
Historical Drugs Data
31
How do you implement a
Kafka ML infrastructure?
www.kai-waehner.de | @KaiWaehner
32
Ingestion of Images
Replication
MirrorMaker 2 /
Confluent Replicator
Kafka
Connect
Laboratory
www.kai-waehner.de | @KaiWaehner
33
Data Preprocessing
Preprocessing
Filter, transform, anonymize, extract features,
reduce noise, enhance brightness / contrast
Streams
Data Ready
For Model Training
www.kai-waehner.de | @KaiWaehner
34
SELECT image_id, experiment_id, image_details
FROM image_channel i
LEFT JOIN experiment_database e ON i.experiment_id =
e.experiment_id
WHERE e.image_type = ‘black_and_white';
Data Processing with ksqlDB
www.kai-waehner.de | @KaiWaehner
37
TensorFlow Model —
Convolutional Neural Network (CNN)
for Image Recognition (as part of the ML Pipeline)
www.kai-waehner.de | @KaiWaehner
38
Direct streaming ingestion
for model training and / or scoring
with TensorFlow I/O + Kafka Plugin
(no additional data storage
like S3 or HDFS required!)
Time
Model BModel A
Producer
Distributed Commit Log
Streaming Ingestion and Model Training
with TensorFlow IO
https://github.com/tensorflow/io
www.kai-waehner.de | @KaiWaehner
39
Kafka as
long-term storage?
www.kai-waehner.de | @KaiWaehner
40
Today, Kafka works well
for recent events, short
horizon storage, and
manual data balancing
CONFIDENTIAL
Kafka’s present-day design offers
extraordinarily low messaging latency by
storing topic data on fast disks that are
collocated with brokers. This is usually good.
But sometimes, you need to store a huge
amount of data for a long time.
Kafka
Processing
App
Storage
Transactions, auth,
quota enforcement,
compaction, ...
www.kai-waehner.de | @KaiWaehner
41Tiered Storage for Kafka
Object Store
Processing Storage
Transactions,
auth, quota
enforcement,
compaction, ...
Local
Remote
Kafka
Apps
Store Forever
Older data is offloaded to inexpensive object
storage, permitting it to be consumed at any time.
Save $$$
Storage limitations, like capacity and duration, are
effectively uncapped.
Instantaneously scale up and down
Your Kafka clusters will be able to automatically
self-balance load and hence elastically scale
(Only available in Confluent Platform)www.kai-waehner.de | @KaiWaehner
43
Reprocessing of Events
● New Consumer
○ e.g. a complete new microservices or a replacement of an existing application
● Error-Handling
○ Re-processing of data in case of error: Fix error and process events again
● Compliance / Regulatory Processing
○ Reprocessing of already processed data for legal reasons
○ Could be very old data (e.g. pharma: 10 years old)
● Query and Analysis of Existing Events
○ No need for another data store / data lake
○ Kafka Client Consumer for offset- or timestamp-based consumption of old events
○ ksqlDB (for simple pull queries)
○ Kafka-native analytics tool (e.g. Rockset with Kafka connector and ANSI SQL support for Tableau et al)
● Model Training
○ Consume events for model training with a) different one ML framework and different hyperparameters or b)
different ML frameworks
www.kai-waehner.de | @KaiWaehner
44
Streaming Machine Learning with
Apache Kafka and Tiered Storage
https://www.confluent.io/blog/streaming-machine-
learning-with-tiered-storage/
www.kai-waehner.de | @KaiWaehner
45
How to deploy the
analytic models?
www.kai-waehner.de | @KaiWaehner
46
Local Predictions
Model Training
in Cloud
Model Deployment
at the Edge
Analytic Model
Separation of
Model Training and Model Inference
www.kai-waehner.de | @KaiWaehner
47
Streams
Input Event
Prediction
Request
Response
Model Serving
TensorFlow Serving
gRPC / HTTP
Application
Stream Processing with External Model and RPC
www.kai-waehner.de | @KaiWaehner
Model
49
Prediction
Stream Processing
Model
doPrediction()
return value
Stream Processing
with Embedded Model
Streams
Input Event
www.kai-waehner.de | @KaiWaehner
50
Prediction
Kafka Client
Model
doPrediction()
return value
Client Application
with Embedded Model Input Event
www.kai-waehner.de | @KaiWaehner
REST
Proxy
52
“CREATE STREAM ImageAnalysis AS
SELECT image_id, analyzeImage(image_details)
FROM image_channel;“
User Defined Function (UDF)
Model Deployment with
Apache Kafka, ksqlDB
and TensorFlow
www.kai-waehner.de | @KaiWaehner
54
Model Training and Scoring
with the same ML Pipeline (or even in the same Application)
• Data Science team responsible for the whole model lifecycle
• Beloved Python tool stack (Pandas, scikit learn, TensorFlow, Jupyter, …)
• 24/7 production scale with Confluent Python Client (e.g. deployed in Docker containers on Kubernetes)
www.kai-waehner.de | @KaiWaehner
55
Digital Image
Processing
(External SaaS
Service + REST)
Kafka, ksqlDB and TensorFlow for
Drug Discovery in Real Time at Scale
Kafka Client
(.NET C++)
Batch
Reporting
Platform
BI
Dashboard
Confluent
Server
Kafka
Connect
Laboratory
(Windows Machines)
Confluent Platform
Other Components
Model Training
and Scoring
(Confluent Python
Client + TensorFlow)
All Data
Processed
Images
Images
Human
Intelligence
www.kai-waehner.de | @KaiWaehner
Streaming
ETL
(ksqlDB)
Stateful
Workflow
Orchestration
(Kafka Streams)
Database
(MySQL) Kafka Connect
(Debezium CDC)
Historical Drugs Data
5656
Machine Learning + Apache Kafka
à Examples @ Github
https://github.com/kaiwaehner
www.kai-waehner.de | @KaiWaehner
57
Image processing,
workflow orchestration,
legacy integration... ???
www.kai-waehner.de | @KaiWaehner
58
Image / Video Processing with Kafka
Kafka-native Image / Video Processing
vs.
Chunk + re-assemble
vs.
Metadata-only + Object Store
à All approaches are fine! J
https://www.slideshare.net/JiangjieQin/handle-large-messages-in-apache-kafka-58692297
https://www.confluent.io/blog/bust-the-burglars-machine-learning-with-tensorflow-and-apache-kafka/
www.kai-waehner.de | @KaiWaehner
59
Workflow Orchestration
Kafka-native vs. External Tool
à both approaches are fine! J
https://github.com/nsaje/dagger
https://eventil.com/events/using-apache-kafka-in-a-closed-environment-with-centralized-orchestration
https://zeebe.io/blog/2019/08/official-kafka-connector-for-zeebe/
www.kai-waehner.de | @KaiWaehner
60
Legacy Integration
Date Amount
1/27/2017 $4.56
1/22/2017 $32.14
Mainframe
Transaction Data
Traditional
Middleware
Application
Kafka
Microservices
Agile, lightweight
(but scalable robust)
Kafka microservice
Big Data project
(Elastic, Spark,
AWS Services, …)
1) Direct Legacy MQ Communication with App
2) Kafka for decoupling between MQ and App
3) Direct communication via Kafka (no MQ anymore)
4) New projects and applications
(independent or related to the existing migration projects)
External
Solution
www.kai-waehner.de | @KaiWaehner
6161
Legacy
Integration
https://www.kai-waehner.de/blog/2019/03/07/apache-kafka-middleware-mq-etl-esb-comparison/
https://www.kai-waehner.de/blog/2020/04/24/mainframe-offloading-replacement-apache-kafka-connect-ibm-db2-mq-cdc-cobol/
63
Generate added value from data
The pharmaceutical industry today has an
unprecedented wealth of opportunities to
generate added value from data.
These possibilities cover all
relevant areas such as:
• R&D / Engineering
• Sales & Marketing
• Manufacturing & QA
• Supply Chain
• Product Monitoring / After
Sales Support
Novel data use:
• Better therapies
• Faster and more accurate
diagnoses
• Faster drug development
• Improvement of clinical studies
• Real-World Data Generation
• real-world evidence
• Precision Medicine
• Support Remote Health etc
Challenges:
• Data silos
• Data integration
• Data growth/explosion
• Cloud & on-prem
• Use of new technologies like
AI/ML
• Time2Market
• Regulatory Affairs
• Security
• Performance
• API
Streams Processing / AI / ML
Clinical Trials
Patents,
Text etc
Structured &
unstructured
Data
IoT & Business
Applications
Multi-Hybrid-
Cloud
64
Questions?
Let’s connect...
Kai Waehner
Technology Evangelist
kai.waehner@confluent.io
@KaiWaehner
www.confluent.io
www.kai-waehner.de
LinkedIn

More Related Content

What's hot

Apache Kafka in the Airline, Aviation and Travel Industry
Apache Kafka in the Airline, Aviation and Travel IndustryApache Kafka in the Airline, Aviation and Travel Industry
Apache Kafka in the Airline, Aviation and Travel IndustryKai Wähner
 
Event Streaming in Retail with Apache Kafka
Event Streaming in Retail with Apache KafkaEvent Streaming in Retail with Apache Kafka
Event Streaming in Retail with Apache KafkaKai Wähner
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Kai Wähner
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Apache Kafka for Real-time Supply Chainin the Food and Retail IndustryApache Kafka for Real-time Supply Chainin the Food and Retail Industry
Apache Kafka for Real-time Supply Chain in the Food and Retail IndustryKai Wähner
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Kai Wähner
 
Apache Kafka in Financial Services - Use Cases and Architectures
Apache Kafka in Financial Services - Use Cases and ArchitecturesApache Kafka in Financial Services - Use Cases and Architectures
Apache Kafka in Financial Services - Use Cases and ArchitecturesKai Wähner
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...confluent
 
Apache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and LogisticsApache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and LogisticsKai Wähner
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKai Wähner
 
Kafka Streams State Stores Being Persistent
Kafka Streams State Stores Being PersistentKafka Streams State Stores Being Persistent
Kafka Streams State Stores Being Persistentconfluent
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKai Wähner
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Araf Karsh Hamid
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Kai Wähner
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaReal-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaKai Wähner
 
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Kai Wähner
 
Streaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache KafkaStreaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache Kafkaconfluent
 
Modern Data Flow
Modern Data FlowModern Data Flow
Modern Data Flowconfluent
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKai Wähner
 
Kubernetes: Reducing Infrastructure Cost & Complexity
Kubernetes: Reducing Infrastructure Cost & ComplexityKubernetes: Reducing Infrastructure Cost & Complexity
Kubernetes: Reducing Infrastructure Cost & ComplexityDevOps.com
 

What's hot (20)

Apache Kafka in the Airline, Aviation and Travel Industry
Apache Kafka in the Airline, Aviation and Travel IndustryApache Kafka in the Airline, Aviation and Travel Industry
Apache Kafka in the Airline, Aviation and Travel Industry
 
Event Streaming in Retail with Apache Kafka
Event Streaming in Retail with Apache KafkaEvent Streaming in Retail with Apache Kafka
Event Streaming in Retail with Apache Kafka
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Apache Kafka for Real-time Supply Chainin the Food and Retail IndustryApache Kafka for Real-time Supply Chainin the Food and Retail Industry
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
 
Apache Kafka in Financial Services - Use Cases and Architectures
Apache Kafka in Financial Services - Use Cases and ArchitecturesApache Kafka in Financial Services - Use Cases and Architectures
Apache Kafka in Financial Services - Use Cases and Architectures
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
 
Apache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and LogisticsApache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and Logistics
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
 
Kafka internals
Kafka internalsKafka internals
Kafka internals
 
Kafka Streams State Stores Being Persistent
Kafka Streams State Stores Being PersistentKafka Streams State Stores Being Persistent
Kafka Streams State Stores Being Persistent
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid Cloud
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaReal-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
 
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
 
Streaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache KafkaStreaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache Kafka
 
Modern Data Flow
Modern Data FlowModern Data Flow
Modern Data Flow
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology Comparison
 
Kubernetes: Reducing Infrastructure Cost & Complexity
Kubernetes: Reducing Infrastructure Cost & ComplexityKubernetes: Reducing Infrastructure Cost & Complexity
Kubernetes: Reducing Infrastructure Cost & Complexity
 

Similar to Machine Learning with Apache Kafka in Pharma and Life Sciences

Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Sanjay Padhi, Ph.D
 
Lifesaving AI and Javascript (JSConf Korea 2019)
Lifesaving AI and Javascript (JSConf Korea 2019)Lifesaving AI and Javascript (JSConf Korea 2019)
Lifesaving AI and Javascript (JSConf Korea 2019)Jaeman An
 
Webinar digitally transforming healthcare with blockchain
Webinar   digitally transforming healthcare with blockchainWebinar   digitally transforming healthcare with blockchain
Webinar digitally transforming healthcare with blockchainKaleido
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareDATA360US
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Yael Garten
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Shirshanka Das
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...Big Data Week
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overviewStratebi
 
Exploring modern analytics use cases
Exploring modern analytics use casesExploring modern analytics use cases
Exploring modern analytics use casesMariaDB plc
 
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERFASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERiQHub
 
Deep Learning & AI for Healthcare and Retail
Deep Learning & AI for Healthcare and RetailDeep Learning & AI for Healthcare and Retail
Deep Learning & AI for Healthcare and RetailE2E Networks Limited
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsDatawatchCorporation
 
Accelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsAccelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsArcadia Data
 
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...All Things Open
 
(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...
(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...
(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...Amazon Web Services
 
The Evolution of Data Architecture
The Evolution of Data ArchitectureThe Evolution of Data Architecture
The Evolution of Data ArchitectureWei-Chiu Chuang
 
Analytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old ConstraintsAnalytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old ConstraintsInside Analysis
 
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...Databricks
 

Similar to Machine Learning with Apache Kafka in Pharma and Life Sciences (20)

Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021
 
Lifesaving AI and Javascript (JSConf Korea 2019)
Lifesaving AI and Javascript (JSConf Korea 2019)Lifesaving AI and Javascript (JSConf Korea 2019)
Lifesaving AI and Javascript (JSConf Korea 2019)
 
Webinar digitally transforming healthcare with blockchain
Webinar   digitally transforming healthcare with blockchainWebinar   digitally transforming healthcare with blockchain
Webinar digitally transforming healthcare with blockchain
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
 
Crosswalk
CrosswalkCrosswalk
Crosswalk
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Exploring modern analytics use cases
Exploring modern analytics use casesExploring modern analytics use cases
Exploring modern analytics use cases
 
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERFASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
 
Deep Learning & AI for Healthcare and Retail
Deep Learning & AI for Healthcare and RetailDeep Learning & AI for Healthcare and Retail
Deep Learning & AI for Healthcare and Retail
 
Vm turbo
Vm turboVm turbo
Vm turbo
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of Things
 
Accelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsAccelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time Analytics
 
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
 
(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...
(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...
(HLS305) Transforming Cancer Treatment: Integrating Data to Deliver on the Pr...
 
The Evolution of Data Architecture
The Evolution of Data ArchitectureThe Evolution of Data Architecture
The Evolution of Data Architecture
 
Analytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old ConstraintsAnalytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old Constraints
 
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...
 

More from Kai Wähner

The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaKai Wähner
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareApache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareKai Wähner
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureKai Wähner
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryData Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryKai Wähner
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Kai Wähner
 
Apache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingApache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingKai Wähner
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesEvent Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesKai Wähner
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Kai Wähner
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationApache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationKai Wähner
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Kai Wähner
 
Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake ArchitectureServerless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake ArchitectureKai Wähner
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?Kai Wähner
 
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Apache Kafka and MQTT - Overview, Comparison, Use Cases, ArchitecturesApache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Apache Kafka and MQTT - Overview, Comparison, Use Cases, ArchitecturesKai Wähner
 
Connected Vehicles and V2X with Apache Kafka
Connected Vehicles and V2X with Apache KafkaConnected Vehicles and V2X with Apache Kafka
Connected Vehicles and V2X with Apache KafkaKai Wähner
 
Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Kai Wähner
 

More from Kai Wähner (15)

The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareApache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryData Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
 
Apache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingApache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and Manufacturing
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesEvent Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationApache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
 
Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake ArchitectureServerless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
 
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Apache Kafka and MQTT - Overview, Comparison, Use Cases, ArchitecturesApache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
 
Connected Vehicles and V2X with Apache Kafka
Connected Vehicles and V2X with Apache KafkaConnected Vehicles and V2X with Apache Kafka
Connected Vehicles and V2X with Apache Kafka
 
Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?
 

Recently uploaded

WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024VictoriaMetrics
 
AzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdf
AzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdfAzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdf
AzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdfryanfarris8
 
WSO2Con2024 - Low-Code Integration Tooling
WSO2Con2024 - Low-Code Integration ToolingWSO2Con2024 - Low-Code Integration Tooling
WSO2Con2024 - Low-Code Integration ToolingWSO2
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationJuha-Pekka Tolvanen
 
WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...
WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...
WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...WSO2
 
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...WSO2
 
WSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security ProgramWSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security ProgramWSO2
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in sowetomasabamasaba
 
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2
 
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
WSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAMWSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAMWSO2
 
WSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - KanchanaWSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - KanchanaWSO2
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
 
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...WSO2
 

Recently uploaded (20)

WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
AzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdf
AzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdfAzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdf
AzureNativeQumulo_HPC_Cloud_Native_Benchmarks.pdf
 
WSO2Con2024 - Low-Code Integration Tooling
WSO2Con2024 - Low-Code Integration ToolingWSO2Con2024 - Low-Code Integration Tooling
WSO2Con2024 - Low-Code Integration Tooling
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the Situation
 
WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...
WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...
WSO2Con2024 - GitOps in Action: Navigating Application Deployment in the Plat...
 
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
 
WSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security ProgramWSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security Program
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
WSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAMWSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAM
 
WSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - KanchanaWSO2Con2024 - Hello Choreo Presentation - Kanchana
WSO2Con2024 - Hello Choreo Presentation - Kanchana
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - Keynote
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
 

Machine Learning with Apache Kafka in Pharma and Life Sciences

  • 1. 1 Kai Waehner | Technology Evangelist, Confluent contact@kai-waehner.de | LinkedIn | @KaiWaehner | www.confluent.io | www.kai-waehner.de Streaming Machine Learning with Apache Kafka and Confluent in Pharma and Life Sciences
  • 2. 3 Use Cases in Pharma and Life Sciences for Event Streaming R&D Engineering Sales & Marketing Manufacturing & Quality Assurance Supply Chain Product Monitoring & After Sales Support VoC (Voice of Customer) Single View Customer Yield/Quality Optimization Improved Drug Yield Proactive Service Scheduling Testing & Simulation Drug Diversion Process/Quality Monitoring Inventory & Supply Chain Optimization Proactive Service Offers Patent Research and Analytics Personalized Offers / Ads EDW Offload Supply Chain Network Design/Risk Management Product Predictive Maintenance Clinical Trials Customer Segmentation Smart Products Serialization & e-Pedigree Product Usage Tracking GTM Global Facilities Inventory and Logistics Visibility Warranty & Recall Management www.kai-waehner.de | @KaiWaehner
  • 3. 4 Event Streaming in Pharma and Life Sciences Use Cases Supporting Business Value IoT Sensor Ingestion Digital Replatformi ng/ Mainframe Offload Customer 360 Faster Transactional Processing / Analysis Incl. Machine Learning / AI Microservices Architecture Online Fraud Detection Online Security (Syslog, Lg Aggregation, Splunk Replacement) Middleware Replacement Website / Core Operations (Central Nervous System) Real-time App Updates New Cloud App / Services + T2M Develop & Market New Drugs Connected Health / Remote Monitoring Global Shortage in Health Care Workers Rise in New & Chronic Health Issues Cybersecurity Threads T2M: Generic Competition Increase Revenue (Make Money) Decrease Costs (Save Money) Mitigate Risk (Protect Money) Business Value 1 | Business Use Case Strategic Driver 2 | Business Use Case Data Eng. / Infrastructure Use Case Web Click Streams Data Pipelines Messaging Microservice / Event Sourcing Stream Processing Data Ingestion Streaming ETL Log Aggregation www.kai-waehner.de | @KaiWaehner
  • 4. 555 Invitae Data Science + 24/7 Production https://www.confluent.io/kafka-summit- san-francisco-2019/from-zero-to- streaming-healthcare-in-production Bring comprehensive genetic information into mainstream medical practice to improve the quality of healthcare for billions of people. Our team's goal is enabling others to join in. Building an application entirely of Streams is a significant and in many ways liberating paradigm shift; building self-healing streams with aggregation, and deciding when to crash.
  • 5. 666 Babylon Health Connectivity + Agile Microservice Architecture https://www.confluent.io/kafka-summit- lon19/one-key-to-rule-them-all At Babylon, we believe it is possible to put an accessible and affordable health service in the hands of every person on earth; with GDPR and PII compliant security.
  • 6. 777 Bayer AG On Premise and Cloud + Hybrid Real Time Replication at Scale https://www.confluent.io/kafka-summit- sf18/bringing-streaming-data-to-the- masses Bayer Crop Science (formerly Monsanto) adopted a cloud first strategy and started a multi-year transition to the cloud. A Kafka-based cross-datacenter DataHub was created to facilitate this migration and to drive the shift to real-time stream processing. The DataHub has seen strong enterprise adoption and supports a myriad of use cases.
  • 7. 888 celmatix Real Time Aggregation of Heterogeneous Data + Governance / Security https://www.confluent.io/customers/ celmatix/ Through the development of digital tools and genetic insights focused on fertility, Celmatix is disrupting how women approach their lifelong reproductive health journey by empowering them and their physicians with more personalized information.
  • 8. 9Machine Learning and Event Streaming Improve Traditional and to Build New Use Cases in Pharma and Life Sciences www.kai-waehner.de | @KaiWaehner Streams Processing / AI / ML Clinical Trials Patents, Text etc Structured & unstructured Data IoT & Business Applications Multi-Hybrid- Cloud
  • 9. 10 Use Case: Drug Discovery “On average, it takes at least ten years for a new medicine to complete the journey from initial discovery to the marketplace” PhRMA http://phrma-docs.phrma.org/sites/default/files/pdf/rd_brochure_022307.pdf www.kai-waehner.de | @KaiWaehner
  • 10. 121212 Recursion Pharmaceutical Discovering Drugs in Real Time + Machine Learning https://www.confluent.io/customers/recursion https://www.confluent.io/kafka-summit-san- francisco-2019/discovering-drugs-with-kafka- streams Massively parallel system that combines experimental biology, artificial intelligence, automation and real-time event streaming to accelerate drug discovery. www.kai-waehner.de | @KaiWaehner
  • 11. 13 Image and Video Processing … (on high level) is “just” pixel (arrays of 0s and 1s) and matrix multiplication www.kai-waehner.de | @KaiWaehner
  • 12. 14 Drug Discovery in manual and slow, bursty batch mode, not scalable www.kai-waehner.de | @KaiWaehner
  • 13. 15 Drug Discovery in automated, scalable, reliable real time Mode www.kai-waehner.de | @KaiWaehner
  • 14. 16 Digital Image Processing (e.g. noise reduction) Streaming Analytics for Drug Discovery in Real Time at Scale Real Time Integration Layer Batch Reporting Platform BI DashboardEvent Streaming Platform Real Time Integration Layer Laboratory Streaming Platform Other Components Automated Drug Analysis All Data Processed Images Ingest Images Human Intelligence www.kai-waehner.de | @KaiWaehner Data Processing (e.g. filtering) Stateful Workflow Orchestration
  • 15. 18 Digital Image Processing for Drug Discovery Find drug treatments: • ML models can be trained to decide between healthy cells and disease cells with problematic genes • Grow healthy cells and disease cells in labs • Apply different drugs à Make disease cells look healthy again www.kai-waehner.de | @KaiWaehner
  • 16. 20 The First Analytic Models How to deploy the models in production? …real-time processing? …at scale? …24/7 zero uptime? www.kai-waehner.de | @KaiWaehner
  • 17. 21 Hidden Technical Debt in Machine Learning Systems https://papers.nips.cc/paper/5656-hidden-technical-debt-in-machine-learning-systems.pdf www.kai-waehner.de | @KaiWaehner
  • 18. 22 How does event streaming help in machine learning use cases? www.kai-waehner.de | @KaiWaehner
  • 19. 23 Scalable, Technology-Agnostic Machine Learning Infrastructures https://www.infoq.com/presentations/netflix-ml-meson https://eng.uber.com/michelangelo https://www.infoq.com/presentations/paypal-data-service-fraudwww.kai-waehner.de | @KaiWaehner
  • 20. 26 A Streaming Platform is the Underpinning of an Event-driven Architecture Microservices DBs SaaS apps Mobile Customer 360 Real-time fraud detection Data warehouse Producers Consumers Database change Microservices events SaaS data Customer experiences Streams of real time events Stream processing apps Connectors Connectors Stream processing apps www.kai-waehner.de | @KaiWaehner
  • 21. 27 Apache Kafka at Scale at Tech Giants > 7 trillion messages / day > 6 Petabytes / day “You name it” * Kafka Is not just used by tech giants ** Kafka is not just used for big datawww.kai-waehner.de | @KaiWaehner
  • 22. 28 Apache Kafka’s Open Ecosystem as Infrastructure for ML www.kai-waehner.de | @KaiWaehner
  • 23. 29 Apache Kafka’s Open Ecosystem as Infrastructure for ML Kafka Streams / ksqlDB Kafka Connect Rest Proxy Schema Registry Go/ .NET Kafka Producer ksqlDB Python Consumer www.kai-waehner.de | @KaiWaehner
  • 24. 30 Digital Image Processing (External SaaS Service + REST) Kafka, ksqlDB and TensorFlow for Drug Discovery in Real Time at Scale Kafka Client (.NET C++) Batch Reporting Platform BI Dashboard Confluent Server Kafka Connect Laboratory (Windows Machines) Confluent Platform Other Components Model Training and Scoring (Confluent Python Client + TensorFlow) All Data Processed Images Images Human Intelligence www.kai-waehner.de | @KaiWaehner Streaming ETL (ksqlDB) Stateful Workflow Orchestration (Kafka Streams) Database (MySQL) Kafka Connect (Debezium CDC) Historical Drugs Data
  • 25. 31 How do you implement a Kafka ML infrastructure? www.kai-waehner.de | @KaiWaehner
  • 26. 32 Ingestion of Images Replication MirrorMaker 2 / Confluent Replicator Kafka Connect Laboratory www.kai-waehner.de | @KaiWaehner
  • 27. 33 Data Preprocessing Preprocessing Filter, transform, anonymize, extract features, reduce noise, enhance brightness / contrast Streams Data Ready For Model Training www.kai-waehner.de | @KaiWaehner
  • 28. 34 SELECT image_id, experiment_id, image_details FROM image_channel i LEFT JOIN experiment_database e ON i.experiment_id = e.experiment_id WHERE e.image_type = ‘black_and_white'; Data Processing with ksqlDB www.kai-waehner.de | @KaiWaehner
  • 29. 37 TensorFlow Model — Convolutional Neural Network (CNN) for Image Recognition (as part of the ML Pipeline) www.kai-waehner.de | @KaiWaehner
  • 30. 38 Direct streaming ingestion for model training and / or scoring with TensorFlow I/O + Kafka Plugin (no additional data storage like S3 or HDFS required!) Time Model BModel A Producer Distributed Commit Log Streaming Ingestion and Model Training with TensorFlow IO https://github.com/tensorflow/io www.kai-waehner.de | @KaiWaehner
  • 32. 40 Today, Kafka works well for recent events, short horizon storage, and manual data balancing CONFIDENTIAL Kafka’s present-day design offers extraordinarily low messaging latency by storing topic data on fast disks that are collocated with brokers. This is usually good. But sometimes, you need to store a huge amount of data for a long time. Kafka Processing App Storage Transactions, auth, quota enforcement, compaction, ... www.kai-waehner.de | @KaiWaehner
  • 33. 41Tiered Storage for Kafka Object Store Processing Storage Transactions, auth, quota enforcement, compaction, ... Local Remote Kafka Apps Store Forever Older data is offloaded to inexpensive object storage, permitting it to be consumed at any time. Save $$$ Storage limitations, like capacity and duration, are effectively uncapped. Instantaneously scale up and down Your Kafka clusters will be able to automatically self-balance load and hence elastically scale (Only available in Confluent Platform)www.kai-waehner.de | @KaiWaehner
  • 34. 43 Reprocessing of Events ● New Consumer ○ e.g. a complete new microservices or a replacement of an existing application ● Error-Handling ○ Re-processing of data in case of error: Fix error and process events again ● Compliance / Regulatory Processing ○ Reprocessing of already processed data for legal reasons ○ Could be very old data (e.g. pharma: 10 years old) ● Query and Analysis of Existing Events ○ No need for another data store / data lake ○ Kafka Client Consumer for offset- or timestamp-based consumption of old events ○ ksqlDB (for simple pull queries) ○ Kafka-native analytics tool (e.g. Rockset with Kafka connector and ANSI SQL support for Tableau et al) ● Model Training ○ Consume events for model training with a) different one ML framework and different hyperparameters or b) different ML frameworks www.kai-waehner.de | @KaiWaehner
  • 35. 44 Streaming Machine Learning with Apache Kafka and Tiered Storage https://www.confluent.io/blog/streaming-machine- learning-with-tiered-storage/ www.kai-waehner.de | @KaiWaehner
  • 36. 45 How to deploy the analytic models? www.kai-waehner.de | @KaiWaehner
  • 37. 46 Local Predictions Model Training in Cloud Model Deployment at the Edge Analytic Model Separation of Model Training and Model Inference www.kai-waehner.de | @KaiWaehner
  • 38. 47 Streams Input Event Prediction Request Response Model Serving TensorFlow Serving gRPC / HTTP Application Stream Processing with External Model and RPC www.kai-waehner.de | @KaiWaehner Model
  • 39. 49 Prediction Stream Processing Model doPrediction() return value Stream Processing with Embedded Model Streams Input Event www.kai-waehner.de | @KaiWaehner
  • 40. 50 Prediction Kafka Client Model doPrediction() return value Client Application with Embedded Model Input Event www.kai-waehner.de | @KaiWaehner REST Proxy
  • 41. 52 “CREATE STREAM ImageAnalysis AS SELECT image_id, analyzeImage(image_details) FROM image_channel;“ User Defined Function (UDF) Model Deployment with Apache Kafka, ksqlDB and TensorFlow www.kai-waehner.de | @KaiWaehner
  • 42. 54 Model Training and Scoring with the same ML Pipeline (or even in the same Application) • Data Science team responsible for the whole model lifecycle • Beloved Python tool stack (Pandas, scikit learn, TensorFlow, Jupyter, …) • 24/7 production scale with Confluent Python Client (e.g. deployed in Docker containers on Kubernetes) www.kai-waehner.de | @KaiWaehner
  • 43. 55 Digital Image Processing (External SaaS Service + REST) Kafka, ksqlDB and TensorFlow for Drug Discovery in Real Time at Scale Kafka Client (.NET C++) Batch Reporting Platform BI Dashboard Confluent Server Kafka Connect Laboratory (Windows Machines) Confluent Platform Other Components Model Training and Scoring (Confluent Python Client + TensorFlow) All Data Processed Images Images Human Intelligence www.kai-waehner.de | @KaiWaehner Streaming ETL (ksqlDB) Stateful Workflow Orchestration (Kafka Streams) Database (MySQL) Kafka Connect (Debezium CDC) Historical Drugs Data
  • 44. 5656 Machine Learning + Apache Kafka à Examples @ Github https://github.com/kaiwaehner www.kai-waehner.de | @KaiWaehner
  • 45. 57 Image processing, workflow orchestration, legacy integration... ??? www.kai-waehner.de | @KaiWaehner
  • 46. 58 Image / Video Processing with Kafka Kafka-native Image / Video Processing vs. Chunk + re-assemble vs. Metadata-only + Object Store à All approaches are fine! J https://www.slideshare.net/JiangjieQin/handle-large-messages-in-apache-kafka-58692297 https://www.confluent.io/blog/bust-the-burglars-machine-learning-with-tensorflow-and-apache-kafka/ www.kai-waehner.de | @KaiWaehner
  • 47. 59 Workflow Orchestration Kafka-native vs. External Tool à both approaches are fine! J https://github.com/nsaje/dagger https://eventil.com/events/using-apache-kafka-in-a-closed-environment-with-centralized-orchestration https://zeebe.io/blog/2019/08/official-kafka-connector-for-zeebe/ www.kai-waehner.de | @KaiWaehner
  • 48. 60 Legacy Integration Date Amount 1/27/2017 $4.56 1/22/2017 $32.14 Mainframe Transaction Data Traditional Middleware Application Kafka Microservices Agile, lightweight (but scalable robust) Kafka microservice Big Data project (Elastic, Spark, AWS Services, …) 1) Direct Legacy MQ Communication with App 2) Kafka for decoupling between MQ and App 3) Direct communication via Kafka (no MQ anymore) 4) New projects and applications (independent or related to the existing migration projects) External Solution www.kai-waehner.de | @KaiWaehner
  • 50. 63 Generate added value from data The pharmaceutical industry today has an unprecedented wealth of opportunities to generate added value from data. These possibilities cover all relevant areas such as: • R&D / Engineering • Sales & Marketing • Manufacturing & QA • Supply Chain • Product Monitoring / After Sales Support Novel data use: • Better therapies • Faster and more accurate diagnoses • Faster drug development • Improvement of clinical studies • Real-World Data Generation • real-world evidence • Precision Medicine • Support Remote Health etc Challenges: • Data silos • Data integration • Data growth/explosion • Cloud & on-prem • Use of new technologies like AI/ML • Time2Market • Regulatory Affairs • Security • Performance • API Streams Processing / AI / ML Clinical Trials Patents, Text etc Structured & unstructured Data IoT & Business Applications Multi-Hybrid- Cloud
  • 51. 64 Questions? Let’s connect... Kai Waehner Technology Evangelist kai.waehner@confluent.io @KaiWaehner www.confluent.io www.kai-waehner.de LinkedIn