SlideShare a Scribd company logo
1 of 25
Download to read offline
14. January 2019, Apache Kafka Meetup, Hamburg
Stefan Frehse, Software Architect, Digitalization
Ingesting IoT data in Food Processing
Machines for Food Processing
Example: fish machines
2
Machines for Food Processing
3
11.01.20194
BAADER – Digital opportunities
A data driven food value chain
BAADER approach
Data-driven company by Apache Kafka
Summary
Agenda
BAADER Digitalization
OPPORTUNITIES OF FUTURE VALUE ADD
CURRENT VALUE CREATION
OPPORTUNITIES OF FUTURE VALUE
ADD
Machine information
runtime, speed, energy consumption
Product Data
Size, Weight, Quality
Business Process Data
Supplier, Order, Customer, Shipment
INFORMATION CHAIN
BAADER – Digital Opportunities
Expanding own value added through business process data capturing and using
BAADER Digitalization5
OPPORTUNITY OF FUTURE VALUE ADD OPPORTUNITY OF FUTURE VALUE ADD
11.01.20196
Capturing data from farm to fork
A data-driven Food Value Chain
6
Farm inputs Primary sector Transport Processing Retail Consumers
Farmers/input suppliers
• Optimize yield and operational management
• Anticipate future developments
• Provide product and process guarantees
Offtakers
• Enable market and consumer-driven production
• Shorten and optimize production chains
Value Chain
• Predict market dynamics to anticipate supply/demand and negotiate better prices
• Meet demand by consumers for more transparency, e.g. provenance, production methods
Reduce food waste and increase production
BAADER Digitalization
Recycling
11.01.20197 BAADER Digitalization7
ONE digital platform
as basis for all
future solutions in
the food processing
industry
Our vision
BAADER Approach
* Artificial Intelligence
11.01.20198
Visibility, consensus, performance for a data-driven company
Collecting data along the food value chain
8
Shared, replicated, permissioned
Farm record Transport
record
Weather
record
Processing
record
Packaging
record
Retailers
record
Auditors
record
End user
record
BAADER Digitalization
Single source of truth for a
Data-driven company
11.01.20199
Gathering data for food safety
Traceability
Temperature
Humidity
Air exchange
Transport time
Stunner settings
Time hanging
Transparency
Animal welfare indicators in poultry processing
Very brief overview
Vision: A data-driven company
10
Arbitrary
records
Apache Kafka
Transformation Aggregation
Central data log
Remote Service
Machine R&D
Predictive Maintenance
API
Dashboards / Reports
Collecting Farming and Transportation Records
Gathered from the farming process
● Farming: Chickens grow up and are catched after a
few weeks
● Critical parameters:
○ Temperature
○ Humidity
○ CO2
● GPS positions of the truck to forecast the ETA
○ Optimal preparation for reception
● Load information: number of birds, weight, sex
● => Animal welfare has a strong focus
11
+
Collecting Machine Data
Gathered from the slaughterhouse (factory)
● Installed OPC UA server (M2M protocol) provides
machine data
○ power consumption, voltage, line speed, air &
water temperature, bone size, fillet size,
quality, weight
● local IoT-gateway subscribes data from OPC UA
and forward the data to AWS IoT Core as they
occur
12
Processing record
Example: Collecting data from camera systems
11.01.201913
Camera before
chilling
Camera after
chilling
Upstreaming
steps+
Downstreaming
steps+
10:00 pm 1:00 pm Timeline
BAADER Digitalization
~190 birds/min
Evisceration Line CutUp Line
~190 birds/min
Product quality: Chicken
Collecting data from a camera system
11.01.201914 BAADER Digitalization
● Camera takes 2-4 pictures of
each chicken
● Images are analyzed in
real-time at the edge
● Quality and potential defects
are provided
● Sorting based on the quality
for downstreaming steps
(cutup, whole bird, …)
Data ingestion into Apache Kafka
Keeping the quality
11.01.2019BAADER Digitalization15
Apache Kafka
Cluster
MQTTAdaptor
AWSIoTCore(MQTT)
Elasticsearch / Kibana
MongoDB
AWS S3
KafkaConnector
Demo
Streaming SQL Processing
GPS
Machinedata
Weather
MQTT over TLS
Data source
KSQL and Chicken Quality
Creating a stream
CREATE STREAM quality (
thingId VARCHAR,
value STRUCT<quality STRUCT<code INT>>)
WITH (KAFKA_TOPIC='camera', VALUE_FORMAT='json');
● Creates a KSQL stream of the quality from the camera.
16
KSQL and Chicken Quality
Creating a windowed table for efficiency monitoring
CREATE TABLE empty_shackles_per_5_minutes AS
SELECT thingId, COUNT(*) FROM quality
WINDOW TUMBLING (SIZE 5 MINUTES)
WHERE value->quality->code = 4
GROUP by thingId;
● Create a table over a 5 minute window for empty shackles (code = 4)
● Application / Alerting: Notify factory manager about potential efficiency drop
17
Kafka and Transportation Data
Computing truck arrivals
● Topic that contains the GPS positions of a truck
18
{
"thingId": "4d5cc1d6-50ca-4bf9-bb6c-ddfaa9043d6a",
"thingType": "baader::dt::vehicle::daf::vehicle_geo_position",
"thingVersion": "0",
"value": {
"Latitude":"5X.6155128",
"Longitude":"-X.9469323",
“LastUpdateDataTimeInUtc":"2018-09-14T11:51:44.000Z"
}
"timestamp": 1536955501460
}
ETA Computation
Using KSQL for computing ETA
● KSQL: Added own stateless functions for SQL stream processing
● UDFs: User-defined functions in Java and just putting the compiled jar into the
confluent-platform environment
● Implemented a UDF for computing the ETA using Google Distance API
● Derived a new stream that calls the distance UDF
● The ETA is presented in a dashboard
19
Drive new businesses
Writing streaming applications
● Building a central data lake for machine data (power, voltage, water
consumptions) -> next machine generation
● Providing a transparent view of the entire food value chain
●
● Using easy-to-use streaming SQL (e.g., KSQL, LSQL) to create derived streams
○ Computing windowed reached SLA for the machines
○ Efficiency by computing number of empty shackles
○ Alerts when violating thresholds
20
Value of Apache Kafka
Context Digitalization
● Building streaming applications in the context of IoT
● Event-driven architectures
● Easy experimentation with low cost
● Single source of truth for a data-driven company
● … and the usual suspects
● Fault-tolerance
● Strict ordering
● High-Availability
● Scalability
21
Confluent Cloud Enterprise
● Fully managed Kafka Cluster for dev. and prod.
● Just configure: throughput and retention time
● Excellent support in terms of response and quality
● Secured environment using VPC peering on AWS
● Open-Source contributor to Apache Kafka and its entire ecosystem
22
Values
Today’s numbers
At the very early stage
● 40-50 msg/s
● Total: 140 data points
● 100 GB gathered from a single factory within a few months
● There is much more, but...
23
We are hiring:
https://stackoverflow.com/jobs/companies/baader
Backup
Kibana dashboard (15 min) of the camera system
25

More Related Content

What's hot

Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
confluent
 
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
Kai Wähner
 
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
Flink Forward
 
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
confluent
 

What's hot (20)

Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
 
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
 
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniertFast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
 
Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...
Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...
Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...
 
Elastically Scaling Kafka Using Confluent
Elastically Scaling Kafka Using ConfluentElastically Scaling Kafka Using Confluent
Elastically Scaling Kafka Using Confluent
 
Application modernization patterns with apache kafka, debezium, and kubernete...
Application modernization patterns with apache kafka, debezium, and kubernete...Application modernization patterns with apache kafka, debezium, and kubernete...
Application modernization patterns with apache kafka, debezium, and kubernete...
 
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWSBridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
 
Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...
Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...
Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...
 
Bridge Your Kafka Streams to Azure Webinar
Bridge Your Kafka Streams to Azure WebinarBridge Your Kafka Streams to Azure Webinar
Bridge Your Kafka Streams to Azure Webinar
 
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
 
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
 
One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)
 
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes EverythingThe Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
 
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
 
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
 
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
 
ksqlDB Workshop
ksqlDB WorkshopksqlDB Workshop
ksqlDB Workshop
 
What's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talkWhat's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talk
 
Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
 

Similar to Ingesting IoT data in Food Processing

Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
confluent
 
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
confluent
 
Machine Learning with Apache Spark
Machine Learning with Apache SparkMachine Learning with Apache Spark
Machine Learning with Apache Spark
IBM Cloud Data Services
 
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
Ryousei Takano
 

Similar to Ingesting IoT data in Food Processing (20)

Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
 
Adjust Workshop - PUSHING AND PULLING YOUR DATA
Adjust Workshop - PUSHING AND PULLING YOUR DATA Adjust Workshop - PUSHING AND PULLING YOUR DATA
Adjust Workshop - PUSHING AND PULLING YOUR DATA
 
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
 
Getting started with azure event hubs and stream analytics services
Getting started with azure event hubs and stream analytics servicesGetting started with azure event hubs and stream analytics services
Getting started with azure event hubs and stream analytics services
 
EDA Meets Data Engineering – What's the Big Deal?
EDA Meets Data Engineering – What's the Big Deal?EDA Meets Data Engineering – What's the Big Deal?
EDA Meets Data Engineering – What's the Big Deal?
 
Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...
Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...
Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...
 
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
 
Serving the Real-Time Data Needs of an Airport with Kafka Streams and KSQL
Serving the Real-Time Data Needs of an Airport with Kafka Streams and KSQLServing the Real-Time Data Needs of an Airport with Kafka Streams and KSQL
Serving the Real-Time Data Needs of an Airport with Kafka Streams and KSQL
 
Cloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera - IoT & Smart Cities
Cloudera - IoT & Smart Cities
 
Cf summit-2016-monitoring-cf-sensu-graphite
Cf summit-2016-monitoring-cf-sensu-graphiteCf summit-2016-monitoring-cf-sensu-graphite
Cf summit-2016-monitoring-cf-sensu-graphite
 
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
Serhii Kholodniuk: What you need to know, before migrating data platform to G...Serhii Kholodniuk: What you need to know, before migrating data platform to G...
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
 
Machine Learning with Apache Spark
Machine Learning with Apache SparkMachine Learning with Apache Spark
Machine Learning with Apache Spark
 
OSDC 2019 | Democratizing Data at Go-JEK by Maulik Soneji
OSDC 2019 | Democratizing Data at Go-JEK by Maulik SonejiOSDC 2019 | Democratizing Data at Go-JEK by Maulik Soneji
OSDC 2019 | Democratizing Data at Go-JEK by Maulik Soneji
 
Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...
Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...
Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...
 
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
 
Samsara Fleet Monitoring/Reefer Temperature Solution
Samsara Fleet Monitoring/Reefer Temperature SolutionSamsara Fleet Monitoring/Reefer Temperature Solution
Samsara Fleet Monitoring/Reefer Temperature Solution
 
Industry 4.0 for beginners
Industry 4.0 for beginnersIndustry 4.0 for beginners
Industry 4.0 for beginners
 
SOLUSI INDUSTRIAL IOT CONTEC
SOLUSI INDUSTRIAL IOT CONTECSOLUSI INDUSTRIAL IOT CONTEC
SOLUSI INDUSTRIAL IOT CONTEC
 
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
 
IoT Connected Brewery
IoT Connected BreweryIoT Connected Brewery
IoT Connected Brewery
 

More from confluent

More from confluent (20)

Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Era
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Santander Stream Processing with Apache Flink
Santander Stream Processing with Apache FlinkSantander Stream Processing with Apache Flink
Santander Stream Processing with Apache Flink
 
Unlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsUnlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insights
 
Workshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con FlinkWorkshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con Flink
 
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
 
AWS Immersion Day Mapfre - Confluent
AWS Immersion Day Mapfre   -   ConfluentAWS Immersion Day Mapfre   -   Confluent
AWS Immersion Day Mapfre - Confluent
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalk
 
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent CloudQ&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
 
Citi TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Dive
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluent
 
Q&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service MeshQ&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service Mesh
 
Citi Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka MicroservicesCiti Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka Microservices
 
Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3
 
Citi Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging ModernizationCiti Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging Modernization
 
Citi Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataCiti Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time data
 
Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2
 
Data In Motion Paris 2023
Data In Motion Paris 2023Data In Motion Paris 2023
Data In Motion Paris 2023
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesis
 

Recently uploaded

Recently uploaded (20)

WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
Strategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering TeamsStrategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering Teams
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
A Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyA Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System Strategy
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
Designing for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastDesigning for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at Comcast
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
The UX of Automation by AJ King, Senior UX Researcher, Ocado
The UX of Automation by AJ King, Senior UX Researcher, OcadoThe UX of Automation by AJ King, Senior UX Researcher, Ocado
The UX of Automation by AJ King, Senior UX Researcher, Ocado
 

Ingesting IoT data in Food Processing

  • 1. 14. January 2019, Apache Kafka Meetup, Hamburg Stefan Frehse, Software Architect, Digitalization Ingesting IoT data in Food Processing
  • 2. Machines for Food Processing Example: fish machines 2
  • 3. Machines for Food Processing 3
  • 4. 11.01.20194 BAADER – Digital opportunities A data driven food value chain BAADER approach Data-driven company by Apache Kafka Summary Agenda BAADER Digitalization
  • 5. OPPORTUNITIES OF FUTURE VALUE ADD CURRENT VALUE CREATION OPPORTUNITIES OF FUTURE VALUE ADD Machine information runtime, speed, energy consumption Product Data Size, Weight, Quality Business Process Data Supplier, Order, Customer, Shipment INFORMATION CHAIN BAADER – Digital Opportunities Expanding own value added through business process data capturing and using BAADER Digitalization5 OPPORTUNITY OF FUTURE VALUE ADD OPPORTUNITY OF FUTURE VALUE ADD
  • 6. 11.01.20196 Capturing data from farm to fork A data-driven Food Value Chain 6 Farm inputs Primary sector Transport Processing Retail Consumers Farmers/input suppliers • Optimize yield and operational management • Anticipate future developments • Provide product and process guarantees Offtakers • Enable market and consumer-driven production • Shorten and optimize production chains Value Chain • Predict market dynamics to anticipate supply/demand and negotiate better prices • Meet demand by consumers for more transparency, e.g. provenance, production methods Reduce food waste and increase production BAADER Digitalization Recycling
  • 7. 11.01.20197 BAADER Digitalization7 ONE digital platform as basis for all future solutions in the food processing industry Our vision BAADER Approach * Artificial Intelligence
  • 8. 11.01.20198 Visibility, consensus, performance for a data-driven company Collecting data along the food value chain 8 Shared, replicated, permissioned Farm record Transport record Weather record Processing record Packaging record Retailers record Auditors record End user record BAADER Digitalization Single source of truth for a Data-driven company
  • 9. 11.01.20199 Gathering data for food safety Traceability Temperature Humidity Air exchange Transport time Stunner settings Time hanging Transparency Animal welfare indicators in poultry processing
  • 10. Very brief overview Vision: A data-driven company 10 Arbitrary records Apache Kafka Transformation Aggregation Central data log Remote Service Machine R&D Predictive Maintenance API Dashboards / Reports
  • 11. Collecting Farming and Transportation Records Gathered from the farming process ● Farming: Chickens grow up and are catched after a few weeks ● Critical parameters: ○ Temperature ○ Humidity ○ CO2 ● GPS positions of the truck to forecast the ETA ○ Optimal preparation for reception ● Load information: number of birds, weight, sex ● => Animal welfare has a strong focus 11 +
  • 12. Collecting Machine Data Gathered from the slaughterhouse (factory) ● Installed OPC UA server (M2M protocol) provides machine data ○ power consumption, voltage, line speed, air & water temperature, bone size, fillet size, quality, weight ● local IoT-gateway subscribes data from OPC UA and forward the data to AWS IoT Core as they occur 12
  • 13. Processing record Example: Collecting data from camera systems 11.01.201913 Camera before chilling Camera after chilling Upstreaming steps+ Downstreaming steps+ 10:00 pm 1:00 pm Timeline BAADER Digitalization ~190 birds/min Evisceration Line CutUp Line ~190 birds/min
  • 14. Product quality: Chicken Collecting data from a camera system 11.01.201914 BAADER Digitalization ● Camera takes 2-4 pictures of each chicken ● Images are analyzed in real-time at the edge ● Quality and potential defects are provided ● Sorting based on the quality for downstreaming steps (cutup, whole bird, …)
  • 15. Data ingestion into Apache Kafka Keeping the quality 11.01.2019BAADER Digitalization15 Apache Kafka Cluster MQTTAdaptor AWSIoTCore(MQTT) Elasticsearch / Kibana MongoDB AWS S3 KafkaConnector Demo Streaming SQL Processing GPS Machinedata Weather MQTT over TLS Data source
  • 16. KSQL and Chicken Quality Creating a stream CREATE STREAM quality ( thingId VARCHAR, value STRUCT<quality STRUCT<code INT>>) WITH (KAFKA_TOPIC='camera', VALUE_FORMAT='json'); ● Creates a KSQL stream of the quality from the camera. 16
  • 17. KSQL and Chicken Quality Creating a windowed table for efficiency monitoring CREATE TABLE empty_shackles_per_5_minutes AS SELECT thingId, COUNT(*) FROM quality WINDOW TUMBLING (SIZE 5 MINUTES) WHERE value->quality->code = 4 GROUP by thingId; ● Create a table over a 5 minute window for empty shackles (code = 4) ● Application / Alerting: Notify factory manager about potential efficiency drop 17
  • 18. Kafka and Transportation Data Computing truck arrivals ● Topic that contains the GPS positions of a truck 18 { "thingId": "4d5cc1d6-50ca-4bf9-bb6c-ddfaa9043d6a", "thingType": "baader::dt::vehicle::daf::vehicle_geo_position", "thingVersion": "0", "value": { "Latitude":"5X.6155128", "Longitude":"-X.9469323", “LastUpdateDataTimeInUtc":"2018-09-14T11:51:44.000Z" } "timestamp": 1536955501460 }
  • 19. ETA Computation Using KSQL for computing ETA ● KSQL: Added own stateless functions for SQL stream processing ● UDFs: User-defined functions in Java and just putting the compiled jar into the confluent-platform environment ● Implemented a UDF for computing the ETA using Google Distance API ● Derived a new stream that calls the distance UDF ● The ETA is presented in a dashboard 19
  • 20. Drive new businesses Writing streaming applications ● Building a central data lake for machine data (power, voltage, water consumptions) -> next machine generation ● Providing a transparent view of the entire food value chain ● ● Using easy-to-use streaming SQL (e.g., KSQL, LSQL) to create derived streams ○ Computing windowed reached SLA for the machines ○ Efficiency by computing number of empty shackles ○ Alerts when violating thresholds 20
  • 21. Value of Apache Kafka Context Digitalization ● Building streaming applications in the context of IoT ● Event-driven architectures ● Easy experimentation with low cost ● Single source of truth for a data-driven company ● … and the usual suspects ● Fault-tolerance ● Strict ordering ● High-Availability ● Scalability 21
  • 22. Confluent Cloud Enterprise ● Fully managed Kafka Cluster for dev. and prod. ● Just configure: throughput and retention time ● Excellent support in terms of response and quality ● Secured environment using VPC peering on AWS ● Open-Source contributor to Apache Kafka and its entire ecosystem 22 Values
  • 23. Today’s numbers At the very early stage ● 40-50 msg/s ● Total: 140 data points ● 100 GB gathered from a single factory within a few months ● There is much more, but... 23
  • 25. Backup Kibana dashboard (15 min) of the camera system 25