Using the FLaNK Stack for edge ai (flink, nifi, kafka, kudu)

Timothy Spann
Timothy SpannDeveloper Advocate
Using The FLaNK Stack for Edge AI
Timothy Spann
Principal DataFlow Field Engineer
Cloudera
@PaasDev
2© 2020 Cloudera, Inc. All rights reserved.
© 2020 Cloudera, Inc. All rights reserved. 3
FLaNK Speaker
Who am I?
Principal DataFlow Field Engineer
@PaasDev
DZone Zone Leader and Big Data MVB
Future of Data Meetup Leader
ex-Pivotal Field Engineer
https://github.com/tspannhw https://www.datainmotion.dev/
© 2020 Cloudera, Inc. All rights reserved. 4
Welcome to Future of Data - Princeton - Virtual
@PaasDev
https://www.meetup.com/futureofdata-princeton/
From Big Data to AI to Streaming to Containers to
Cloud to Analytics to Cloud Storage to Fast Data to
Machine Learning to Microservices to ...
© 2020 Cloudera, Inc. All rights reserved. 5
Cloudera Commitment to the Community
Keynote @ Flink Forward 2020 San Francisco
https://www.youtube.com/watch?v=ckcOyRA6ZOc
● We support Flink deployments both
in the public cloud and on premise
● 2 PMC members onboard
● Multiple service integrations:
Schema Registry
HBase
Kudu
Atlas
Knox
● Regular blog posts, talks and
Apache code contributions
6
Analyze
Streaming OLAP Analytics &
Time Series Store Powered
by
Druid & Kudu
Buffer
Apache Kafka
Topics
Ingest Gateway
Powered by Kafka
Distribute
Apache NiFi
Data Flow Apps
Powered by NiFi
Buffer
Apache Kafka
Syndicate
topics
Syndicate Services
Powered by Kafka
Collect
Syndicate
topics
Syndicate Services
Powered by Kafka
Replication /
Data Deployment
Analyze
Streaming Analytics Apps
Stream Processing
Powered by Flink
Demo Reference Architecture
Data Collection
at the Edge
Apache NiFi / MiNiFi
- sensors, IoT
- databases
- file systems
- app sidecar
- live streams
- MQ
- logs
- network
Anything… you
name it!
© 2020 Cloudera, Inc. All rights reserved. 7
THE COMPLETE AND CONNECTED DATA LIFECYCLE
Collect
Edge & Flow
Management
ActData-in-
Motion
Curate
Data
Engineering
Report
Data
Warehouse
Serve
Operational
Database
Predict
Machine
Learning and AI
Data-at-
Rest
A Connected Data Lifecycle is Critical to Meet the Needs of Real-time Use CasesPOWERED BY
Distribute
Flow Management
Buffer
Streams
Messaging
Analyze
Streaming
Analytics
Enrichment Operational insights ScoringBatch curation
© 2020 Cloudera, Inc. All rights reserved. 8
End-to-End Schema and Lineage Management
© 2020 Cloudera, Inc. All rights reserved. 9
Streaming Data Pipelines with NiFi + Kafka + Flink
© 2020 Cloudera, Inc. All rights reserved. 10
CLOUDERA DATAFLOW DATA-IN-MOTION PLATFORM
11
Where Can I Run FLaNK Easily?
CDP services are optimized for the elastic compute &
‘always-on’ storage services provided by any cloud provider
Web service hosted and managed by Cloudera
Hosted in the your cloud environment, but managed by the
CDP Management Console
Shared Data Experience (SDX) technologies form a secure and
governed data lake backed by object storage (S3, ADLS, GCS)
Flow Management Streams Messaging Streaming Analytics
12
What is NiFi used for?
© 2020 Cloudera, Inc. All rights reserved. 13
Cloudera Flow Management
Enable easy ingestion, routing, management and delivery of any data anywhere (Edge, cloud,
data center) to any downstream system with built in end-to-end security and provenance
ACQUIRE PROCESS DELIVER
• Over 300 Prebuilt Processors
• Easy to build your own
• Parse, Enrich & Apply Schema
• Filter, Split, Merger & Route
• Throttle & Backpressure
• Guaranteed Delivery
• Full data provenance from acquisition to
delivery
• Diverse, Non-Traditional Sources
• Eco-system integration
Advanced tooling to industrialize flow development
(Flow Development Life Cycle)
© 2020 Cloudera, Inc. All rights reserved. 14
https://blog.cloudera.com/benchmarking-nifi-performance-and-scalability/
NiFi Processing Billions of Events
© 2020 Cloudera, Inc. All rights reserved. 15
STREAMS MESSAGING / APACHE KAFKA
Kafka Connect Support
Simple Data Movement
In/Out of Kafka
Schema Registry Ranger Plugin
Improved ACL and Audit for
Kafka and Schema Registry
Cruise Control Support
Intelligent Kafka Cluster
Rebalancing & Self Healing
© 2020 Cloudera, Inc. All rights reserved. 16
Key Capabilities
STREAMING ANALYTICS / APACHE FLINK
Flink SQL Support
Agile Streaming App
Development using SQL
Apache Flink Atlas Hook
Capture operational Flink
app metadata and lineage
Single View of Flink Yarn Jobs
Improve Developer Experience
& operational visibility
Demo
18
Edge AI to Cloud Streaming Pipeline
Device Data
SensorsEnergy Logs
Weather
Sensors
Aggregates
Energy
SQL
Analytics
MiNiFi
Agent
Deep Learning
Classification
Edge Private
Cloud
Multi-Public
Cloud
If You Missed This Live
MiNiFi Agents
© 2020 Cloudera, Inc. All rights reserved. 21
MiNiFi Java Agent
● Reads Sensor
Logs
● OpenVino NCC2
AI
● Reads Images
● Sends to NiFi
Gateway
Apache NiFi Gateway
processors, validates,
transforms, cleans,
routes and streams
events for additional
processing through
Apache Kafka topics.
MiNiFi Agents Running Deep Learning Classification And Sending Images and Results to NiFi Gateways
22© 2020 Cloudera, Inc. All rights reserved.
FLINK SQL DEMO - DML - CATALOG AND TABLES
SHOW catalogs;
USE CATALOG registry;
SHOW tables;
SELECT * FROM energy;
© 2020 Cloudera, Inc. All rights reserved. 23
INSERT INTO global_sensor_events
SELECT scada.uuid,scada.systemtime,scada.temperaturef,
scada.pressure,scada.humidity,scada.lux,scada.proximity,
scada.oxidising,scada.reducing,scada.nh3,scada.gasko,
energy.`current`,energy.voltage,energy.`power`,
energy.`total`,energy.fanstatus
FROM energy, scada
WHERE scada.systemtime = energy.systemtime;
WHERE IS THE FLINK CODE!??!
24© 2020 Cloudera, Inc. All rights reserved.
FLINK SQL DEMO
25© 2020 Cloudera, Inc. All rights reserved.
26© 2020 Cloudera, Inc. All rights reserved.
FLINK SQL DEMO - FLINK DASHBOARD
27© 2020 Cloudera, Inc. All rights reserved.
FLINK SQL DEMO
28© 2020 Cloudera, Inc. All rights reserved.
FLINK SQL DEMO
© 2020 Cloudera, Inc. All rights reserved. 29
{"uuid": "rpi4_uuid_jfx_20200826203733", "amplitude100": 1.2, "amplitude500": 0.6, "amplitude1000": 0.3, "lownoise": 0.6,
"midnoise": 0.2, "highnoise": 0.2, "amps": 0.3, "ipaddress": "192.168.1.76", "host": "rp4", "host_name": "rp4", "macaddress":
"6e:37:12:08:63:e1", "systemtime": "08/26/2020 16:37:34", "endtime": "1598474254.75", "runtime": "28179.03", "starttime":
"08/26/2020 08:47:54", "cpu": 48.3, "cpu_temp": "72.0", "diskusage": "40219.3 MB", "memory": 24.3, "id":
"20200826203733_28ce9520-6832-4f80-b17d-f36c21fd8fc9", "temperature": "47.2", "adjtemp": "35.8", "adjtempf": "76.4",
"temperaturef": "97.0", "pressure": 1010.0, "humidity": 8.3, "lux": 67.4, "proximity": 0, "oxidising": 77.9, "reducing": 184.6, "nh3":
144.7, "gasKO": "Oxidising: 77913.04 OhmsnReducing: 184625.00 OhmsnNH3: 144651.47 Ohms"}
SHOW ME THE DATA
© 2020 Cloudera, Inc. All rights reserved. 30
BME280 - temperature, pressure, humidity sensor
LTR-559 - light and proximity sensor
MICS6814 - analog gas sensor
ADS1015 ADC
MEMS - microphone
0.96-inch, 160 x 80 color LCD
WHERE DID THAT DATA COME FROM?
© 2020 Cloudera, Inc. All rights reserved. 31
© 2020 Cloudera, Inc. All rights reserved. 32
© 2020 Cloudera, Inc. All rights reserved. 33
© 2020 Cloudera, Inc. All rights reserved. 34
© 2020 Cloudera, Inc. All rights reserved. 35
© 2020 Cloudera, Inc. All rights reserved. 36
© 2020 Cloudera, Inc. All rights reserved. 37
Learn More
39© 2020 Cloudera, Inc. All rights reserved.
DEMO SOURCE CODE
● https://github.com/tspannhw/FlinkForwardGlobal2020
● https://github.com/tspannhw/ApacheConAtHome2020
● https://github.com/tspannhw/minifi-xaviernx
● https://github.com/tspannhw/minifi-jetson-nano
● https://github.com/tspannhw/minifi-enviroplus
The code, build scripts, schemas, table
DDL, Flink SQL, Kafka Connect
configuration, NiFi flows, HBase
tables, Kudu tables, Hive tables, HDFS
directories, alerts, images, HTML,
docs, links and all the goodies are
here. Please fork and contribute.
40© 2020 Cloudera, Inc. All rights reserved.
FLINK SQL LINKS
● https://github.com/cloudera/flink-tutorials
● https://docs.cloudera.com/csa/1.1.0/job-lifecycle/topics/csa-run-job.html
● https://github.com/asdaraujo/edge2ai-workshop/tree/master/flink
● https://ci.apache.org/projects/flink/flink-docs-stable/dev/table/sqlClient.html
● https://ci.apache.org/projects/flink/flink-docs-release-1.10/dev/table/sqlClient.html
● https://github.com/tspannhw/FlinkSQLDemo
● https://www.datainmotion.dev/2020/05/flank-low-code-streaming-populating.html
● https://github.com/tspannhw/meetup-sensors/tree/master/flink-sql
● https://www.datainmotion.dev/2020/05/flink-sql-preview.html
41© 2020 Cloudera, Inc. All rights reserved.
Flink SQL LINKS
● https://docs.cloudera.com/csa/1.2.0/flink-sql-table-api/topics/csa-kafka-regi
stry-avro.html
● https://github.com/tspannhw/ApacheConAtHome2020/
● https://github.com/tspannhw/FlinkSQLDemo
● https://github.com/tspannhw/meetup-sensors/tree/main/flink-sql
● https://github.com/cloudera/flink-tutorials/tree/master/flink-sql-tutorial
© 2020 Cloudera, Inc. All rights reserved. 42
TH N Y U
1 of 42

Recommended

When NOT to use Apache Kafka? by
When NOT to use Apache Kafka?When NOT to use Apache Kafka?
When NOT to use Apache Kafka?Kai Wähner
1.5K views12 slides
Best Practices for ETL with Apache NiFi on Kubernetes - Albert Lewandowski, G... by
Best Practices for ETL with Apache NiFi on Kubernetes - Albert Lewandowski, G...Best Practices for ETL with Apache NiFi on Kubernetes - Albert Lewandowski, G...
Best Practices for ETL with Apache NiFi on Kubernetes - Albert Lewandowski, G...GetInData
1K views56 slides
FLiP Into Trino by
FLiP Into TrinoFLiP Into Trino
FLiP Into TrinoTimothy Spann
700 views45 slides
Real time stock processing with apache nifi, apache flink and apache kafka by
Real time stock processing with apache nifi, apache flink and apache kafkaReal time stock processing with apache nifi, apache flink and apache kafka
Real time stock processing with apache nifi, apache flink and apache kafkaTimothy Spann
786 views10 slides
Introduction to Kafka Streams by
Introduction to Kafka StreamsIntroduction to Kafka Streams
Introduction to Kafka StreamsGuozhang Wang
29.7K views136 slides
TechEvent Databricks on Azure by
TechEvent Databricks on AzureTechEvent Databricks on Azure
TechEvent Databricks on AzureTrivadis
368 views46 slides

More Related Content

What's hot

Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the... by
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...DataWorks Summit/Hadoop Summit
32.8K views36 slides
Stream processing using Kafka by
Stream processing using KafkaStream processing using Kafka
Stream processing using KafkaKnoldus Inc.
1.6K views44 slides
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka by
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaKai Wähner
2K views38 slides
Deploying Flink on Kubernetes - David Anderson by
 Deploying Flink on Kubernetes - David Anderson Deploying Flink on Kubernetes - David Anderson
Deploying Flink on Kubernetes - David AndersonVerverica
2K views48 slides
Running Apache Spark on Kubernetes: Best Practices and Pitfalls by
Running Apache Spark on Kubernetes: Best Practices and PitfallsRunning Apache Spark on Kubernetes: Best Practices and Pitfalls
Running Apache Spark on Kubernetes: Best Practices and PitfallsDatabricks
2.9K views36 slides
Modernizing to a Cloud Data Architecture by
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
654 views22 slides

What's hot(20)

Stream processing using Kafka by Knoldus Inc.
Stream processing using KafkaStream processing using Kafka
Stream processing using Kafka
Knoldus Inc.1.6K views
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka by Kai Wähner
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner2K views
Deploying Flink on Kubernetes - David Anderson by Ververica
 Deploying Flink on Kubernetes - David Anderson Deploying Flink on Kubernetes - David Anderson
Deploying Flink on Kubernetes - David Anderson
Ververica 2K views
Running Apache Spark on Kubernetes: Best Practices and Pitfalls by Databricks
Running Apache Spark on Kubernetes: Best Practices and PitfallsRunning Apache Spark on Kubernetes: Best Practices and Pitfalls
Running Apache Spark on Kubernetes: Best Practices and Pitfalls
Databricks2.9K views
Modernizing to a Cloud Data Architecture by Databricks
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
Databricks654 views
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap... by Flink Forward
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Flink Forward3.2K views
0-60: Tesla's Streaming Data Platform ( Jesse Yates, Tesla) Kafka Summit SF 2019 by confluent
0-60: Tesla's Streaming Data Platform ( Jesse Yates, Tesla) Kafka Summit SF 20190-60: Tesla's Streaming Data Platform ( Jesse Yates, Tesla) Kafka Summit SF 2019
0-60: Tesla's Streaming Data Platform ( Jesse Yates, Tesla) Kafka Summit SF 2019
confluent10.2K views
NiFi Best Practices for the Enterprise by Gregory Keys
NiFi Best Practices for the EnterpriseNiFi Best Practices for the Enterprise
NiFi Best Practices for the Enterprise
Gregory Keys3.5K views
Microservice 4.0 Journey - From Spring NetFlix OSS to Istio Service Mesh and ... by Daniel Oh
Microservice 4.0 Journey - From Spring NetFlix OSS to Istio Service Mesh and ...Microservice 4.0 Journey - From Spring NetFlix OSS to Istio Service Mesh and ...
Microservice 4.0 Journey - From Spring NetFlix OSS to Istio Service Mesh and ...
Daniel Oh11.5K views
Apache Spark on K8S Best Practice and Performance in the Cloud by Databricks
Apache Spark on K8S Best Practice and Performance in the CloudApache Spark on K8S Best Practice and Performance in the Cloud
Apache Spark on K8S Best Practice and Performance in the Cloud
Databricks8K views
Best Practices for Middleware and Integration Architecture Modernization with... by Claus Ibsen
Best Practices for Middleware and Integration Architecture Modernization with...Best Practices for Middleware and Integration Architecture Modernization with...
Best Practices for Middleware and Integration Architecture Modernization with...
Claus Ibsen1.5K views
Apache NiFi in the Hadoop Ecosystem by Bryan Bende
Apache NiFi in the Hadoop EcosystemApache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop Ecosystem
Bryan Bende7.9K views
A Thorough Comparison of Delta Lake, Iceberg and Hudi by Databricks
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Databricks11.1K views
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard by Paris Data Engineers !
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardDelta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20... by HostedbyConfluent
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
HostedbyConfluent36K views
Databricks Delta Lake and Its Benefits by Databricks
Databricks Delta Lake and Its BenefitsDatabricks Delta Lake and Its Benefits
Databricks Delta Lake and Its Benefits
Databricks5.1K views
Camel Day Italy 2021 - What's new in Camel 3 by Claus Ibsen
Camel Day Italy 2021 - What's new in Camel 3Camel Day Italy 2021 - What's new in Camel 3
Camel Day Italy 2021 - What's new in Camel 3
Claus Ibsen542 views
Ist Daten-Liberalismus der richtige Weg? by confluent
Ist Daten-Liberalismus der richtige Weg?Ist Daten-Liberalismus der richtige Weg?
Ist Daten-Liberalismus der richtige Weg?
confluent95 views

Similar to Using the FLaNK Stack for edge ai (flink, nifi, kafka, kudu)

Cloudera streaming with flink oct 29, 2020 meetup london by
Cloudera streaming with flink oct 29, 2020 meetup londonCloudera streaming with flink oct 29, 2020 meetup london
Cloudera streaming with flink oct 29, 2020 meetup londonTimothy Spann
495 views20 slides
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka... by
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...Timothy Spann
519 views15 slides
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023 by
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023ssuser73434e
54 views79 slides
Introduction to Apache NiFi 1.11.4 by
Introduction to Apache NiFi 1.11.4Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4Timothy Spann
1.1K views32 slides
Introduction to Apache NiFi 1.10 by
Introduction to Apache NiFi 1.10Introduction to Apache NiFi 1.10
Introduction to Apache NiFi 1.10Timothy Spann
2K views24 slides
Cracking the nut, solving edge ai with apache tools and frameworks by
Cracking the nut, solving edge ai with apache tools and frameworksCracking the nut, solving edge ai with apache tools and frameworks
Cracking the nut, solving edge ai with apache tools and frameworksTimothy Spann
713 views25 slides

Similar to Using the FLaNK Stack for edge ai (flink, nifi, kafka, kudu)(20)

Cloudera streaming with flink oct 29, 2020 meetup london by Timothy Spann
Cloudera streaming with flink oct 29, 2020 meetup londonCloudera streaming with flink oct 29, 2020 meetup london
Cloudera streaming with flink oct 29, 2020 meetup london
Timothy Spann495 views
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka... by Timothy Spann
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
Timothy Spann519 views
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023 by ssuser73434e
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
ssuser73434e54 views
Introduction to Apache NiFi 1.11.4 by Timothy Spann
Introduction to Apache NiFi 1.11.4Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4
Timothy Spann1.1K views
Introduction to Apache NiFi 1.10 by Timothy Spann
Introduction to Apache NiFi 1.10Introduction to Apache NiFi 1.10
Introduction to Apache NiFi 1.10
Timothy Spann2K views
Cracking the nut, solving edge ai with apache tools and frameworks by Timothy Spann
Cracking the nut, solving edge ai with apache tools and frameworksCracking the nut, solving edge ai with apache tools and frameworks
Cracking the nut, solving edge ai with apache tools and frameworks
Timothy Spann713 views
The Never Landing Stream with HTAP and Streaming by Timothy Spann
The Never Landing Stream with HTAP and StreamingThe Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and Streaming
Timothy Spann254 views
[OpenStack Day in Korea 2015] Keynote 2 - Leveraging OpenStack to Realize the... by OpenStack Korea Community
[OpenStack Day in Korea 2015] Keynote 2 - Leveraging OpenStack to Realize the...[OpenStack Day in Korea 2015] Keynote 2 - Leveraging OpenStack to Realize the...
[OpenStack Day in Korea 2015] Keynote 2 - Leveraging OpenStack to Realize the...
Snowflake’s Cloud Data Platform and Modern Analytics by Senturus
Snowflake’s Cloud Data Platform and Modern AnalyticsSnowflake’s Cloud Data Platform and Modern Analytics
Snowflake’s Cloud Data Platform and Modern Analytics
Senturus147 views
cncf overview and building edge computing using kubernetes by Krishna-Kumar
cncf overview and building edge computing using kubernetescncf overview and building edge computing using kubernetes
cncf overview and building edge computing using kubernetes
Krishna-Kumar 402 views
Building Real-time Pipelines with FLaNK_ A Case Study with Transit Data by Timothy Spann
Building Real-time Pipelines with FLaNK_ A Case Study with Transit DataBuilding Real-time Pipelines with FLaNK_ A Case Study with Transit Data
Building Real-time Pipelines with FLaNK_ A Case Study with Transit Data
Timothy Spann193 views
Cracking the nut, solving edge ai with apache tools and frameworks by Timothy Spann
Cracking the nut, solving edge ai with apache tools and frameworksCracking the nut, solving edge ai with apache tools and frameworks
Cracking the nut, solving edge ai with apache tools and frameworks
Timothy Spann504 views
IoT Edge Data Processing with NVidia Jetson Nano oct 3 2019 by Timothy Spann
IoT  Edge Data Processing with NVidia Jetson Nano oct 3 2019IoT  Edge Data Processing with NVidia Jetson Nano oct 3 2019
IoT Edge Data Processing with NVidia Jetson Nano oct 3 2019
Timothy Spann1.5K views
GSJUG: Mastering Data Streaming Pipelines 09May2023 by Timothy Spann
GSJUG: Mastering Data Streaming Pipelines 09May2023GSJUG: Mastering Data Streaming Pipelines 09May2023
GSJUG: Mastering Data Streaming Pipelines 09May2023
Timothy Spann255 views
Cisco connect montreal 2018 compute v final by Cisco Canada
Cisco connect montreal 2018   compute v finalCisco connect montreal 2018   compute v final
Cisco connect montreal 2018 compute v final
Cisco Canada1.6K views
OSSFinance_UnlockingFinancialDatawithReal-TimePipelines.pdf by Timothy Spann
OSSFinance_UnlockingFinancialDatawithReal-TimePipelines.pdfOSSFinance_UnlockingFinancialDatawithReal-TimePipelines.pdf
OSSFinance_UnlockingFinancialDatawithReal-TimePipelines.pdf
Timothy Spann23 views
Session 8 - Creating Data Processing Services | Train the Trainers Program by FIWARE
Session 8 - Creating Data Processing Services | Train the Trainers ProgramSession 8 - Creating Data Processing Services | Train the Trainers Program
Session 8 - Creating Data Processing Services | Train the Trainers Program
FIWARE301 views
StampedeCon 2015 Keynote by Ken Owens
StampedeCon 2015 KeynoteStampedeCon 2015 Keynote
StampedeCon 2015 Keynote
Ken Owens1K views
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015 by StampedeCon
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
StampedeCon1.2K views

More from Timothy Spann

Building Real-Time Travel Alerts by
Building Real-Time Travel AlertsBuilding Real-Time Travel Alerts
Building Real-Time Travel AlertsTimothy Spann
165 views48 slides
JConWorld_ Continuous SQL with Kafka and Flink by
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkTimothy Spann
156 views36 slides
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines by
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data PipelinesTimothy Spann
150 views25 slides
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo by
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoEvolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoTimothy Spann
162 views8 slides
CoC23_ Looking at the New Features of Apache NiFi by
CoC23_ Looking at the New Features of Apache NiFiCoC23_ Looking at the New Features of Apache NiFi
CoC23_ Looking at the New Features of Apache NiFiTimothy Spann
36 views24 slides
CoC23_ Let’s Monitor The Conditions at the Conference by
CoC23_ Let’s Monitor The Conditions at the ConferenceCoC23_ Let’s Monitor The Conditions at the Conference
CoC23_ Let’s Monitor The Conditions at the ConferenceTimothy Spann
17 views17 slides

More from Timothy Spann(20)

Building Real-Time Travel Alerts by Timothy Spann
Building Real-Time Travel AlertsBuilding Real-Time Travel Alerts
Building Real-Time Travel Alerts
Timothy Spann165 views
JConWorld_ Continuous SQL with Kafka and Flink by Timothy Spann
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and Flink
Timothy Spann156 views
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines by Timothy Spann
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
Timothy Spann150 views
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo by Timothy Spann
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoEvolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Timothy Spann162 views
CoC23_ Looking at the New Features of Apache NiFi by Timothy Spann
CoC23_ Looking at the New Features of Apache NiFiCoC23_ Looking at the New Features of Apache NiFi
CoC23_ Looking at the New Features of Apache NiFi
Timothy Spann36 views
CoC23_ Let’s Monitor The Conditions at the Conference by Timothy Spann
CoC23_ Let’s Monitor The Conditions at the ConferenceCoC23_ Let’s Monitor The Conditions at the Conference
CoC23_ Let’s Monitor The Conditions at the Conference
Timothy Spann17 views
CoC23_Utilizing Real-Time Transit Data for Travel Optimization by Timothy Spann
CoC23_Utilizing Real-Time Transit Data for Travel OptimizationCoC23_Utilizing Real-Time Transit Data for Travel Optimization
CoC23_Utilizing Real-Time Transit Data for Travel Optimization
Timothy Spann31 views
Meetup - Brasil - Data In Motion - 2023 September 19 by Timothy Spann
Meetup - Brasil - Data In Motion - 2023 September 19Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19
Timothy Spann319 views
Implement a Universal Data Distribution Architecture to Manage All Streaming ... by Timothy Spann
Implement a Universal Data Distribution Architecture to Manage All Streaming ...Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Timothy Spann28 views
big data fest building modern data streaming apps by Timothy Spann
big data fest building modern data streaming appsbig data fest building modern data streaming apps
big data fest building modern data streaming apps
Timothy Spann317 views
Using Apache NiFi with Apache Pulsar for Fast Data On-Ramp by Timothy Spann
Using Apache NiFi with Apache Pulsar for Fast Data On-RampUsing Apache NiFi with Apache Pulsar for Fast Data On-Ramp
Using Apache NiFi with Apache Pulsar for Fast Data On-Ramp
Timothy Spann163 views
OSSNA Building Modern Data Streaming Apps by Timothy Spann
OSSNA Building Modern Data Streaming AppsOSSNA Building Modern Data Streaming Apps
OSSNA Building Modern Data Streaming Apps
Timothy Spann155 views
BestInFlowCompetitionTutorials03May2023 by Timothy Spann
BestInFlowCompetitionTutorials03May2023BestInFlowCompetitionTutorials03May2023
BestInFlowCompetitionTutorials03May2023
Timothy Spann11 views
Cloudera Sandbox Event Guidelines For Workflow by Timothy Spann
Cloudera Sandbox Event Guidelines For WorkflowCloudera Sandbox Event Guidelines For Workflow
Cloudera Sandbox Event Guidelines For Workflow
Timothy Spann32 views
Meet the Committers Webinar_ Lab Preparation by Timothy Spann
Meet the Committers Webinar_ Lab PreparationMeet the Committers Webinar_ Lab Preparation
Meet the Committers Webinar_ Lab Preparation
Timothy Spann32 views
Best Practices For Workflow by Timothy Spann
Best Practices For WorkflowBest Practices For Workflow
Best Practices For Workflow
Timothy Spann89 views
Meetup: Streaming Data Pipeline Development by Timothy Spann
Meetup:  Streaming Data Pipeline DevelopmentMeetup:  Streaming Data Pipeline Development
Meetup: Streaming Data Pipeline Development
Timothy Spann337 views
DevNexus: Apache Pulsar Development 101 with Java by Timothy Spann
DevNexus:  Apache Pulsar Development 101 with JavaDevNexus:  Apache Pulsar Development 101 with Java
DevNexus: Apache Pulsar Development 101 with Java
Timothy Spann261 views
Conf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices by Timothy Spann
Conf42 Python_ ML Enhanced Event Streaming Apps with Python MicroservicesConf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices
Conf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices
Timothy Spann443 views

Recently uploaded

TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc
176 views29 slides
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...ShapeBlue
120 views17 slides
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...ShapeBlue
183 views18 slides
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueShapeBlue
139 views15 slides
Initiating and Advancing Your Strategic GIS Governance Strategy by
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategySafe Software
184 views68 slides
"Package management in monorepos", Zoltan Kochan by
"Package management in monorepos", Zoltan Kochan"Package management in monorepos", Zoltan Kochan
"Package management in monorepos", Zoltan KochanFwdays
34 views18 slides

Recently uploaded(20)

TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc176 views
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue120 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue183 views
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
ShapeBlue139 views
Initiating and Advancing Your Strategic GIS Governance Strategy by Safe Software
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
Safe Software184 views
"Package management in monorepos", Zoltan Kochan by Fwdays
"Package management in monorepos", Zoltan Kochan"Package management in monorepos", Zoltan Kochan
"Package management in monorepos", Zoltan Kochan
Fwdays34 views
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue224 views
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue225 views
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue178 views
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... by ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue129 views
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by Jasper Oosterveld
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
Why and How CloudStack at weSystems - Stephan Bienek - weSystems by ShapeBlue
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystems
ShapeBlue247 views
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And... by ShapeBlue
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
ShapeBlue108 views
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT by ShapeBlue
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITUpdates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
ShapeBlue208 views
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue by ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue152 views
Business Analyst Series 2023 - Week 4 Session 7 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7
DianaGray10146 views
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023 by BookNet Canada
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
BookNet Canada44 views
The Power of Heat Decarbonisation Plans in the Built Environment by IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE84 views
"Running students' code in isolation. The hard way", Yurii Holiuk by Fwdays
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk
Fwdays36 views

Using the FLaNK Stack for edge ai (flink, nifi, kafka, kudu)

  • 1. Using The FLaNK Stack for Edge AI Timothy Spann Principal DataFlow Field Engineer Cloudera @PaasDev
  • 2. 2© 2020 Cloudera, Inc. All rights reserved.
  • 3. © 2020 Cloudera, Inc. All rights reserved. 3 FLaNK Speaker Who am I? Principal DataFlow Field Engineer @PaasDev DZone Zone Leader and Big Data MVB Future of Data Meetup Leader ex-Pivotal Field Engineer https://github.com/tspannhw https://www.datainmotion.dev/
  • 4. © 2020 Cloudera, Inc. All rights reserved. 4 Welcome to Future of Data - Princeton - Virtual @PaasDev https://www.meetup.com/futureofdata-princeton/ From Big Data to AI to Streaming to Containers to Cloud to Analytics to Cloud Storage to Fast Data to Machine Learning to Microservices to ...
  • 5. © 2020 Cloudera, Inc. All rights reserved. 5 Cloudera Commitment to the Community Keynote @ Flink Forward 2020 San Francisco https://www.youtube.com/watch?v=ckcOyRA6ZOc ● We support Flink deployments both in the public cloud and on premise ● 2 PMC members onboard ● Multiple service integrations: Schema Registry HBase Kudu Atlas Knox ● Regular blog posts, talks and Apache code contributions
  • 6. 6 Analyze Streaming OLAP Analytics & Time Series Store Powered by Druid & Kudu Buffer Apache Kafka Topics Ingest Gateway Powered by Kafka Distribute Apache NiFi Data Flow Apps Powered by NiFi Buffer Apache Kafka Syndicate topics Syndicate Services Powered by Kafka Collect Syndicate topics Syndicate Services Powered by Kafka Replication / Data Deployment Analyze Streaming Analytics Apps Stream Processing Powered by Flink Demo Reference Architecture Data Collection at the Edge Apache NiFi / MiNiFi - sensors, IoT - databases - file systems - app sidecar - live streams - MQ - logs - network Anything… you name it!
  • 7. © 2020 Cloudera, Inc. All rights reserved. 7 THE COMPLETE AND CONNECTED DATA LIFECYCLE Collect Edge & Flow Management ActData-in- Motion Curate Data Engineering Report Data Warehouse Serve Operational Database Predict Machine Learning and AI Data-at- Rest A Connected Data Lifecycle is Critical to Meet the Needs of Real-time Use CasesPOWERED BY Distribute Flow Management Buffer Streams Messaging Analyze Streaming Analytics Enrichment Operational insights ScoringBatch curation
  • 8. © 2020 Cloudera, Inc. All rights reserved. 8 End-to-End Schema and Lineage Management
  • 9. © 2020 Cloudera, Inc. All rights reserved. 9 Streaming Data Pipelines with NiFi + Kafka + Flink
  • 10. © 2020 Cloudera, Inc. All rights reserved. 10 CLOUDERA DATAFLOW DATA-IN-MOTION PLATFORM
  • 11. 11 Where Can I Run FLaNK Easily? CDP services are optimized for the elastic compute & ‘always-on’ storage services provided by any cloud provider Web service hosted and managed by Cloudera Hosted in the your cloud environment, but managed by the CDP Management Console Shared Data Experience (SDX) technologies form a secure and governed data lake backed by object storage (S3, ADLS, GCS) Flow Management Streams Messaging Streaming Analytics
  • 12. 12 What is NiFi used for?
  • 13. © 2020 Cloudera, Inc. All rights reserved. 13 Cloudera Flow Management Enable easy ingestion, routing, management and delivery of any data anywhere (Edge, cloud, data center) to any downstream system with built in end-to-end security and provenance ACQUIRE PROCESS DELIVER • Over 300 Prebuilt Processors • Easy to build your own • Parse, Enrich & Apply Schema • Filter, Split, Merger & Route • Throttle & Backpressure • Guaranteed Delivery • Full data provenance from acquisition to delivery • Diverse, Non-Traditional Sources • Eco-system integration Advanced tooling to industrialize flow development (Flow Development Life Cycle)
  • 14. © 2020 Cloudera, Inc. All rights reserved. 14 https://blog.cloudera.com/benchmarking-nifi-performance-and-scalability/ NiFi Processing Billions of Events
  • 15. © 2020 Cloudera, Inc. All rights reserved. 15 STREAMS MESSAGING / APACHE KAFKA Kafka Connect Support Simple Data Movement In/Out of Kafka Schema Registry Ranger Plugin Improved ACL and Audit for Kafka and Schema Registry Cruise Control Support Intelligent Kafka Cluster Rebalancing & Self Healing
  • 16. © 2020 Cloudera, Inc. All rights reserved. 16 Key Capabilities STREAMING ANALYTICS / APACHE FLINK Flink SQL Support Agile Streaming App Development using SQL Apache Flink Atlas Hook Capture operational Flink app metadata and lineage Single View of Flink Yarn Jobs Improve Developer Experience & operational visibility
  • 17. Demo
  • 18. 18 Edge AI to Cloud Streaming Pipeline Device Data SensorsEnergy Logs Weather Sensors Aggregates Energy SQL Analytics MiNiFi Agent Deep Learning Classification Edge Private Cloud Multi-Public Cloud
  • 19. If You Missed This Live
  • 21. © 2020 Cloudera, Inc. All rights reserved. 21 MiNiFi Java Agent ● Reads Sensor Logs ● OpenVino NCC2 AI ● Reads Images ● Sends to NiFi Gateway Apache NiFi Gateway processors, validates, transforms, cleans, routes and streams events for additional processing through Apache Kafka topics. MiNiFi Agents Running Deep Learning Classification And Sending Images and Results to NiFi Gateways
  • 22. 22© 2020 Cloudera, Inc. All rights reserved. FLINK SQL DEMO - DML - CATALOG AND TABLES SHOW catalogs; USE CATALOG registry; SHOW tables; SELECT * FROM energy;
  • 23. © 2020 Cloudera, Inc. All rights reserved. 23 INSERT INTO global_sensor_events SELECT scada.uuid,scada.systemtime,scada.temperaturef, scada.pressure,scada.humidity,scada.lux,scada.proximity, scada.oxidising,scada.reducing,scada.nh3,scada.gasko, energy.`current`,energy.voltage,energy.`power`, energy.`total`,energy.fanstatus FROM energy, scada WHERE scada.systemtime = energy.systemtime; WHERE IS THE FLINK CODE!??!
  • 24. 24© 2020 Cloudera, Inc. All rights reserved. FLINK SQL DEMO
  • 25. 25© 2020 Cloudera, Inc. All rights reserved.
  • 26. 26© 2020 Cloudera, Inc. All rights reserved. FLINK SQL DEMO - FLINK DASHBOARD
  • 27. 27© 2020 Cloudera, Inc. All rights reserved. FLINK SQL DEMO
  • 28. 28© 2020 Cloudera, Inc. All rights reserved. FLINK SQL DEMO
  • 29. © 2020 Cloudera, Inc. All rights reserved. 29 {"uuid": "rpi4_uuid_jfx_20200826203733", "amplitude100": 1.2, "amplitude500": 0.6, "amplitude1000": 0.3, "lownoise": 0.6, "midnoise": 0.2, "highnoise": 0.2, "amps": 0.3, "ipaddress": "192.168.1.76", "host": "rp4", "host_name": "rp4", "macaddress": "6e:37:12:08:63:e1", "systemtime": "08/26/2020 16:37:34", "endtime": "1598474254.75", "runtime": "28179.03", "starttime": "08/26/2020 08:47:54", "cpu": 48.3, "cpu_temp": "72.0", "diskusage": "40219.3 MB", "memory": 24.3, "id": "20200826203733_28ce9520-6832-4f80-b17d-f36c21fd8fc9", "temperature": "47.2", "adjtemp": "35.8", "adjtempf": "76.4", "temperaturef": "97.0", "pressure": 1010.0, "humidity": 8.3, "lux": 67.4, "proximity": 0, "oxidising": 77.9, "reducing": 184.6, "nh3": 144.7, "gasKO": "Oxidising: 77913.04 OhmsnReducing: 184625.00 OhmsnNH3: 144651.47 Ohms"} SHOW ME THE DATA
  • 30. © 2020 Cloudera, Inc. All rights reserved. 30 BME280 - temperature, pressure, humidity sensor LTR-559 - light and proximity sensor MICS6814 - analog gas sensor ADS1015 ADC MEMS - microphone 0.96-inch, 160 x 80 color LCD WHERE DID THAT DATA COME FROM?
  • 31. © 2020 Cloudera, Inc. All rights reserved. 31
  • 32. © 2020 Cloudera, Inc. All rights reserved. 32
  • 33. © 2020 Cloudera, Inc. All rights reserved. 33
  • 34. © 2020 Cloudera, Inc. All rights reserved. 34
  • 35. © 2020 Cloudera, Inc. All rights reserved. 35
  • 36. © 2020 Cloudera, Inc. All rights reserved. 36
  • 37. © 2020 Cloudera, Inc. All rights reserved. 37
  • 39. 39© 2020 Cloudera, Inc. All rights reserved. DEMO SOURCE CODE ● https://github.com/tspannhw/FlinkForwardGlobal2020 ● https://github.com/tspannhw/ApacheConAtHome2020 ● https://github.com/tspannhw/minifi-xaviernx ● https://github.com/tspannhw/minifi-jetson-nano ● https://github.com/tspannhw/minifi-enviroplus The code, build scripts, schemas, table DDL, Flink SQL, Kafka Connect configuration, NiFi flows, HBase tables, Kudu tables, Hive tables, HDFS directories, alerts, images, HTML, docs, links and all the goodies are here. Please fork and contribute.
  • 40. 40© 2020 Cloudera, Inc. All rights reserved. FLINK SQL LINKS ● https://github.com/cloudera/flink-tutorials ● https://docs.cloudera.com/csa/1.1.0/job-lifecycle/topics/csa-run-job.html ● https://github.com/asdaraujo/edge2ai-workshop/tree/master/flink ● https://ci.apache.org/projects/flink/flink-docs-stable/dev/table/sqlClient.html ● https://ci.apache.org/projects/flink/flink-docs-release-1.10/dev/table/sqlClient.html ● https://github.com/tspannhw/FlinkSQLDemo ● https://www.datainmotion.dev/2020/05/flank-low-code-streaming-populating.html ● https://github.com/tspannhw/meetup-sensors/tree/master/flink-sql ● https://www.datainmotion.dev/2020/05/flink-sql-preview.html
  • 41. 41© 2020 Cloudera, Inc. All rights reserved. Flink SQL LINKS ● https://docs.cloudera.com/csa/1.2.0/flink-sql-table-api/topics/csa-kafka-regi stry-avro.html ● https://github.com/tspannhw/ApacheConAtHome2020/ ● https://github.com/tspannhw/FlinkSQLDemo ● https://github.com/tspannhw/meetup-sensors/tree/main/flink-sql ● https://github.com/cloudera/flink-tutorials/tree/master/flink-sql-tutorial
  • 42. © 2020 Cloudera, Inc. All rights reserved. 42 TH N Y U