Time Series Analysis: DataFlow

Timothy Spann
Timothy SpannDeveloper Advocate
Time Series Analysis: DataFlow
Timothy Spann
Principal DataFlow Field Engineer
@PaasDev
2© 2020 Cloudera, Inc. All rights reserved.
© 2020 Cloudera, Inc. All rights reserved. 3
Welcome to Future of Data - Princeton
@PaasDev
https://www.meetup.com/futureofdata-princeton/
From Big Data to AI to Streaming to Containers to
Cloud to Analytics to Cloud Storage to Fast Data to
Machine Learning to Microservices to ...
© 2020 Cloudera, Inc. All rights reserved. 4
Welcome to Future of Data - New York
https://www.meetup.com/futureofdata-newyork/
From Big Data to AI to Streaming to Containers to
Cloud to Analytics to Cloud Storage to Fast Data to
Machine Learning to Microservices to ...
© 2020 Cloudera, Inc. All rights reserved. 5
Welcome to Future of Data - Philadelphia
@futureofdataphl
https://www.meetup.com/futureofdata-philadelphia/
From Big Data to AI to Streaming to Containers to
Cloud to Analytics to Cloud Storage to Fast Data to
Machine Learning to K8 to ...
© 2020 Cloudera, Inc. All rights reserved. 6
Meetup Presenter 1
Who am I?
Principal DataFlow Field Engineer
@PaasDev
DZone Zone Leader and Big Data MVB;
Princeton NJ Future of Data Meetup;
ex-Pivotal Field Engineer;
Apache Kafka, Tensorflow, Apache Spark RefCards
https://github.com/tspannhw https://www.datainmotion.dev/
https://dzone.com/users/297029/bunkertor.html
© 2019 Cloudera, Inc. All rights reserved. 7
CLOUDERA DATA PLATFORM
World’s first enterprise data cloud
© 2020 Cloudera, Inc. All rights reserved. 8
THE ENTERPRISE DATA CLOUD
COMPONENTS
Traditional Platform Consumption:
• Data Hub Clusters
New analytic experiences:
• Data Warehouse
• Machine Learning
• More to come
Control Plane services:
• Workload Manager
• Replication Manager
• Data Catalog
• Management Console
© 2020 Cloudera, Inc. All rights reserved. 9
CLOUDERA - THE ENTERPRISE DATA COMPANY
STREAMING &
DATA FLOW
DATA
WAREHOUSE
VISUAL
APPLICATIONS
Collect Report Enable
SECURITY | GOVERNANCE | LINEAGE | MANAGEMENT | AUTOMATION
DATA
ENGINEERING
Enrich
MACHINE
LEARNING & AI
Predict
© 2020 Cloudera, Inc. All rights reserved. 10
CSA
© 2020 Cloudera, Inc. All rights reserved. 11
Streaming Analytics Powered by Apache Flink
12© 2020 Cloudera, Inc. All rights reserved.
Buffer Analyze Connect
Modern AppsApache Kafka
gateway-west-
raw-sensors
gateway-central-
raw-sensors
gateway-east-
raw-sensors
C++ agent
US-Central Plants
C++ agent
US-East Plants
Data-at-Rest
opDB
CDP-DC
S3 Azure
Streaming
Analytics Powered
by FLINK
Microservices by
KAFKA Streams
Ingest Gateway
Powered by Kafka
US-West Plants
C++ agent
Model Scoring
Powered by CML
Data-at-Rest
Cloud Storage
Data Lake
Operational Stores
Data Collection
at the Edge
Collect
Apache NiFi & MiNiFi
Distribute
Apache NiFi
Data Flow Apps
Powered by NIFI
A DATA-IN-MOTION REFERENCE ARCHITECTURE
© 2020 Cloudera, Inc. All rights reserved. 13
SQL & Table API
● Unified APIs for streaming data and data at rest
○ Run the same query on batch and streaming data
○ ANSI SQL: No stream-specific syntax or semantics!
○ Many common stream analytics use cases supported
SELECT
userId,
COUNT(*) AS cnt
SESSION_START(clicktime, INTERVAL '30' MINUTE)
FROM clicks
GROUP BY
SESSION(clicktime, INTERVAL '30' MINUTE),
userId
Count clicks per user and session (defined
by 30 min. gap of inactivity).
Quick Flink SQL Demo Preview
FLaNK Stack https://github.com/tspannhw/MmFLaNK
https://www.datainmotion.dev/2019/11/introducing-mm-flank-apache-flink-stack.html
SELECT * FROM sensors;
© 2020 Cloudera, Inc. All rights reserved. 16
CFM
CSM
© 2020 Cloudera, Inc. All rights reserved. 17
CDF
The Active Data Warehouse with Apache Kudu
IOT Devices
Applications
Metrics
Logs & Files
HDFS/
Object Storage
Hot Storage
Cold Storage
SQL Real-Time
Analytics
Alerting
Event Driven
Applications
Dashboards
Authorization Audit & LineageAuthentication
Kerberos
Encryption
NavEncrypt
Sensor Data
https://www.datainmotion.dev/2020/04/predicting-sensor-readings-with-time.html
Sensors
● BME280 temperature, pressure, humidity sensor
● LTR-559 light and proximity sensor
● MICS6814 analog gas sensor
● ADS1015 ADC
● MEMS microphone
● 0.96-inch, 160 x 80 color LCD
Sensor Data - Edge
Sensor Data - Hydrate Date Lakes
Sensor Data - Example Row
{"uuid": "rpi4_uuid_omi_20200417211935", "amplitude100": 0.3, "amplitude500": 0.1, "amplitude1000": 0.1,
"lownoise": 0.1, "midnoise": 0.1, "highnoise": 0.1, "amps": 0.3, "ipaddress": "192.168.1.243", "host": "rp4",
"host_name": "rp4", "macaddress": "dc:a6:32:03:a6:e9", "systemtime": "04/17/2020 17:19:36", "endtime":
"1587158376.22", "runtime": "36.47", "starttime": "04/17/2020 17:18:58", "cpu": 0.0, "cpu_temp": "59.0",
"diskusage": "46651.6 MB", "memory": 6.3, "id": "20200417211935_7b7ae5da-905b-418b-94f1-270a15dbc1df",
"temperature": "38.7", "adjtemp": "29.7", "adjtempf": "65.5", "temperaturef": "81.7", "pressure": 1015.6,
"humidity": 6.8, "lux": 1.2, "proximity": 0, "oxidising": 8.3, "reducing": 306.4, "nh3": 129.5, "gasKO": "Oxidising:
8300.63 OhmsnReducing: 306352.94 OhmsnNH3: 129542.17 Ohms"}
Sensor Ingest Demo
Data Science Up Next
Time Series Analysis:  DataFlow
© 2020 Cloudera, Inc. All rights reserved. 26
LINKS
LINKS
● https://www.datainmotion.dev/2019/12/iot-series-minifi-agent-on-raspberry-pi.html
● https://learn.pimoroni.com/tutorial/sandyj/getting-started-with-enviro-plus
● https://github.com/tspannhw/meetup-sensors/
● https://github.com/tspannhw/ClouderaFlowManagementWorkshop
● https://github.com/tspannhw/minifi-enviroplus
● https://github.com/tspannhw/minifi-movidius-electric
● https://github.com/tspannhw/table-ddl
TH N Y U
1 of 28

Recommended

Building Real-Time Travel Alerts by
Building Real-Time Travel AlertsBuilding Real-Time Travel Alerts
Building Real-Time Travel AlertsTimothy Spann
165 views48 slides
JConWorld_ Continuous SQL with Kafka and Flink by
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkTimothy Spann
156 views36 slides
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines by
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data PipelinesTimothy Spann
150 views25 slides
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo by
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoEvolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoTimothy Spann
162 views8 slides
CoC23_ Looking at the New Features of Apache NiFi by
CoC23_ Looking at the New Features of Apache NiFiCoC23_ Looking at the New Features of Apache NiFi
CoC23_ Looking at the New Features of Apache NiFiTimothy Spann
36 views24 slides
CoC23_ Let’s Monitor The Conditions at the Conference by
CoC23_ Let’s Monitor The Conditions at the ConferenceCoC23_ Let’s Monitor The Conditions at the Conference
CoC23_ Let’s Monitor The Conditions at the ConferenceTimothy Spann
17 views17 slides

More Related Content

More from Timothy Spann

The Never Landing Stream with HTAP and Streaming by
The Never Landing Stream with HTAP and StreamingThe Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and StreamingTimothy Spann
254 views39 slides
Meetup - Brasil - Data In Motion - 2023 September 19 by
Meetup - Brasil - Data In Motion - 2023 September 19Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19Timothy Spann
319 views33 slides
Implement a Universal Data Distribution Architecture to Manage All Streaming ... by
Implement a Universal Data Distribution Architecture to Manage All Streaming ...Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Implement a Universal Data Distribution Architecture to Manage All Streaming ...Timothy Spann
28 views56 slides
Building Real-time Pipelines with FLaNK_ A Case Study with Transit Data by
Building Real-time Pipelines with FLaNK_ A Case Study with Transit DataBuilding Real-time Pipelines with FLaNK_ A Case Study with Transit Data
Building Real-time Pipelines with FLaNK_ A Case Study with Transit DataTimothy Spann
193 views45 slides
big data fest building modern data streaming apps by
big data fest building modern data streaming appsbig data fest building modern data streaming apps
big data fest building modern data streaming appsTimothy Spann
317 views55 slides
Using Apache NiFi with Apache Pulsar for Fast Data On-Ramp by
Using Apache NiFi with Apache Pulsar for Fast Data On-RampUsing Apache NiFi with Apache Pulsar for Fast Data On-Ramp
Using Apache NiFi with Apache Pulsar for Fast Data On-RampTimothy Spann
163 views27 slides

More from Timothy Spann(20)

The Never Landing Stream with HTAP and Streaming by Timothy Spann
The Never Landing Stream with HTAP and StreamingThe Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and Streaming
Timothy Spann254 views
Meetup - Brasil - Data In Motion - 2023 September 19 by Timothy Spann
Meetup - Brasil - Data In Motion - 2023 September 19Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19
Timothy Spann319 views
Implement a Universal Data Distribution Architecture to Manage All Streaming ... by Timothy Spann
Implement a Universal Data Distribution Architecture to Manage All Streaming ...Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Timothy Spann28 views
Building Real-time Pipelines with FLaNK_ A Case Study with Transit Data by Timothy Spann
Building Real-time Pipelines with FLaNK_ A Case Study with Transit DataBuilding Real-time Pipelines with FLaNK_ A Case Study with Transit Data
Building Real-time Pipelines with FLaNK_ A Case Study with Transit Data
Timothy Spann193 views
big data fest building modern data streaming apps by Timothy Spann
big data fest building modern data streaming appsbig data fest building modern data streaming apps
big data fest building modern data streaming apps
Timothy Spann317 views
Using Apache NiFi with Apache Pulsar for Fast Data On-Ramp by Timothy Spann
Using Apache NiFi with Apache Pulsar for Fast Data On-RampUsing Apache NiFi with Apache Pulsar for Fast Data On-Ramp
Using Apache NiFi with Apache Pulsar for Fast Data On-Ramp
Timothy Spann163 views
OSSNA Building Modern Data Streaming Apps by Timothy Spann
OSSNA Building Modern Data Streaming AppsOSSNA Building Modern Data Streaming Apps
OSSNA Building Modern Data Streaming Apps
Timothy Spann155 views
GSJUG: Mastering Data Streaming Pipelines 09May2023 by Timothy Spann
GSJUG: Mastering Data Streaming Pipelines 09May2023GSJUG: Mastering Data Streaming Pipelines 09May2023
GSJUG: Mastering Data Streaming Pipelines 09May2023
Timothy Spann255 views
BestInFlowCompetitionTutorials03May2023 by Timothy Spann
BestInFlowCompetitionTutorials03May2023BestInFlowCompetitionTutorials03May2023
BestInFlowCompetitionTutorials03May2023
Timothy Spann11 views
Cloudera Sandbox Event Guidelines For Workflow by Timothy Spann
Cloudera Sandbox Event Guidelines For WorkflowCloudera Sandbox Event Guidelines For Workflow
Cloudera Sandbox Event Guidelines For Workflow
Timothy Spann32 views
Meet the Committers Webinar_ Lab Preparation by Timothy Spann
Meet the Committers Webinar_ Lab PreparationMeet the Committers Webinar_ Lab Preparation
Meet the Committers Webinar_ Lab Preparation
Timothy Spann32 views
Best Practices For Workflow by Timothy Spann
Best Practices For WorkflowBest Practices For Workflow
Best Practices For Workflow
Timothy Spann89 views
Meetup: Streaming Data Pipeline Development by Timothy Spann
Meetup:  Streaming Data Pipeline DevelopmentMeetup:  Streaming Data Pipeline Development
Meetup: Streaming Data Pipeline Development
Timothy Spann337 views
DevNexus: Apache Pulsar Development 101 with Java by Timothy Spann
DevNexus:  Apache Pulsar Development 101 with JavaDevNexus:  Apache Pulsar Development 101 with Java
DevNexus: Apache Pulsar Development 101 with Java
Timothy Spann261 views
Conf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices by Timothy Spann
Conf42 Python_ ML Enhanced Event Streaming Apps with Python MicroservicesConf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices
Conf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices
Timothy Spann443 views
ITPC Building Modern Data Streaming Apps by Timothy Spann
ITPC Building Modern Data Streaming AppsITPC Building Modern Data Streaming Apps
ITPC Building Modern Data Streaming Apps
Timothy Spann797 views
PythonWebConference_ Cloud Native Apache Pulsar Development 202 with Python by Timothy Spann
PythonWebConference_ Cloud Native Apache Pulsar Development 202 with PythonPythonWebConference_ Cloud Native Apache Pulsar Development 202 with Python
PythonWebConference_ Cloud Native Apache Pulsar Development 202 with Python
Timothy Spann430 views
PhillyJug Getting Started With Real-time Cloud Native Streaming With Java by Timothy Spann
PhillyJug  Getting Started With Real-time Cloud Native Streaming With JavaPhillyJug  Getting Started With Real-time Cloud Native Streaming With Java
PhillyJug Getting Started With Real-time Cloud Native Streaming With Java
Timothy Spann625 views
Why Spring Belongs In Your Data Stream (From Edge to Multi-Cloud) by Timothy Spann
Why Spring Belongs In Your Data Stream (From Edge to Multi-Cloud)Why Spring Belongs In Your Data Stream (From Edge to Multi-Cloud)
Why Spring Belongs In Your Data Stream (From Edge to Multi-Cloud)
Timothy Spann18 views

Recently uploaded

Data about the sector workshop by
Data about the sector workshopData about the sector workshop
Data about the sector workshopinfo828217
29 views27 slides
Applied physics letters journal.pdf by
Applied physics letters journal.pdfApplied physics letters journal.pdf
Applied physics letters journal.pdfaqsamukhtiyar88
5 views8 slides
Product Research sample.pdf by
Product Research sample.pdfProduct Research sample.pdf
Product Research sample.pdfAllenSingson
35 views29 slides
Infomatica-MDM.pptx by
Infomatica-MDM.pptxInfomatica-MDM.pptx
Infomatica-MDM.pptxKapil Rangwani
12 views16 slides
Oral presentation.pdf by
Oral presentation.pdfOral presentation.pdf
Oral presentation.pdfreemalmazroui8
5 views10 slides
Dr. Ousmane Badiane-2023 ReSAKSS Conference by
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceDr. Ousmane Badiane-2023 ReSAKSS Conference
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceAKADEMIYA2063
5 views34 slides

Recently uploaded(20)

Data about the sector workshop by info828217
Data about the sector workshopData about the sector workshop
Data about the sector workshop
info82821729 views
Product Research sample.pdf by AllenSingson
Product Research sample.pdfProduct Research sample.pdf
Product Research sample.pdf
AllenSingson35 views
Dr. Ousmane Badiane-2023 ReSAKSS Conference by AKADEMIYA2063
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceDr. Ousmane Badiane-2023 ReSAKSS Conference
Dr. Ousmane Badiane-2023 ReSAKSS Conference
AKADEMIYA20635 views
PyData Global 2022 - Things I learned while running neural networks on microc... by SARADINDU SENGUPTA
PyData Global 2022 - Things I learned while running neural networks on microc...PyData Global 2022 - Things I learned while running neural networks on microc...
PyData Global 2022 - Things I learned while running neural networks on microc...
4_4_WP_4_06_ND_Model.pptx by d6fmc6kwd4
4_4_WP_4_06_ND_Model.pptx4_4_WP_4_06_ND_Model.pptx
4_4_WP_4_06_ND_Model.pptx
d6fmc6kwd47 views
Listed Instruments Survey 2022.pptx by secretariat4
Listed Instruments Survey  2022.pptxListed Instruments Survey  2022.pptx
Listed Instruments Survey 2022.pptx
secretariat4130 views
K-Drama Recommendation Using Python by FridaPutriassa
K-Drama Recommendation Using PythonK-Drama Recommendation Using Python
K-Drama Recommendation Using Python
FridaPutriassa7 views
Pydata Global 2023 - How can a learnt model unlearn something by SARADINDU SENGUPTA
Pydata Global 2023 - How can a learnt model unlearn somethingPydata Global 2023 - How can a learnt model unlearn something
Pydata Global 2023 - How can a learnt model unlearn something
LIVE OAK MEMORIAL PARK.pptx by ms2332always
LIVE OAK MEMORIAL PARK.pptxLIVE OAK MEMORIAL PARK.pptx
LIVE OAK MEMORIAL PARK.pptx
ms2332always8 views
Running PostgreSQL in a Kubernetes cluster: CloudNativePG by Nick Ivanov
Running PostgreSQL in a Kubernetes cluster: CloudNativePGRunning PostgreSQL in a Kubernetes cluster: CloudNativePG
Running PostgreSQL in a Kubernetes cluster: CloudNativePG
Nick Ivanov7 views
Best Home Security Systems.pptx by mogalang
Best Home Security Systems.pptxBest Home Security Systems.pptx
Best Home Security Systems.pptx
mogalang9 views
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language... by patiladiti752
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language...Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language...
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language...
patiladiti7528 views

Time Series Analysis: DataFlow

  • 1. Time Series Analysis: DataFlow Timothy Spann Principal DataFlow Field Engineer @PaasDev
  • 2. 2© 2020 Cloudera, Inc. All rights reserved.
  • 3. © 2020 Cloudera, Inc. All rights reserved. 3 Welcome to Future of Data - Princeton @PaasDev https://www.meetup.com/futureofdata-princeton/ From Big Data to AI to Streaming to Containers to Cloud to Analytics to Cloud Storage to Fast Data to Machine Learning to Microservices to ...
  • 4. © 2020 Cloudera, Inc. All rights reserved. 4 Welcome to Future of Data - New York https://www.meetup.com/futureofdata-newyork/ From Big Data to AI to Streaming to Containers to Cloud to Analytics to Cloud Storage to Fast Data to Machine Learning to Microservices to ...
  • 5. © 2020 Cloudera, Inc. All rights reserved. 5 Welcome to Future of Data - Philadelphia @futureofdataphl https://www.meetup.com/futureofdata-philadelphia/ From Big Data to AI to Streaming to Containers to Cloud to Analytics to Cloud Storage to Fast Data to Machine Learning to K8 to ...
  • 6. © 2020 Cloudera, Inc. All rights reserved. 6 Meetup Presenter 1 Who am I? Principal DataFlow Field Engineer @PaasDev DZone Zone Leader and Big Data MVB; Princeton NJ Future of Data Meetup; ex-Pivotal Field Engineer; Apache Kafka, Tensorflow, Apache Spark RefCards https://github.com/tspannhw https://www.datainmotion.dev/ https://dzone.com/users/297029/bunkertor.html
  • 7. © 2019 Cloudera, Inc. All rights reserved. 7 CLOUDERA DATA PLATFORM World’s first enterprise data cloud
  • 8. © 2020 Cloudera, Inc. All rights reserved. 8 THE ENTERPRISE DATA CLOUD COMPONENTS Traditional Platform Consumption: • Data Hub Clusters New analytic experiences: • Data Warehouse • Machine Learning • More to come Control Plane services: • Workload Manager • Replication Manager • Data Catalog • Management Console
  • 9. © 2020 Cloudera, Inc. All rights reserved. 9 CLOUDERA - THE ENTERPRISE DATA COMPANY STREAMING & DATA FLOW DATA WAREHOUSE VISUAL APPLICATIONS Collect Report Enable SECURITY | GOVERNANCE | LINEAGE | MANAGEMENT | AUTOMATION DATA ENGINEERING Enrich MACHINE LEARNING & AI Predict
  • 10. © 2020 Cloudera, Inc. All rights reserved. 10 CSA
  • 11. © 2020 Cloudera, Inc. All rights reserved. 11 Streaming Analytics Powered by Apache Flink
  • 12. 12© 2020 Cloudera, Inc. All rights reserved. Buffer Analyze Connect Modern AppsApache Kafka gateway-west- raw-sensors gateway-central- raw-sensors gateway-east- raw-sensors C++ agent US-Central Plants C++ agent US-East Plants Data-at-Rest opDB CDP-DC S3 Azure Streaming Analytics Powered by FLINK Microservices by KAFKA Streams Ingest Gateway Powered by Kafka US-West Plants C++ agent Model Scoring Powered by CML Data-at-Rest Cloud Storage Data Lake Operational Stores Data Collection at the Edge Collect Apache NiFi & MiNiFi Distribute Apache NiFi Data Flow Apps Powered by NIFI A DATA-IN-MOTION REFERENCE ARCHITECTURE
  • 13. © 2020 Cloudera, Inc. All rights reserved. 13 SQL & Table API ● Unified APIs for streaming data and data at rest ○ Run the same query on batch and streaming data ○ ANSI SQL: No stream-specific syntax or semantics! ○ Many common stream analytics use cases supported SELECT userId, COUNT(*) AS cnt SESSION_START(clicktime, INTERVAL '30' MINUTE) FROM clicks GROUP BY SESSION(clicktime, INTERVAL '30' MINUTE), userId Count clicks per user and session (defined by 30 min. gap of inactivity).
  • 14. Quick Flink SQL Demo Preview
  • 16. © 2020 Cloudera, Inc. All rights reserved. 16 CFM CSM
  • 17. © 2020 Cloudera, Inc. All rights reserved. 17 CDF The Active Data Warehouse with Apache Kudu IOT Devices Applications Metrics Logs & Files HDFS/ Object Storage Hot Storage Cold Storage SQL Real-Time Analytics Alerting Event Driven Applications Dashboards Authorization Audit & LineageAuthentication Kerberos Encryption NavEncrypt
  • 19. Sensors ● BME280 temperature, pressure, humidity sensor ● LTR-559 light and proximity sensor ● MICS6814 analog gas sensor ● ADS1015 ADC ● MEMS microphone ● 0.96-inch, 160 x 80 color LCD
  • 21. Sensor Data - Hydrate Date Lakes
  • 22. Sensor Data - Example Row {"uuid": "rpi4_uuid_omi_20200417211935", "amplitude100": 0.3, "amplitude500": 0.1, "amplitude1000": 0.1, "lownoise": 0.1, "midnoise": 0.1, "highnoise": 0.1, "amps": 0.3, "ipaddress": "192.168.1.243", "host": "rp4", "host_name": "rp4", "macaddress": "dc:a6:32:03:a6:e9", "systemtime": "04/17/2020 17:19:36", "endtime": "1587158376.22", "runtime": "36.47", "starttime": "04/17/2020 17:18:58", "cpu": 0.0, "cpu_temp": "59.0", "diskusage": "46651.6 MB", "memory": 6.3, "id": "20200417211935_7b7ae5da-905b-418b-94f1-270a15dbc1df", "temperature": "38.7", "adjtemp": "29.7", "adjtempf": "65.5", "temperaturef": "81.7", "pressure": 1015.6, "humidity": 6.8, "lux": 1.2, "proximity": 0, "oxidising": 8.3, "reducing": 306.4, "nh3": 129.5, "gasKO": "Oxidising: 8300.63 OhmsnReducing: 306352.94 OhmsnNH3: 129542.17 Ohms"}
  • 26. © 2020 Cloudera, Inc. All rights reserved. 26 LINKS
  • 27. LINKS ● https://www.datainmotion.dev/2019/12/iot-series-minifi-agent-on-raspberry-pi.html ● https://learn.pimoroni.com/tutorial/sandyj/getting-started-with-enviro-plus ● https://github.com/tspannhw/meetup-sensors/ ● https://github.com/tspannhw/ClouderaFlowManagementWorkshop ● https://github.com/tspannhw/minifi-enviroplus ● https://github.com/tspannhw/minifi-movidius-electric ● https://github.com/tspannhw/table-ddl
  • 28. TH N Y U