In this talk, we will walk you through the simple steps to build and deploy machine learning for sentiment analysis and YOLO object detection as part of an IoT application that starts from devices collecting sensor data and camera images with MiNiFi. This data is streamed to Apache NiFi which integrates with Cloudera Data Science Workbench for classification with models in real-time as part of the real-time event stream. We parse, filter, fork, sort, query with SQL, dissect, enrich, transform, join and aggregate data as it is ingested.
The data is landed in a Cloudera data store in the cloud for batch and interactive analytics with Spark, Hive, Phoenix, HBase, Druid, Kudu and Impala.
Reference:
https://blog.cloudera.com/blog/2019/02/integrating-machine-learning-models-into-your-big-data-pipelines-in-real-time-with-no-coding/
https://community.hortonworks.com/content/kbentry/239858/integrating-machine-learning-models-into-your-big.html
https://community.hortonworks.com/articles/239961/using-cloudera-data-science-workbench-with-apache.html
Speaker
Timothy Spann, Senior Solutions Engineer
Cloudera
8. Speed of Data Model Training Model Scoring Use Case
Batch
Batch
Batch
Batch Reporting,
Analytics,
Applications
Online
DS Applications/
Interactive
Dashboards
Streaming
In-stream
Streaming
Applications
Incremental/Online In-stream
Streaming
Applications
Training, Scoring and Monitoring