Streaming Analytics for IoT-Oriented Applications
Adrian Bowles, PhD

Founder, STORM Insights, Inc.

Lead Analyst, AI, Aragon Research

info@storminsights.com
MAY 11, 2017
Context - New Data, New Demands, New Opportunities
IOT: Impact, Infrastructure & Implications
Streaming Analytics - What, How, and Why Now?
Open Source Strategies
Vendor Solutions
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
AGENDA: STREAMING ANALYTICS FOR IOT-ORIENTED APPLICATIONS
When everything is connected…

New sources of data emerge

New sources of value emerge

Old assumptions must be challenged
THE IMPACT OF THE IOT
Copyright (c) 2014 by Umbrellium Ltd.
SENSORS ARE EVERYWHERE
DATA ON THE MOVE
Today:
Delta Air Lines processes 5,000,000 business events per day
Pratt & Whitney jet engine: 5,000 sensors producing 10 GB/s per engine
Formula 1 car sensors produce about 1.2 GB/s
and we need to predict the future…
AS THE SCOPE CHANGES, SO MUST THE SOLUTIONS
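To put the figures on this slide on a common footing, here is a minimal arithmetic sketch. It assumes the business events are spread evenly over 24 hours and that the sensor rates are sustained for a full hour of operation; neither assumption comes from the slide, and the function names are illustrative only.

```python
SECONDS_PER_HOUR = 3600
SECONDS_PER_DAY = 24 * SECONDS_PER_HOUR


def events_per_second(events_per_day: float) -> float:
    """Average event rate, assuming events are spread evenly over the day."""
    return events_per_day / SECONDS_PER_DAY


def terabytes_per_hour(gigabytes_per_second: float) -> float:
    """Data volume for one hour of continuous output (1 TB = 1000 GB)."""
    return gigabytes_per_second * SECONDS_PER_HOUR / 1000


if __name__ == "__main__":
    # Figures quoted on the slide; the even-spread and sustained-rate
    # assumptions are ours.
    print(f"Business events: {events_per_second(5_000_000):.1f} per second")
    print(f"Jet engine:      {terabytes_per_hour(10):.1f} TB per hour")
    print(f"Formula 1 car:   {terabytes_per_hour(1.2):.2f} TB per hour")
```

Under those assumptions, the event stream averages roughly 58 events per second, while a single engine would produce tens of terabytes per hour, which is why storing everything first and analyzing later becomes impractical.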
2-TIER IOT ARCHITECTURE
Sensors/Devices
Compute Engine
Data Center

Cloud

Cluster

Network
3-TIER IOT ARCHITECTURE
Sensors/Devices
Gateways
Compute Engine
Data Center

Cloud

Cluster

Network
3-TIER IOT ARCHITECTURE ENABLES DISTRIBUTED ANALYTICS
Sensors/Devices
Compute Engine
Data Center

Cloud

Cluster

Network
Compute Engines
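One way to picture distributed analytics in a 3-tier layout is a gateway that summarizes raw sensor readings and forwards only the summaries upstream. The sketch below is a hypothetical illustration, not a description of any particular gateway product; `WindowSummary` and `summarize_at_gateway` are names invented here.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List


@dataclass
class WindowSummary:
    count: int
    minimum: float
    maximum: float
    mean: float


def _summarize(window: List[float]) -> WindowSummary:
    return WindowSummary(len(window), min(window), max(window),
                         sum(window) / len(window))


def summarize_at_gateway(readings: Iterable[float],
                         window_size: int) -> Iterator[WindowSummary]:
    """Yield one summary per fixed-size window instead of every raw reading."""
    window: List[float] = []
    for value in readings:
        window.append(value)
        if len(window) == window_size:
            yield _summarize(window)
            window = []
    if window:  # flush the final, partially filled window
        yield _summarize(window)


if __name__ == "__main__":
    for summary in summarize_at_gateway([1.0, 2.0, 3.0, 4.0, 5.0], 2):
        print(summary)
```

With a window of 2, five readings leave the gateway as three summaries, so the network and the data center see less traffic while the gateway does part of the analysis.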
3-TIER IOT ARCHITECTURE ENABLES DISTRIBUTED INTELLIGENCE & ANALYTICS
Sensors/Devices
Train the Deep Learning Model
Data Center

Cloud

Cluster

Network
Compress & Run the DL Model
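The slide does not say how the model is compressed. As one possible illustration, the sketch below applies simple 8-bit linear quantization to a list of weights, a common way to shrink a trained model before running it on a smaller device. The function names and the choice of technique are assumptions made here.

```python
from typing import List, Tuple


def quantize(weights: List[float]) -> Tuple[List[int], float]:
    """Map a non-empty list of float weights to integers in [-127, 127].

    Returns the integer weights plus the single scale factor needed to
    approximately recover the original values.
    """
    largest = max(abs(w) for w in weights)
    scale = largest / 127 if largest else 1.0
    return [round(w / scale) for w in weights], scale


def dequantize(quantized: List[int], scale: float) -> List[float]:
    """Approximate the original weights from their quantized form."""
    return [q * scale for q in quantized]


if __name__ == "__main__":
    trained = [0.5, -1.0, 0.25]          # stand-in for weights trained centrally
    small, scale = quantize(trained)     # compact form shipped to the edge
    print(small, dequantize(small, scale))
```

Each weight now fits in one byte instead of four or eight, at the cost of a rounding error of at most half the scale factor per weight.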
A ROLE FOR COLLABORATION AND STANDARDS
Predix
Jasper Control Center
Watson IoT
IOT PLATFORMS
ANALYTICS
DESCRIPTIVE: KNOWN VALUES
PREDICTIVE: FUTURE OR MISSING VALUES
PRESCRIPTIVE: WHAT TO DO
STORED DATA: STATIC
DIVERTED OR SAMPLED DATA: STOP AND FRISK
STREAMING DATA: IN MOTION
“You could not step twice into the same river.”
Heraclitus (c. 535 BC - c. 475 BC)
To understand the contents (analyze)…
Divert the flow?
Pool the data?
Evaluate everything without changing the flow?
Sample? (catch and release?)
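For the "sample" option, one standard technique is reservoir sampling, which keeps a fixed-size uniform random sample from a stream of unknown length without pooling the data. The slide does not name a technique; this is a minimal sketch of the classic single-pass algorithm, and `reservoir_sample` is a name chosen here.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(stream: Iterable[T], k: int,
                     rng: Optional[random.Random] = None) -> List[T]:
    """Return up to k items chosen uniformly at random from the stream."""
    rng = rng or random.Random()
    reservoir: List[T] = []
    for index, item in enumerate(stream):
        if index < k:
            reservoir.append(item)          # fill the reservoir first
        else:
            slot = rng.randint(0, index)    # inclusive on both ends
            if slot < k:
                reservoir[slot] = item      # replace with probability k/(index+1)
    return reservoir


if __name__ == "__main__":
    readings = iter(range(1_000_000))       # stand-in for a sensor stream
    print(reservoir_sample(readings, 5))
```

Memory use stays at k items however long the stream runs, but anything that happens between the sampled items is never seen.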
STREAMING DATA
DESCRIPTIVE ANALYTICS
PREDICTIVE ANALYTICS
STREAMING ANALYTICS: MOVE THE PROCESS TO THE DATA
STREAMING ANALYTICS: STATISTICAL ANALYSIS OF DATA IN MOTION
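As a small example of statistics computed on data in motion, the sketch below maintains a running mean and variance with Welford's online algorithm, updating on each value as it arrives and storing none of them. The class name `RunningStats` is illustrative; the deck does not prescribe a specific algorithm.

```python
from typing import Iterable


class RunningStats:
    """Mean and sample variance updated one value at a time."""

    def __init__(self) -> None:
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, value: float) -> None:
        self.count += 1
        delta = value - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (value - self.mean)

    @property
    def variance(self) -> float:
        """Sample variance; 0.0 until at least two values have arrived."""
        return self._m2 / (self.count - 1) if self.count > 1 else 0.0


def consume(stream: Iterable[float]) -> RunningStats:
    stats = RunningStats()
    for value in stream:
        stats.update(value)
    return stats


if __name__ == "__main__":
    stats = consume(iter([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]))
    print(stats.count, stats.mean, stats.variance)
```

The process moves to the data: each reading is examined once in passing, and only three numbers are kept as state.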
Data Sources
Store
Process/Transform
Key: Data flows on the edges, queries on the vertices
CONVENTIONAL DATA ARCHITECTURE
Data Sources
Store
Process/Transform
Observe
Key: Data flows on the edges, queries everywhere
Sampling vs. monitoring everything…
STREAMING DATA ARCHITECTURE
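The "queries everywhere" idea can be sketched as a pipeline in which an observation step inspects every record in flight and passes it on unchanged, so the query runs before anything is stored. This is a hypothetical illustration using Python generators; the function names and the temperature example are assumptions made here.

```python
from typing import Callable, Iterable, Iterator, List, Tuple


def observe(records: Iterable[float],
            query: Callable[[float], None]) -> Iterator[float]:
    """Run a query on every record as it passes, then hand it on unchanged."""
    for record in records:
        query(record)
        yield record


def to_fahrenheit(records: Iterable[float]) -> Iterator[float]:
    """Example process/transform step."""
    for celsius in records:
        yield celsius * 9 / 5 + 32


def run_pipeline(source: Iterable[float],
                 limit_celsius: float = 100.0) -> Tuple[List[float], List[float]]:
    """Return (stored records, alerts raised while the data was in motion)."""
    alerts: List[float] = []

    def check(celsius: float) -> None:
        if celsius > limit_celsius:
            alerts.append(celsius)

    stored = list(to_fahrenheit(observe(source, check)))
    return stored, alerts


if __name__ == "__main__":
    print(run_pipeline([20.0, 105.0, 30.0]))
```

The alert is raised on the edge between source and transform; the store at the end of the pipeline is no longer the only place a question can be asked.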
[Figure labels: 440, 880]
THE PROBLEM WITH SAMPLING…
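One well-known sampling problem is aliasing. Assuming the 440 and 880 labels refer to tone frequencies in Hz (the slide text does not say), the sketch below shows that an 880 Hz cosine sampled at 1320 Hz yields exactly the same samples as a 440 Hz cosine. The sample rates are chosen here for illustration.

```python
import math
from typing import List, Sequence


def sample_tone(frequency_hz: float, sample_rate_hz: float,
                n_samples: int) -> List[float]:
    """Sample a unit-amplitude cosine at the given rate."""
    return [math.cos(2 * math.pi * frequency_hz * n / sample_rate_hz)
            for n in range(n_samples)]


def looks_identical(a: Sequence[float], b: Sequence[float],
                    tolerance: float = 1e-9) -> bool:
    """True when two sample sequences match within the tolerance."""
    return len(a) == len(b) and all(abs(x - y) <= tolerance
                                    for x, y in zip(a, b))


if __name__ == "__main__":
    # Too slow for 880 Hz (needs more than 1760 samples/s): tones collide.
    print(looks_identical(sample_tone(440, 1320, 12),
                          sample_tone(880, 1320, 12)))
    # Fast enough for both tones: they can be told apart.
    print(looks_identical(sample_tone(440, 4000, 12),
                          sample_tone(880, 4000, 12)))
```

Once the samples are identical, no later analysis can recover which signal was actually present, which is the risk of sampling instead of monitoring everything.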
RELEVANT APACHE SOFTWARE FOUNDATION OPEN SOURCE PROJECTS
Apache Flink: “open-source stream processing framework for distributed, high-performing, always-available, and accurate data streaming applications.”
Apache Samza: “a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.”
Apache Storm: “a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.”
Apache Spark Streaming: “Spark Streaming brings Apache Spark's language-integrated API to stream processing, letting you write streaming jobs the same way you write batch jobs.”
Apache Apex: “Enterprise-grade unified stream and batch processing engine.”
Registered trademarks or trademarks of The Apache Software Foundation
Internode Communication
Distributed Cache
Indexed Results Store
Processing
Machine Learning
Visualization
Cluster Management
Data Collection
Data Delivery
Development
YARN
Courtesy of STRIIM
THE OPEN SOURCE ECOSYSTEM FOR ANALYTICS INFRASTRUCTURE
COMMERCIAL SOLUTIONS: PLATFORMS FOR STREAMING ANALYTICS AS A SERVICE
IBM Bluemix
Google Cloud Platform
Azure Stream Analytics
Oracle Stream Analytics
Key Trend: Streaming Analytics for IOT as a Service
STREAMING ANALYTICS AS A SERVICE
COMMERCIAL SOLUTIONS
SAP Event Stream Processor
TIBCO Event Analytics
Cisco Connected Streaming Analytics
SAS Event Stream Processing
STREAMING ANALYTICS - EMERGING VENDORS TO WATCH
Integrate Analyze
DATATORRENT
Source: DataTorrent.com
Multiple data sources
Hadoop (HDFS)
(Existing) ETL Jobs
Batch / High-Latency
Real-time / Low-Latency
ODS / EDW
Real-Time Applications
Legacy Applications
Spark
Hive
Big Data Applications
Users
CDC
Flume
Kafka
Logs
STRIIM
Integration & Intelligence
Source: STRIIM.com
STREAMANALYTIX
Source: streamanalytix.com
Data Sharpening
ZOOMDATA
Source: zoomdata.com
Do you have or can you capture streaming data that can increase your value proposition?
Data about your product that can improve performance, reliability, predictability…
Can you create value from new analysis of open data?
Adding your own data/algorithms to open data creates value.
Tips:
Start by evaluating the emerging open source de facto standards.
Choose an infrastructure that allows you to evaluate live streaming data in the context of relevant historical data.
It’s All About the Data
GETTING STARTED…
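As a small illustration of evaluating live data in the context of historical data, the sketch below scores each incoming reading against the mean and standard deviation of a stored history and flags the outliers. The z-score rule, the threshold of 3, and the name `flag_anomalies` are assumptions chosen for the example, not recommendations from the deck.

```python
from statistics import mean, stdev
from typing import Iterable, Iterator, Sequence, Tuple


def flag_anomalies(history: Sequence[float], live: Iterable[float],
                   threshold: float = 3.0) -> Iterator[Tuple[float, float]]:
    """Yield (value, z_score) for live readings far from the historical mean."""
    baseline = mean(history)
    spread = stdev(history)  # needs at least two historical values
    if spread == 0:
        raise ValueError("history has no variation; z-scores are undefined")
    for value in live:
        z_score = (value - baseline) / spread
        if abs(z_score) > threshold:
            yield value, z_score


if __name__ == "__main__":
    history = [10.0, 11.0, 9.0, 10.0, 10.0]   # stand-in for stored readings
    live = iter([10.5, 14.0, 9.8])            # stand-in for a live stream
    for value, z_score in flag_anomalies(history, live):
        print(f"unusual reading {value} (z = {z_score:.2f})")
```

The history supplies the definition of normal; the live stream is judged against it one reading at a time, without waiting for the new data to be stored.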
adrian@storminsights.com
adrian@aragonresearch.com
Twitter @ajbowles
Skype ajbowles
If you would like to connect on LinkedIn, please let me know that you registered for the Smart Data webinar series.
Upcoming SmartData Webinar Dates & Topics
June 8 Machine Learning Case Studies
July 13 Advances in NLP I: Understanding
August 10 Organizing Data and Knowledge: The Role of Taxonomies and Ontologies
KEEP IN TOUCH!
New Content from Aragon Research
AragonResearch.com
RESOURCES
https://azure.microsoft.com/en-us/services/stream-analytics
apex.apache.org
http://flink.apache.org
https://spark.apache.org/streaming/
http://samza.apache.org
https://streamanalytix.com
Slide 4 IOT Search: https://thingful.net
https://www.ibm.com/cloud-computing/bluemix/data-analytics
https://www.zoomdata.com
http://www.striim.com
https://www.datatorrent.com
https://aws.amazon.com/kinesis/streams/

Streaming Analytics for IoT-Oriented Applications

  • 1. Streaming Analytics for IoT-Oriented Applications Adrian Bowles, PhD Founder, STORM Insights, Inc. Lead Analyst, AI, Aragon Research info@storminsights.com MAY 11, 2017
  • 2. Context - New Data, New Demands, New Opportunities IOT: Impact, Infrastructure & Implications Streaming Analytics - What, How, and Why Now? Open Source Strategies Vendor Solutions Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. AGENDA: STREAMING ANALYTICS FOR IOT-ORIENTED APPLICATIONS
  • 3. When everything is connected… New sources of data emerge New sources of value emerge Old assumptions must be challenged Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. THE IMPACT OF THE IOT
  • 4. Copyright (c) 2014 by Umbrellium Ltd. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. SENSORS ARE EVERYWHERE
  • 5. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. DATA ON THE MOVE
  • 6. Today: Delta Airlines processes 5,000,000 business events per day Pratt & Whitney jet engine: 5,000 sensors producing 10GB/s/per engine. Formula 1 car sensors produce about 1.2GB/s and we need to predict the future… Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. AS THE SCOPE CHANGES, SO MUST THE SOLUTIONS
  • 7. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 2-TIER IOT ARCHITECTURE Sensors/ Devices Compute Engine Data Center Cloud Cluster Network
  • 8. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 2-TIER IOT ARCHITECTURE Sensors/ Devices Compute Engine Data Center Cloud Cluster Network
  • 9. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 2-TIER IOT ARCHITECTURE Sensors/ Devices Compute Engine Data Center Cloud Cluster Network
  • 10. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 3-TIER IOT ARCHITECTURE Sensors/ Devices Compute EngineGateways Data Center Cloud Cluster Network
  • 11. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 3-TIER IOT ARCHITECTURE ENABLES DISTRIBUTED ANALYTICS Sensors/ Devices Compute Engine Data Center Cloud Cluster Network Compute Engines
  • 12. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 3-TIER IOT ARCHITECTURE ENABLES DISTRIBUTED INTELLIGENCE & ANALYTICS Sensors/ Devices Train the Deep Learning Model Data Center Cloud Cluster Network Compress & Run The DL Model
  • 13. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. A ROLE FOR COLLABORATION AND STANDARDS
  • 14. Predix Jasper Control Center Watson IoT Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. IOT PLATFORMS
  • 15. Context - New Data, New Demands, New Opportunities IOT: Impact, Infrastructure & Implications Streaming Analytics - What, How, and Why Now? Open Source Strategies Vendor Solutions Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. AGENDA: STREAMING ANALYTICS FOR IOT-ORIENTED APPLICATIONS
  • 16. PRESCRIPTIVE ANALYTICS DESCRIPTIVE PREDICTIVE WHAT TO DO FUTURE OR MISSING VALUES KNOWN VALUES ANALYTICS
  • 20. STREAMINGIN MOTION DATA “You could not step twice into the same river.” Heraclitus 535BC-474BC To understand the contents (analyze)… Divert the flow? Pool the data? Evaluate everything without changing the flow? Sample? (catch and release?)
  • 21. STREAMING ANALYTICS DESCRIPTIVE DATA PREDICTIVE STREAMING ANALYTICS: MOVE THE PROCESS TO THE DATA STREAMING ANALYTICS: STATISTICAL ANALYSIS OF DATA IN MOTION
  • 22. CONVENTIONAL DATA ARCHITECTURE. Diagram labels: Data Sources, Store, Process/Transform. Key: Data Flows on the Edges, Queries on the Vertices
  • 23. STREAMING DATA ARCHITECTURE. Diagram labels: Data Sources, Store, Process/Transform, Observe. Key: Data Flows on the Edges, Queries Everywhere. Sampling vs Monitoring Everything…
  • 25. THE PROBLEM WITH SAMPLING… Figure values: 440, 880, 440, 880, 440, 880
  • 26. AGENDA: STREAMING ANALYTICS FOR IOT-ORIENTED APPLICATIONS. Context - New Data, New Demands, New Opportunities; IOT: Impact, Infrastructure & Implications; Streaming Analytics - What, How, and Why Now?; Open Source Strategies; Vendor Solutions
  • 27. RELEVANT APACHE SOFTWARE FOUNDATION OPEN SOURCE PROJECTS. Apache Flink: “open-source stream processing framework for distributed, high-performing, always-available, and accurate data streaming applications.” Apache Samza: “a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.” Apache Storm: “a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.” Apache Spark Streaming: “Spark Streaming brings Apache Spark's language-integrated API to stream processing, letting you write streaming jobs the same way you write batch jobs.” Apache Apex: “Enterprise-grade unified stream and batch processing engine.” Registered trademarks or trademarks of The Apache Software Foundation
  • 28. THE OPEN SOURCE ECOSYSTEM FOR ANALYTICS INFRASTRUCTURE (courtesy of STRIIM). Diagram labels: Internode Communication, Distributed Cache, Indexed Results Store, Processing, Machine Learning, Visualization, Cluster Management, Data Collection, Data Delivery, Development, YARN
  • 29. AGENDA: STREAMING ANALYTICS FOR IOT-ORIENTED APPLICATIONS. Context - New Data, New Demands, New Opportunities; IOT: Impact, Infrastructure & Implications; Streaming Analytics - What, How, and Why Now?; Open Source Strategies; Vendor Solutions
  • 30. COMMERCIAL SOLUTIONS: PLATFORMS FOR STREAMING ANALYTICS AS A SERVICE. IBM Bluemix, Google Cloud Platform, Azure Stream Analytics, Oracle Stream Analytics
  • 31. STREAMING ANALYTICS AS A SERVICE. Key Trend: Streaming Analytics for IOT as a Service
  • 32. STREAMING ANALYTICS AS A SERVICE. Key Trend: Streaming Analytics for IOT as a Service
  • 33. STREAMING ANALYTICS AS A SERVICE. Key Trend: Streaming Analytics for IOT as a Service
  • 34. Key Trend: Streaming Analytics for IOT as a Service
  • 35. COMMERCIAL SOLUTIONS. SAP Event Stream Processor, Tibco Event Analytics, Cisco Connected Streaming Analytics, SAS Event Stream Processing
  • 36. STREAMING ANALYTICS - EMERGING VENDORS TO WATCH. Integrate, Analyze
  • 37. DATATORRENT. Source: DataTorrent.com
  • 38. STRIIM Integration & Intelligence. Diagram labels: Multiple data sources (Users, CDC, Flume, Kafka, Logs); Batch / High-Latency; Real-time / Low-Latency; Hadoop (HDFS); (Existing) ETL Jobs; ODS/EDW; Spark; Hive; Real-Time Applications; Legacy Applications; Big Data Applications. Source: STRIIM.com
  • 39. STREAMANALYTIX. Source: streamanalytix.com
  • 40. ZOOMDATA. Data Sharpening. Source: zoomdata.com
  • 41. GETTING STARTED… It’s All About the Data. Do you have or can you capture streaming data that can increase your value proposition? Data about your product that can improve performance, reliability, predictability… Can you create value from new analysis of open data? Adding your own data/algorithms to open data creates value. Tips: Start by evaluating the emerging open source de facto standards. Choose an infrastructure that allows you to evaluate live streaming data in the context of relevant historical data.
  • 42. KEEP IN TOUCH! adrian@storminsights.com, adrian@aragonresearch.com, Twitter @ajbowles, Skype ajbowles. If you would like to connect on LinkedIn, please let me know that you registered for the Smart Data webinar series. Upcoming SmartData Webinar Dates & Topics: June 8, Machine Learning Case Studies; July 13, Advances in NLP I: Understanding; August 10, Organizing Data and Knowledge: The Role of Taxonomies and Ontologies. New Content from Aragon Research: AragonResearch.com
  • 43. RESOURCES. https://azure.microsoft.com/en-us/services/stream-analytics; apex.apache.org; http://flink.apache.org; https://spark.apache.org/streaming/; http://samza.apache.org; https://streamanalytix.com; Slide 4 IOT Search: https://thingful.net; https://www.ibm.com/cloud-computing/bluemix/data-analytics; https://www.zoomdata.com; http://www.striim.com; https://www.datatorrent.com; https://aws.amazon.com/kinesis/streams/
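
Slide 21 describes streaming analytics as statistical analysis of data in motion, and slide 41 recommends evaluating live streaming data in the context of relevant historical data. The Python below is a minimal sketch of that idea and is not taken from the deck. It keeps a running mean and standard deviation with Welford's online algorithm, so no readings need to be stored, and flags readings that sit far from everything seen so far. The names `RunningStats` and `flag_outliers` and the `threshold` and `warmup` parameters are hypothetical, chosen only for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class RunningStats:
    """Welford's online algorithm: one pass, constant memory."""
    count: int = 0
    mean: float = 0.0
    m2: float = 0.0  # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        # Fold one new reading into the running mean and m2.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    @property
    def stddev(self) -> float:
        # Sample standard deviation; 0.0 until two readings have arrived.
        if self.count < 2:
            return 0.0
        return math.sqrt(self.m2 / (self.count - 1))


def flag_outliers(readings, threshold=3.0, warmup=5):
    """Yield readings more than `threshold` standard deviations from the
    mean of all earlier readings (the "historical" context).

    `threshold` and `warmup` are illustrative assumptions, not values
    from the slides.
    """
    stats = RunningStats()
    for x in readings:
        # Compare against history first, then add the reading to history.
        if stats.count >= warmup and stats.stddev > 0:
            if abs(x - stats.mean) / stats.stddev > threshold:
                yield x
        stats.update(x)
```

For example, `list(flag_outliers([20.0, 20.5, 19.5, 20.2, 19.8, 20.1, 35.0, 20.0]))` flags only the 35.0 reading. Because `flag_outliers` is a generator, it can consume an unbounded sensor feed one reading at a time rather than pooling the data first.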