SlideShare a Scribd company logo
IoT Analytics
Anjana Fernando
Senior Technical Lead
WSO2 Inc.
IoT Analytics in Action
Rolls Royce Trent 1000
Analytics data collected in
● Design
● Manufacture
● After-sales
One fan blade manufacturing -> 0.5 TB of data
Real-time data transmitted back to RR when planes are in-flight.
IoT Analytics in Action
Caterpillar
From autonomous mining trucks to locomotives, they have sensors monitoring fuel, idle time,
location for maximum operation efficiency.
Predictive maintenance has saved millions from timely fuel pump replacement to adjusting ship
hull cleaning intervals in their marine services.
IoT Analytics Categorized
● What type of data?
● How fast you need results?
● How much data to keep?
● Historical, real-time, or predictive?
● Cloud or fog / edge analytics?
IoT Analytics Data
● Time related data
○ Time series processing
■ Energy consumption with time
■ Failure prediction
■ Specialized DBs - OpenTSDB
● Location data
○ GPS / iBeacons
○ Used in agriculture
■ Detect soil moisture, crop growth
■ Manage irrigation equipment
○ Traffic planning
■ Monitor vehicle speeds, location for better route suggestions
○ Geospatial optimized processing engines - GeoTrellis
IoT Analytics Processing Modes
Do we need the results instantaneously?, or a few seconds
delay okay?, or else, results after several minutes or more is
fine?
IoT Analytics Processing Modes: Realtime / NRT
● The most often used processing mode in IoT
○ Immediately take action on some event occurring with the source
devices
■ Send out alerts from a temperature sensor hitting a limit
■ Notification in a car dashboard of low tire pressure
● Generating instant alerts and information based on the data sent by
sensors, requires stream processing. Process events one by one in
real-time to match to a predefined set of rules.
○ Apache Storm as a stream processing engine
■ Scalable and fault tolerant
○ For advanced pattern matching, a full fledged CEP engine can be
used, e.g. WSO2 CEP, Esper etc..
IoT Analytics Processing Modes: Interactive /
Batch
● Long term statistics generations, a batch processing system can be
used: Apache Hadoop, Apache Spark
○ Average temperature in a room in the last month
○ Total power usage of the house in the last year
● Interactive analytics with technologies such as Apache Drill and
indexed storage systems such as Couchbase.
● Most often, we may need to mash-up both batch analytics results with
real-time processing
○ Comparing a long term statistics result with incoming real-time
events for alerts etc..
● Batch operations can be brought together with an indexing system for
real-time analytics to lookup data instantly when required
○ Apache Lucene, WSO2 DAS Analytics / Event Tables
IoT Analytics Data Retention
● IoT devices generate high volume or different types of data
● We can decide to process right away when we receive it, and discard it,
or else, keep it for more detailed processing
● Big Data stores gives us the option to store huge amounts of data as
such.
● Purge the data, after the raw data is no longer required
IoT Analytics Processing: Hindsight/Insight/Foresight
● Hindsight can be achieved by processing historical data, and
understanding what has happened.
○ Batch processing systems such as Apache Hadoop and Apache
Spark is used in this area
○ Data visualization with dashboards, showing related data together
● Insight would be understanding what is happening now
○ Achieved with real-time processing systems
○ Scenario: How are my jet engines performing right now
● Foresight is predicting what is going to happen
○ Achieved with machine learning systems such as Apache Mahout,
Apache Spark MLlib, Microsoft Azure Machine Learning, WSO2 ML
○ Scenario: Predictive maintenance -> time to change specific parts
in my car, service scheduling on an aeroplane
So many things… So much processing…
● IoT will mean, naturally large amounts of data created, thus large
amount of computation resources are required
● Typical scenario of a centralized analytics server for all devices may not
be feasible all the time
○ Centralized analytics hardware may not be scalable for all the
thousands of devices getting added frequently
○ The network communication will get flooded with analytics chatter
when the device count increases
● Solution: edge analytics, a.k.a, fog analytics
○ Some of the analytics operations are offloaded to the end device
itself or to an immediate gateway, for doing most or some of the
analytics operations required. This creates a scalable infrastructure
for device management in the IoT ecosystem.
WSO2 Reference Architecture for IoT Analytics
Questions?
Thank You

More Related Content

What's hot

Big Data Streaming Analytics for Engineers
Big Data Streaming Analytics for EngineersBig Data Streaming Analytics for Engineers
Big Data Streaming Analytics for Engineers
Naunidh Chadha
 
Open stack billing system
Open stack billing systemOpen stack billing system
Open stack billing system
Alexander Tsirel
 
Latency SLOs Done Right @ SREcon EMEA 2019
Latency SLOs Done Right @ SREcon EMEA 2019Latency SLOs Done Right @ SREcon EMEA 2019
Latency SLOs Done Right @ SREcon EMEA 2019
Heinrich Hartmann
 
Open source historian
Open source historianOpen source historian
Open source historian
Geoff Nunan
 
Scalable Online Analytics for Monitoring
Scalable Online Analytics for MonitoringScalable Online Analytics for Monitoring
Scalable Online Analytics for Monitoring
Heinrich Hartmann
 
Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...
Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...
Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...
DataStax Academy
 
WHODIS_kearns_presentation.v0a
WHODIS_kearns_presentation.v0aWHODIS_kearns_presentation.v0a
WHODIS_kearns_presentation.v0aEdward Kearns
 
InfluxDB Cloud Product Update
InfluxDB Cloud Product Update InfluxDB Cloud Product Update
InfluxDB Cloud Product Update
InfluxData
 
WWW19: SGX-PySpark: Secure Distributed Data Analytics
WWW19: SGX-PySpark: Secure Distributed Data AnalyticsWWW19: SGX-PySpark: Secure Distributed Data Analytics
WWW19: SGX-PySpark: Secure Distributed Data Analytics
LEGATO project
 
Da vinci classes
Da vinci classesDa vinci classes
Da vinci classes
Diyan Yordanov
 
Collecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
Collecting Endpoint Security Logs Through Big Data Technology - Dedi DwiantoCollecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
Collecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
idsecconf
 
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | GrafanaStreaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
InfluxData
 
Tapjoy OpenStack Summit Paris Breakout Session
Tapjoy OpenStack Summit Paris Breakout SessionTapjoy OpenStack Summit Paris Breakout Session
Tapjoy OpenStack Summit Paris Breakout Session
Weston Jossey
 

What's hot (13)

Big Data Streaming Analytics for Engineers
Big Data Streaming Analytics for EngineersBig Data Streaming Analytics for Engineers
Big Data Streaming Analytics for Engineers
 
Open stack billing system
Open stack billing systemOpen stack billing system
Open stack billing system
 
Latency SLOs Done Right @ SREcon EMEA 2019
Latency SLOs Done Right @ SREcon EMEA 2019Latency SLOs Done Right @ SREcon EMEA 2019
Latency SLOs Done Right @ SREcon EMEA 2019
 
Open source historian
Open source historianOpen source historian
Open source historian
 
Scalable Online Analytics for Monitoring
Scalable Online Analytics for MonitoringScalable Online Analytics for Monitoring
Scalable Online Analytics for Monitoring
 
Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...
Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...
Cassandra Day London 2015: British Gas Connected Homes: 5 Things We Wish We H...
 
WHODIS_kearns_presentation.v0a
WHODIS_kearns_presentation.v0aWHODIS_kearns_presentation.v0a
WHODIS_kearns_presentation.v0a
 
InfluxDB Cloud Product Update
InfluxDB Cloud Product Update InfluxDB Cloud Product Update
InfluxDB Cloud Product Update
 
WWW19: SGX-PySpark: Secure Distributed Data Analytics
WWW19: SGX-PySpark: Secure Distributed Data AnalyticsWWW19: SGX-PySpark: Secure Distributed Data Analytics
WWW19: SGX-PySpark: Secure Distributed Data Analytics
 
Da vinci classes
Da vinci classesDa vinci classes
Da vinci classes
 
Collecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
Collecting Endpoint Security Logs Through Big Data Technology - Dedi DwiantoCollecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
Collecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
 
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | GrafanaStreaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
 
Tapjoy OpenStack Summit Paris Breakout Session
Tapjoy OpenStack Summit Paris Breakout SessionTapjoy OpenStack Summit Paris Breakout Session
Tapjoy OpenStack Summit Paris Breakout Session
 

Similar to IoT Analytics

WSO2Con ASIA 2016: IoT Analytics
WSO2Con ASIA 2016: IoT AnalyticsWSO2Con ASIA 2016: IoT Analytics
WSO2Con ASIA 2016: IoT Analytics
WSO2
 
Apache Cassandra Lunch #101: IoT and Cassandra
Apache Cassandra Lunch #101: IoT and CassandraApache Cassandra Lunch #101: IoT and Cassandra
Apache Cassandra Lunch #101: IoT and Cassandra
Anant Corporation
 
Extracting Insights from Data at Twitter
Extracting Insights from Data at TwitterExtracting Insights from Data at Twitter
Extracting Insights from Data at Twitter
Prasad Wagle
 
Streaming Analytics and Internet of Things - Geesara Prathap
Streaming Analytics and Internet of Things - Geesara PrathapStreaming Analytics and Internet of Things - Geesara Prathap
Streaming Analytics and Internet of Things - Geesara Prathap
WithTheBest
 
How to Develop and Operate Cloud First Data Platforms
How to Develop and Operate Cloud First Data PlatformsHow to Develop and Operate Cloud First Data Platforms
How to Develop and Operate Cloud First Data Platforms
Alluxio, Inc.
 
Internet of Things & Big Data
Internet of Things & Big DataInternet of Things & Big Data
Internet of Things & Big Data
Arun Rajput
 
How to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and ApplicationsHow to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and Applications
Alluxio, Inc.
 
AWS Big Data Demystified #1: Big data architecture lessons learned
AWS Big Data Demystified #1: Big data architecture lessons learned AWS Big Data Demystified #1: Big data architecture lessons learned
AWS Big Data Demystified #1: Big data architecture lessons learned
Omid Vahdaty
 
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ UberKafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
confluent
 
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a ServiceZeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Databricks
 
Streamlio and IoT analytics with Apache Pulsar
Streamlio and IoT analytics with Apache PulsarStreamlio and IoT analytics with Apache Pulsar
Streamlio and IoT analytics with Apache Pulsar
Streamlio
 
Logging : How much is too much? Network Security Monitoring Talk @ hasgeek
Logging : How much is too much? Network Security Monitoring Talk @ hasgeekLogging : How much is too much? Network Security Monitoring Talk @ hasgeek
Logging : How much is too much? Network Security Monitoring Talk @ hasgeek
vivekrajan
 
Anurag Awasthi - Machine Learning applications for CloudStack
Anurag Awasthi - Machine Learning applications for CloudStackAnurag Awasthi - Machine Learning applications for CloudStack
Anurag Awasthi - Machine Learning applications for CloudStack
ShapeBlue
 
Sensing the world with data of things
Sensing the world with  data of thingsSensing the world with  data of things
Sensing the world with data of things
Sriskandarajah Suhothayan
 
Sensing the world with Data of Things
Sensing the world with Data of ThingsSensing the world with Data of Things
Sensing the world with Data of Things
Sriskandarajah Suhothayan
 
Elasticsearch as a time series database
Elasticsearch as a time series databaseElasticsearch as a time series database
Elasticsearch as a time series database
felixbarny
 
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | EnglishAWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | English
Omid Vahdaty
 
OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...
OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...
OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...
NETWAYS
 
Streamsets and spark in Retail
Streamsets and spark in RetailStreamsets and spark in Retail
Streamsets and spark in Retail
Hari Shreedharan
 
Analytic Insights in Retail Using Apache Spark with Hari Shreedharan
Analytic Insights in Retail Using Apache Spark with Hari ShreedharanAnalytic Insights in Retail Using Apache Spark with Hari Shreedharan
Analytic Insights in Retail Using Apache Spark with Hari Shreedharan
Databricks
 

Similar to IoT Analytics (20)

WSO2Con ASIA 2016: IoT Analytics
WSO2Con ASIA 2016: IoT AnalyticsWSO2Con ASIA 2016: IoT Analytics
WSO2Con ASIA 2016: IoT Analytics
 
Apache Cassandra Lunch #101: IoT and Cassandra
Apache Cassandra Lunch #101: IoT and CassandraApache Cassandra Lunch #101: IoT and Cassandra
Apache Cassandra Lunch #101: IoT and Cassandra
 
Extracting Insights from Data at Twitter
Extracting Insights from Data at TwitterExtracting Insights from Data at Twitter
Extracting Insights from Data at Twitter
 
Streaming Analytics and Internet of Things - Geesara Prathap
Streaming Analytics and Internet of Things - Geesara PrathapStreaming Analytics and Internet of Things - Geesara Prathap
Streaming Analytics and Internet of Things - Geesara Prathap
 
How to Develop and Operate Cloud First Data Platforms
How to Develop and Operate Cloud First Data PlatformsHow to Develop and Operate Cloud First Data Platforms
How to Develop and Operate Cloud First Data Platforms
 
Internet of Things & Big Data
Internet of Things & Big DataInternet of Things & Big Data
Internet of Things & Big Data
 
How to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and ApplicationsHow to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and Applications
 
AWS Big Data Demystified #1: Big data architecture lessons learned
AWS Big Data Demystified #1: Big data architecture lessons learned AWS Big Data Demystified #1: Big data architecture lessons learned
AWS Big Data Demystified #1: Big data architecture lessons learned
 
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ UberKafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
 
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a ServiceZeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
 
Streamlio and IoT analytics with Apache Pulsar
Streamlio and IoT analytics with Apache PulsarStreamlio and IoT analytics with Apache Pulsar
Streamlio and IoT analytics with Apache Pulsar
 
Logging : How much is too much? Network Security Monitoring Talk @ hasgeek
Logging : How much is too much? Network Security Monitoring Talk @ hasgeekLogging : How much is too much? Network Security Monitoring Talk @ hasgeek
Logging : How much is too much? Network Security Monitoring Talk @ hasgeek
 
Anurag Awasthi - Machine Learning applications for CloudStack
Anurag Awasthi - Machine Learning applications for CloudStackAnurag Awasthi - Machine Learning applications for CloudStack
Anurag Awasthi - Machine Learning applications for CloudStack
 
Sensing the world with data of things
Sensing the world with  data of thingsSensing the world with  data of things
Sensing the world with data of things
 
Sensing the world with Data of Things
Sensing the world with Data of ThingsSensing the world with Data of Things
Sensing the world with Data of Things
 
Elasticsearch as a time series database
Elasticsearch as a time series databaseElasticsearch as a time series database
Elasticsearch as a time series database
 
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | EnglishAWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | English
 
OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...
OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...
OSMC 2018 | Learnings, patterns and Uber’s metrics platform M3, open sourced ...
 
Streamsets and spark in Retail
Streamsets and spark in RetailStreamsets and spark in Retail
Streamsets and spark in Retail
 
Analytic Insights in Retail Using Apache Spark with Hari Shreedharan
Analytic Insights in Retail Using Apache Spark with Hari ShreedharanAnalytic Insights in Retail Using Apache Spark with Hari Shreedharan
Analytic Insights in Retail Using Apache Spark with Hari Shreedharan
 

More from Anjana Fernando

Ballerina – An Open-Source, Cloud-Native Programming Language for Microservices
Ballerina – An Open-Source, Cloud-Native Programming Language for MicroservicesBallerina – An Open-Source, Cloud-Native Programming Language for Microservices
Ballerina – An Open-Source, Cloud-Native Programming Language for Microservices
Anjana Fernando
 
Automatic Microservices Observability with Ballerina - GIDS 2021
Automatic Microservices Observability with Ballerina - GIDS 2021Automatic Microservices Observability with Ballerina - GIDS 2021
Automatic Microservices Observability with Ballerina - GIDS 2021
Anjana Fernando
 
Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021
Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021
Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021
Anjana Fernando
 
Java Distributed Transactions
Java Distributed TransactionsJava Distributed Transactions
Java Distributed Transactions
Anjana Fernando
 
Monitoring Your Business with WSO2 BAM
Monitoring Your Business with WSO2 BAMMonitoring Your Business with WSO2 BAM
Monitoring Your Business with WSO2 BAM
Anjana Fernando
 
Data Services: Getting Your Data Into APIs
Data Services: Getting Your Data Into APIsData Services: Getting Your Data Into APIs
Data Services: Getting Your Data Into APIs
Anjana Fernando
 
Scalable Log Analysis with WSO2 BAM
Scalable Log Analysis with WSO2 BAMScalable Log Analysis with WSO2 BAM
Scalable Log Analysis with WSO2 BAM
Anjana Fernando
 
Data integration and Business Processes
Data integration and Business ProcessesData integration and Business Processes
Data integration and Business Processes
Anjana Fernando
 
Simultaneous analysis of massive data streams in real time and batch
Simultaneous analysis of massive data streams in real time and batchSimultaneous analysis of massive data streams in real time and batch
Simultaneous analysis of massive data streams in real time and batch
Anjana Fernando
 
Ballerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOpsBallerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOps
Anjana Fernando
 
Ballerina - Cloud Native Programming Language
Ballerina - Cloud Native Programming LanguageBallerina - Cloud Native Programming Language
Ballerina - Cloud Native Programming Language
Anjana Fernando
 
Ballerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOpsBallerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOps
Anjana Fernando
 
Effective microservices development with ballerina
Effective microservices development with ballerinaEffective microservices development with ballerina
Effective microservices development with ballerina
Anjana Fernando
 

More from Anjana Fernando (13)

Ballerina – An Open-Source, Cloud-Native Programming Language for Microservices
Ballerina – An Open-Source, Cloud-Native Programming Language for MicroservicesBallerina – An Open-Source, Cloud-Native Programming Language for Microservices
Ballerina – An Open-Source, Cloud-Native Programming Language for Microservices
 
Automatic Microservices Observability with Ballerina - GIDS 2021
Automatic Microservices Observability with Ballerina - GIDS 2021Automatic Microservices Observability with Ballerina - GIDS 2021
Automatic Microservices Observability with Ballerina - GIDS 2021
 
Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021
Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021
Ballerina: An Open-Source, Cloud-Native Programming Language - GIDS 2021
 
Java Distributed Transactions
Java Distributed TransactionsJava Distributed Transactions
Java Distributed Transactions
 
Monitoring Your Business with WSO2 BAM
Monitoring Your Business with WSO2 BAMMonitoring Your Business with WSO2 BAM
Monitoring Your Business with WSO2 BAM
 
Data Services: Getting Your Data Into APIs
Data Services: Getting Your Data Into APIsData Services: Getting Your Data Into APIs
Data Services: Getting Your Data Into APIs
 
Scalable Log Analysis with WSO2 BAM
Scalable Log Analysis with WSO2 BAMScalable Log Analysis with WSO2 BAM
Scalable Log Analysis with WSO2 BAM
 
Data integration and Business Processes
Data integration and Business ProcessesData integration and Business Processes
Data integration and Business Processes
 
Simultaneous analysis of massive data streams in real time and batch
Simultaneous analysis of massive data streams in real time and batchSimultaneous analysis of massive data streams in real time and batch
Simultaneous analysis of massive data streams in real time and batch
 
Ballerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOpsBallerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOps
 
Ballerina - Cloud Native Programming Language
Ballerina - Cloud Native Programming LanguageBallerina - Cloud Native Programming Language
Ballerina - Cloud Native Programming Language
 
Ballerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOpsBallerina - A Programming Language for Cloud and DevOps
Ballerina - A Programming Language for Cloud and DevOps
 
Effective microservices development with ballerina
Effective microservices development with ballerinaEffective microservices development with ballerina
Effective microservices development with ballerina
 

Recently uploaded

How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
Paco van Beckhoven
 
SOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBrokerSOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
Globus
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
Software Testing Exam imp Ques Notes.pdf
Software Testing Exam imp Ques Notes.pdfSoftware Testing Exam imp Ques Notes.pdf
Software Testing Exam imp Ques Notes.pdf
MayankTawar1
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
AMB-Review
 
Strategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptxStrategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptx
varshanayak241
 
Designing for Privacy in Amazon Web Services
Designing for Privacy in Amazon Web ServicesDesigning for Privacy in Amazon Web Services
Designing for Privacy in Amazon Web Services
KrzysztofKkol1
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
Globus
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
Ortus Solutions, Corp
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
abdulrafaychaudhry
 
Why React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdfWhy React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdf
ayushiqss
 
Visitor Management System in India- Vizman.app
Visitor Management System in India- Vizman.appVisitor Management System in India- Vizman.app
Visitor Management System in India- Vizman.app
NaapbooksPrivateLimi
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024
Sharepoint Designs
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
Juraj Vysvader
 
Advanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should KnowAdvanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should Know
Peter Caitens
 

Recently uploaded (20)

How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
 
SOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBrokerSOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBroker
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
Software Testing Exam imp Ques Notes.pdf
Software Testing Exam imp Ques Notes.pdfSoftware Testing Exam imp Ques Notes.pdf
Software Testing Exam imp Ques Notes.pdf
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
 
Strategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptxStrategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptx
 
Designing for Privacy in Amazon Web Services
Designing for Privacy in Amazon Web ServicesDesigning for Privacy in Amazon Web Services
Designing for Privacy in Amazon Web Services
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
 
Why React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdfWhy React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdf
 
Visitor Management System in India- Vizman.app
Visitor Management System in India- Vizman.appVisitor Management System in India- Vizman.app
Visitor Management System in India- Vizman.app
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
 
Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
 
Advanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should KnowAdvanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should Know
 

IoT Analytics

  • 1. IoT Analytics Anjana Fernando Senior Technical Lead WSO2 Inc.
  • 2. IoT Analytics in Action Rolls Royce Trent 1000 Analytics data collected in ● Design ● Manufacture ● After-sales One fan blade manufacturing -> 0.5 TB of data Real-time data transmitted back to RR when planes are in-flight.
  • 3. IoT Analytics in Action Caterpillar From autonomous mining trucks to locomotives, they have sensors monitoring fuel, idle time, location for maximum operation efficiency. Predictive maintenance has saved millions from timely fuel pump replacement to adjusting ship hull cleaning intervals in their marine services.
  • 4. IoT Analytics Categorized ● What type of data? ● How fast you need results? ● How much data to keep? ● Historical, real-time, or predictive? ● Cloud or fog / edge analytics?
  • 5. IoT Analytics Data ● Time related data ○ Time series processing ■ Energy consumption with time ■ Failure prediction ■ Specialized DBs - OpenTSDB ● Location data ○ GPS / iBeacons ○ Used in agriculture ■ Detect soil moisture, crop growth ■ Manage irrigation equipment ○ Traffic planning ■ Monitor vehicle speeds, location for better route suggestions ○ Geospatial optimized processing engines - GeoTrellis
  • 6. IoT Analytics Processing Modes Do we need the results instantaneously?, or a few seconds delay okay?, or else, results after several minutes or more is fine?
  • 7. IoT Analytics Processing Modes: Realtime / NRT ● The most often used processing mode in IoT ○ Immediately take action on some event occurring with the source devices ■ Send out alerts from a temperature sensor hitting a limit ■ Notification in a car dashboard of low tire pressure ● Generating instant alerts and information based on the data sent by sensors, requires stream processing. Process events one by one in real-time to match to a predefined set of rules. ○ Apache Storm as a stream processing engine ■ Scalable and fault tolerant ○ For advanced pattern matching, a full fledged CEP engine can be used, e.g. WSO2 CEP, Esper etc..
  • 8. IoT Analytics Processing Modes: Interactive / Batch ● Long term statistics generations, a batch processing system can be used: Apache Hadoop, Apache Spark ○ Average temperature in a room in the last month ○ Total power usage of the house in the last year ● Interactive analytics with technologies such as Apache Drill and indexed storage systems such as Couchbase. ● Most often, we may need to mash-up both batch analytics results with real-time processing ○ Comparing a long term statistics result with incoming real-time events for alerts etc.. ● Batch operations can be brought together with an indexing system for real-time analytics to lookup data instantly when required ○ Apache Lucene, WSO2 DAS Analytics / Event Tables
  • 9. IoT Analytics Data Retention ● IoT devices generate high volume or different types of data ● We can decide to process right away when we receive it, and discard it, or else, keep it for more detailed processing ● Big Data stores gives us the option to store huge amounts of data as such. ● Purge the data, after the raw data is no longer required
  • 10. IoT Analytics Processing: Hindsight/Insight/Foresight ● Hindsight can be achieved by processing historical data, and understanding what has happened. ○ Batch processing systems such as Apache Hadoop and Apache Spark is used in this area ○ Data visualization with dashboards, showing related data together ● Insight would be understanding what is happening now ○ Achieved with real-time processing systems ○ Scenario: How are my jet engines performing right now ● Foresight is predicting what is going to happen ○ Achieved with machine learning systems such as Apache Mahout, Apache Spark MLlib, Microsoft Azure Machine Learning, WSO2 ML ○ Scenario: Predictive maintenance -> time to change specific parts in my car, service scheduling on an aeroplane
  • 11. So many things… So much processing… ● IoT will mean, naturally large amounts of data created, thus large amount of computation resources are required ● Typical scenario of a centralized analytics server for all devices may not be feasible all the time ○ Centralized analytics hardware may not be scalable for all the thousands of devices getting added frequently ○ The network communication will get flooded with analytics chatter when the device count increases ● Solution: edge analytics, a.k.a, fog analytics ○ Some of the analytics operations are offloaded to the end device itself or to an immediate gateway, for doing most or some of the analytics operations required. This creates a scalable infrastructure for device management in the IoT ecosystem.
  • 12. WSO2 Reference Architecture for IoT Analytics