SlideShare a Scribd company logo
1 of 23
Make Streaming IoT
Analytics Work for You
Kanishk Mahajan & Dhruv Kumar
Hortonworks May 2016
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
IoT - Relevance
 7.2 Billion active SIM Cards
 25 Billion connected things
 500 Million tweets per day
– Average life of a tweet is 18 minutes
 2 Zettabytes per year of Global IP Traffic
– 80% is unstructured
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
The Four Vs of Big Data http://www.ibmbigdatahub.com/infographic/four-vs-big-data
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
5 Attributes of a Streaming Platform
 Ingest
 Process
 Analyze
 Respond
 Visualize
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Data Flow versus Stream Analytics for IoT
 Data Flow
– Ingest and route terabytes data into a ”unified firehose”
– Actively performance manage the latency and quality of these data flows –
• Across high variability of data formats, size of data and speed of data
 Real Time Stream Analytics
– Sub second event processing with linear scalability to billions of events
– Predictive Analytics at Scale
• Real time data aggregation across edge nodes while processing 10s of millions of events and
100s of gigabytes per second
– Guaranteed no data loss and events processed in order
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
6 Key Focus Areas for Building a Streaming Platform for IoT
1. Common Abstraction Layer
2. Latency
3. Lambda Architecture
1. “Orchestrate” over static and real time data
4. Scale-out
5. Rapid Application Development
6. Data Visualization
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform for IoT
1. Common Abstraction Layer
– Select one or more streaming engines
– Select one or more cloud providers
– Select one or more resource managers
– Select one or more event sources
– …Future Proof
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform for IoT
2. Latency
– 500 millisecond or less- Dashboards, Security Incidents, Asset Performance
– 20 milliseconds or less - Ad Networks, Preventive Maintenance
– …
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform for IoT
3. Lambda Architecture
– Integrate static and real time data
– Enrichment
– Orchestration of Batch workflows
– Predictive Classification and Scoring
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform for IoT
4. Scale Out
– Linear Scale out or scale down
– Resource Management
– Handle Transient workloads
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform for IoT
5. Rapid Application Development
– Industry vertical specific applications
– Aggregations
– Filters
– Multi Stream Correlations
– Splits
– Joins
– Normalizations
– Business Rules Editor
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform for IoT
6. Data Visualization
– Time Series Visualizations
– Metrics Dashboarding
– Trends
– Comparisons
– Thresholds
– Custom UI extensions
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
IoT - Real Time Stream Analytics versus Streaming Engine
 Abstracts underlying Streaming Engine- Storm, Spark Streaming, Flink..
 OOTB Support for multiple (cloud) event sources - Kafka, AWS Kinesis, Azure Event Hub
 Built-in Operators for Complex Event Processing
 Built-in Real Time Dashboarding- Metrics and Events
 PMML Support
 Pluggable Workflow Management
 Business Rules Editor and Rapid Application Development Framework
 Cloud Deployment
 Scalable Architecture
 Handles different latency requirements
IoT: The Big Picture
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011
Hortonworks IoT Platform – The Big Picture
Page 15
Data Acquisition
Edge Processing
Real Time Stream Analytics
IoT Services
Rapid Application Development
IoT
ANALYTICS
CLOUD
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
IoT Edge
Data Acquisition & Processing
Device Specific
Protocol
Device Data
Acquisition
Edge
Processing
IoT Analytics
CloudHTTP(s)
Edge Processing
• Reliable Delivery
• Buffering and Flow Control
• Simple Event Processing
• Edge Analytics
High Availability
• Scale out and Clustering
• Multi Data Center Support
Security
• 2 way SSL
• Data Element Masking
• Flexible Routing based on Data
Sensitivity
Kafka Enablement
• Data Ingest and Drain based on
Kafka Consumer and Producer
Support
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Device Management
Agent
• Send Diagnostics
• Receive F/W updates
• Connectivity Logs
Enrollment &
Authentication
• 3-legged OAuth
based flow
• Encryption
• Confidentiality
Registry
• Device Metadata
Catalog
User Linkage
• Device - User Linkage
• May be many - many
between Device and
User
• Temporary or
Persistent
De-
Authorization
• Deregister
• Unlink User
• Disable
Use Cases - Connected Car
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Connected Car - Generator of Big Data:
http://www.slideshare.net/AditiTechnologies/how-internet-of-things-iot-is-reshaping-the-automotive-sector-infographic
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Connected Car - Use Cases http://gelookahead.economist.com/infograph/car-os/
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Reference Architeucture: Connected Car Platform and Hortonworks
For more details contact sales@hortonworks.com
Hortonworks Data Platform
Core Data
Engineers (CAD)
Systems of Record (ERP)
Customers (CRM)
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache NiFi Demo: https://www.youtube.com/watch?v=r4NsWbE4_-I
(Download here: http://hortonworks.com/products/hdf/
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Questions?
http://community.hortonworks.com

More Related Content

What's hot

Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
DataWorks Summit
 
The Implacable advance of the data
The Implacable advance of the dataThe Implacable advance of the data
The Implacable advance of the data
DataWorks Summit
 
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
DataWorks Summit
 
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
DataWorks Summit/Hadoop Summit
 
Achieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturingAchieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturing
DataWorks Summit
 
Hortonworks, Novetta and Noble Energy Webinar
Hortonworks, Novetta and Noble Energy Webinar Hortonworks, Novetta and Noble Energy Webinar
Hortonworks, Novetta and Noble Energy Webinar
Hortonworks
 

What's hot (20)

Risk listening: monitoring for profitable growth
Risk listening: monitoring for profitable growthRisk listening: monitoring for profitable growth
Risk listening: monitoring for profitable growth
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Dataguise hortonworks insurance_feb25
Dataguise hortonworks insurance_feb25Dataguise hortonworks insurance_feb25
Dataguise hortonworks insurance_feb25
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
Spark Summit EMEA - Arun Murthy's Keynote
Spark Summit EMEA - Arun Murthy's KeynoteSpark Summit EMEA - Arun Murthy's Keynote
Spark Summit EMEA - Arun Murthy's Keynote
 
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
 
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
 
The Implacable advance of the data
The Implacable advance of the dataThe Implacable advance of the data
The Implacable advance of the data
 
Real-time Analytics in Financial: Use Case, Architecture and Challenges
Real-time Analytics in Financial: Use Case, Architecture and ChallengesReal-time Analytics in Financial: Use Case, Architecture and Challenges
Real-time Analytics in Financial: Use Case, Architecture and Challenges
 
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
 
Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications
 
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
 
Overcoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus onOvercoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus on
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Achieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturingAchieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturing
 
Hortonworks, Novetta and Noble Energy Webinar
Hortonworks, Novetta and Noble Energy Webinar Hortonworks, Novetta and Noble Energy Webinar
Hortonworks, Novetta and Noble Energy Webinar
 
Apache Metron: Community Driven Cyber Security
Apache Metron: Community Driven Cyber Security Apache Metron: Community Driven Cyber Security
Apache Metron: Community Driven Cyber Security
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Real-time Analytics in Financial
Real-time Analytics in FinancialReal-time Analytics in Financial
Real-time Analytics in Financial
 

Viewers also liked

Viewers also liked (20)

IoT Analytics Company Presentation
IoT Analytics Company Presentation IoT Analytics Company Presentation
IoT Analytics Company Presentation
 
AWS re:Invent 2016: Understanding IoT Data: How to Leverage Amazon Kinesis in...
AWS re:Invent 2016: Understanding IoT Data: How to Leverage Amazon Kinesis in...AWS re:Invent 2016: Understanding IoT Data: How to Leverage Amazon Kinesis in...
AWS re:Invent 2016: Understanding IoT Data: How to Leverage Amazon Kinesis in...
 
Making Smarter Systems with IoT and Analytics
Making Smarter Systems with IoT and AnalyticsMaking Smarter Systems with IoT and Analytics
Making Smarter Systems with IoT and Analytics
 
IoT Analytics from Edge to Cloud - using IBM Informix
IoT Analytics from Edge to Cloud - using IBM InformixIoT Analytics from Edge to Cloud - using IBM Informix
IoT Analytics from Edge to Cloud - using IBM Informix
 
Kvm for ibm_z_systems_v1.1.2_limits
Kvm for ibm_z_systems_v1.1.2_limitsKvm for ibm_z_systems_v1.1.2_limits
Kvm for ibm_z_systems_v1.1.2_limits
 
Let op: Facebook is geen eiland! | Congres Facebook Marketing 2014 | PauwR | ...
Let op: Facebook is geen eiland! | Congres Facebook Marketing 2014 | PauwR | ...Let op: Facebook is geen eiland! | Congres Facebook Marketing 2014 | PauwR | ...
Let op: Facebook is geen eiland! | Congres Facebook Marketing 2014 | PauwR | ...
 
Nzs 1543-howibmservicemanagementunitehelpsmainframeo-160302232115
Nzs 1543-howibmservicemanagementunitehelpsmainframeo-160302232115Nzs 1543-howibmservicemanagementunitehelpsmainframeo-160302232115
Nzs 1543-howibmservicemanagementunitehelpsmainframeo-160302232115
 
Socialmedia affectingtheworldatlargev2-161010102038
Socialmedia affectingtheworldatlargev2-161010102038Socialmedia affectingtheworldatlargev2-161010102038
Socialmedia affectingtheworldatlargev2-161010102038
 
Datapowercommonusecases 130509114200-phpapp02
Datapowercommonusecases 130509114200-phpapp02Datapowercommonusecases 130509114200-phpapp02
Datapowercommonusecases 130509114200-phpapp02
 
Technical white paper--Optimizing Quality of Service with SAP HANAon Power Ra...
Technical white paper--Optimizing Quality of Service with SAP HANAon Power Ra...Technical white paper--Optimizing Quality of Service with SAP HANAon Power Ra...
Technical white paper--Optimizing Quality of Service with SAP HANAon Power Ra...
 
IBM Counter-Fraud Management for Insurance
IBM Counter-Fraud Management for InsuranceIBM Counter-Fraud Management for Insurance
IBM Counter-Fraud Management for Insurance
 
Ibm spectrum archive ee v1.2.2 performance_white_paper
Ibm spectrum archive ee  v1.2.2 performance_white_paperIbm spectrum archive ee  v1.2.2 performance_white_paper
Ibm spectrum archive ee v1.2.2 performance_white_paper
 
Data Analytics for IoT Device Deployments: Industry Trends and Architectural ...
Data Analytics for IoT Device Deployments: Industry Trends and Architectural ...Data Analytics for IoT Device Deployments: Industry Trends and Architectural ...
Data Analytics for IoT Device Deployments: Industry Trends and Architectural ...
 
Interconnect2017completewatsoniotjourneymap0216 170220225328
Interconnect2017completewatsoniotjourneymap0216 170220225328Interconnect2017completewatsoniotjourneymap0216 170220225328
Interconnect2017completewatsoniotjourneymap0216 170220225328
 
test
testtest
test
 
Wp102696 liberty java batch z os security
Wp102696   liberty java batch z os securityWp102696   liberty java batch z os security
Wp102696 liberty java batch z os security
 
00 revolução russa – 9º ano sj
00 revolução russa – 9º ano sj00 revolução russa – 9º ano sj
00 revolução russa – 9º ano sj
 
Paving the path to Narrowband 5G with LTE IoT
Paving the path to Narrowband 5G with LTE IoTPaving the path to Narrowband 5G with LTE IoT
Paving the path to Narrowband 5G with LTE IoT
 
Small Cell Industry Insight & Experience Sharing
Small Cell Industry Insight & Experience SharingSmall Cell Industry Insight & Experience Sharing
Small Cell Industry Insight & Experience Sharing
 
Data Analytics for IoT
Data Analytics for IoT Data Analytics for IoT
Data Analytics for IoT
 

Similar to Make Streaming IoT Analytics Work for You

Streamline - Stream Analytics for Everyone
Streamline - Stream Analytics for EveryoneStreamline - Stream Analytics for Everyone
Streamline - Stream Analytics for Everyone
DataWorks Summit/Hadoop Summit
 

Similar to Make Streaming IoT Analytics Work for You (20)

Make Streaming Analytics work for you: The Devil is in the Details
Make Streaming Analytics work for you: The Devil is in the DetailsMake Streaming Analytics work for you: The Devil is in the Details
Make Streaming Analytics work for you: The Devil is in the Details
 
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFIHarnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1
 
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big DataVoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
 
HDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi IntroductionHDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi Introduction
 
Unlocking insights in streaming data
Unlocking insights in streaming dataUnlocking insights in streaming data
Unlocking insights in streaming data
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
 
IoT Open Source Integration Comparison (Kura, Node-RED, Flogo, Apache Nifi, S...
IoT Open Source Integration Comparison (Kura, Node-RED, Flogo, Apache Nifi, S...IoT Open Source Integration Comparison (Kura, Node-RED, Flogo, Apache Nifi, S...
IoT Open Source Integration Comparison (Kura, Node-RED, Flogo, Apache Nifi, S...
 
Going Beyond the Device Heart Beat
Going Beyond the Device Heart BeatGoing Beyond the Device Heart Beat
Going Beyond the Device Heart Beat
 
Druid: Sub-Second OLAP queries over Petabytes of Streaming Data
Druid: Sub-Second OLAP queries over Petabytes of Streaming DataDruid: Sub-Second OLAP queries over Petabytes of Streaming Data
Druid: Sub-Second OLAP queries over Petabytes of Streaming Data
 
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital Transformation
 
Streamline - Stream Analytics for Everyone
Streamline - Stream Analytics for EveryoneStreamline - Stream Analytics for Everyone
Streamline - Stream Analytics for Everyone
 
FIWARE Global Summit - FIWARE Overview
FIWARE Global Summit - FIWARE OverviewFIWARE Global Summit - FIWARE Overview
FIWARE Global Summit - FIWARE Overview
 
Hadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash CourseHadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash Course
 
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin PrašovićOracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
 
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
 

More from Hortonworks

More from Hortonworks (20)

IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 
4 Essential Steps for Managing Sensitive Data
4 Essential Steps for Managing Sensitive Data4 Essential Steps for Managing Sensitive Data
4 Essential Steps for Managing Sensitive Data
 
5 Steps to Create a Company Culture that Embraces the Power of Data
5 Steps to Create a Company Culture that Embraces the Power of Data5 Steps to Create a Company Culture that Embraces the Power of Data
5 Steps to Create a Company Culture that Embraces the Power of Data
 
Exploring the Heated-and Completely Unnecessary- Data Lake Debate
Exploring the Heated-and Completely Unnecessary- Data Lake DebateExploring the Heated-and Completely Unnecessary- Data Lake Debate
Exploring the Heated-and Completely Unnecessary- Data Lake Debate
 
Sprint's Data Modernization Journey
Sprint's Data Modernization JourneySprint's Data Modernization Journey
Sprint's Data Modernization Journey
 

Recently uploaded

一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
pyhepag
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
pyhepag
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
DilipVasan
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
RafigAliyev2
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 

Recently uploaded (20)

一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp online
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
Machine Learning for Accident Severity Prediction
Machine Learning for Accident Severity PredictionMachine Learning for Accident Severity Prediction
Machine Learning for Accident Severity Prediction
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
Data analytics courses in Nepal Presentation
Data analytics courses in Nepal PresentationData analytics courses in Nepal Presentation
Data analytics courses in Nepal Presentation
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 

Make Streaming IoT Analytics Work for You

  • 1. Make Streaming IoT Analytics Work for You Kanishk Mahajan & Dhruv Kumar Hortonworks May 2016
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved IoT - Relevance  7.2 Billion active SIM Cards  25 Billion connected things  500 Million tweets per day – Average life of a tweet is 18 minutes  2 Zettabytes per year of Global IP Traffic – 80% is unstructured
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved The Four Vs of Big Data http://www.ibmbigdatahub.com/infographic/four-vs-big-data
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved 5 Attributes of a Streaming Platform  Ingest  Process  Analyze  Respond  Visualize
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Data Flow versus Stream Analytics for IoT  Data Flow – Ingest and route terabytes data into a ”unified firehose” – Actively performance manage the latency and quality of these data flows – • Across high variability of data formats, size of data and speed of data  Real Time Stream Analytics – Sub second event processing with linear scalability to billions of events – Predictive Analytics at Scale • Real time data aggregation across edge nodes while processing 10s of millions of events and 100s of gigabytes per second – Guaranteed no data loss and events processed in order
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved 6 Key Focus Areas for Building a Streaming Platform for IoT 1. Common Abstraction Layer 2. Latency 3. Lambda Architecture 1. “Orchestrate” over static and real time data 4. Scale-out 5. Rapid Application Development 6. Data Visualization
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform for IoT 1. Common Abstraction Layer – Select one or more streaming engines – Select one or more cloud providers – Select one or more resource managers – Select one or more event sources – …Future Proof
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform for IoT 2. Latency – 500 millisecond or less- Dashboards, Security Incidents, Asset Performance – 20 milliseconds or less - Ad Networks, Preventive Maintenance – …
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform for IoT 3. Lambda Architecture – Integrate static and real time data – Enrichment – Orchestration of Batch workflows – Predictive Classification and Scoring
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform for IoT 4. Scale Out – Linear Scale out or scale down – Resource Management – Handle Transient workloads
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform for IoT 5. Rapid Application Development – Industry vertical specific applications – Aggregations – Filters – Multi Stream Correlations – Splits – Joins – Normalizations – Business Rules Editor
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform for IoT 6. Data Visualization – Time Series Visualizations – Metrics Dashboarding – Trends – Comparisons – Thresholds – Custom UI extensions
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved IoT - Real Time Stream Analytics versus Streaming Engine  Abstracts underlying Streaming Engine- Storm, Spark Streaming, Flink..  OOTB Support for multiple (cloud) event sources - Kafka, AWS Kinesis, Azure Event Hub  Built-in Operators for Complex Event Processing  Built-in Real Time Dashboarding- Metrics and Events  PMML Support  Pluggable Workflow Management  Business Rules Editor and Rapid Application Development Framework  Cloud Deployment  Scalable Architecture  Handles different latency requirements
  • 14. IoT: The Big Picture
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved © Hortonworks Inc. 2011 Hortonworks IoT Platform – The Big Picture Page 15 Data Acquisition Edge Processing Real Time Stream Analytics IoT Services Rapid Application Development IoT ANALYTICS CLOUD
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved IoT Edge Data Acquisition & Processing Device Specific Protocol Device Data Acquisition Edge Processing IoT Analytics CloudHTTP(s) Edge Processing • Reliable Delivery • Buffering and Flow Control • Simple Event Processing • Edge Analytics High Availability • Scale out and Clustering • Multi Data Center Support Security • 2 way SSL • Data Element Masking • Flexible Routing based on Data Sensitivity Kafka Enablement • Data Ingest and Drain based on Kafka Consumer and Producer Support
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Device Management Agent • Send Diagnostics • Receive F/W updates • Connectivity Logs Enrollment & Authentication • 3-legged OAuth based flow • Encryption • Confidentiality Registry • Device Metadata Catalog User Linkage • Device - User Linkage • May be many - many between Device and User • Temporary or Persistent De- Authorization • Deregister • Unlink User • Disable
  • 18. Use Cases - Connected Car
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Connected Car - Generator of Big Data: http://www.slideshare.net/AditiTechnologies/how-internet-of-things-iot-is-reshaping-the-automotive-sector-infographic
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Connected Car - Use Cases http://gelookahead.economist.com/infograph/car-os/
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Reference Architeucture: Connected Car Platform and Hortonworks For more details contact sales@hortonworks.com Hortonworks Data Platform Core Data Engineers (CAD) Systems of Record (ERP) Customers (CRM)
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache NiFi Demo: https://www.youtube.com/watch?v=r4NsWbE4_-I (Download here: http://hortonworks.com/products/hdf/
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Questions? http://community.hortonworks.com