SlideShare a Scribd company logo
© 2017 MapR TechnologiesMapR Confidential 1
Act Locally, Learn Globally
© 2017 MapR TechnologiesMapR Confidential 2
Interest in IIOT
spans Across
ALL industries
Source:
“The Internet Of Things Heat Map, 2016“
Forrester Research, Inc.
January 14, 2016
© 2017 MapR TechnologiesMapR Confidential 3
EXAMPLE OF INDUSTRIAL IoT – OIL & GAS / Manufacturing
(Honeywell)
Oil & Gas Sensors Measure Lots of Different
Things:
• Weight on Drill Line
• Oil, Water, Fluid Pressure
• Storage Tank Liquid Levels
• Device Positions
• …And Much More
Oil & Gas Sensors Emit Lots of Data:
• 18,900,000 Readings from 21,000 Devices per
Day
• 1.5TB per Day
Challenges:
• Where to Store all this Data?
• Where to Aggregate Data from Other Oil Rigs?
• How to Aggregate Data, Given Remote
Locations and Limited Network Connectivity
• How to Analyze this Data?
Problem Statement: As crude oil prices
remain historically low, oil & gas
companies are looking for ways to cut
production costs and streamline
operations.
Investments in IoT Make it Possible
to:
• Optimize well work by reducing site
visits.
• Better explore surface and
subsurface for oil.
• Improve equipment maintenance
and safety, and avoid shutdowns.
• Remotely monitor performance.
• Predict oil production.
© 2017 MapR TechnologiesMapR Confidential 4
• Highlights
– High storage capacity & compression for time series data
– Supports Millions of tags
– Advanced calculations & analysis
– Data visualization
– Guaranteed data centralization (ex. PHD remote historical data recovery)
Enterprise Historian
• How does it fair in today’s world?
– Only stores time series data without context.
– Manual export involved to correlate and aggregate from all sites
– Doesn't support unstructured or semi structured data
– Doesn't support deploying predictive models
– Poor Data visualization
– Long time to insights
OPC Server
© 2017 MapR TechnologiesMapR Confidential 5
INDUSTRIAL IoT CAN BE MORE CHALLENGING
IoT Devices/Sensors
No Place to Store,
Process IoT Data:
• Physical Space
Constraints
• High Volume,
Variety of Data
Intermittent and/or
Insecure Network
Connections
Central Platform is Not
Readily-Available
Similar Challenges at All
IoT End Points
? ?
© 2017 MapR TechnologiesMapR Confidential 6
How The Internet Works
Server
Cache
Cache
Gateway
Switch
Firewall
c1
c2
Gateway
Switch Firewall
c1
c2
Switch
Firewall c1
c2
© 2017 MapR TechnologiesMapR Confidential 7
How IoT is turning it upside down
Server
Cache
Cache
GatewaySwitchController
m4
m3
Gateway
Switch
Controller
m6
m5
Switch
Controllerm2
m1
© 2017 MapR TechnologiesMapR Confidential 8
Geo-distribution is Key Challenge for IoT
© 2017 MapR TechnologiesMapR Confidential 9
MapR Edge: What is It?
© 2017 MapR TechnologiesMapR Confidential 10
MAPR EDGE FOR INTERNET-OF-THINGS
GLOBAL DATA PLANE
MAPR EDGE
Small footprint
at the edge
MAPR CONVERGED
ENTERPRISE EDITION
[on-prem, hybrid, cloud]Send insights
back to edge
Aggregate, analyze
data at core
Reliable
replication
MAPR EDGE
Computing
power
close to
the data
Convergence
at the edge
MAPR EDGE
© 2017 MapR TechnologiesMapR Confidential 11
MAPR SOLVES BIG DATA AT THE EDGE
DESIGNED FOR EDGE LOCATIONS
Sites with slow or
occasionally connected
network access
Data sources
that create huge
volumes of data
E.g., oil rigs,
hospitals, vehicles,
remote offices, etc. INTEL NUC MINI PCS
(8.3” X 4.6” X 1.1”)
Space constrained locations requiring small footprints
• 3-5 node cluster, storage capacity limits, minimum 16GB RAM
• Optimized for mini PCs (e.g., Intel NUCs)
© 2017 MapR TechnologiesMapR Confidential 12
“Act Locally, Learn Globally” with MapR Edge
“Act Locally, Learn Globally” IoT applications leverage local data from
numerous sources for constructing machine learning or deep learning models
with global knowledge. These models are then rapidly deployed to the edge to
enable real-time decisions based on local events.
© 2017 MapR TechnologiesMapR Confidential 13
Oil Company Case Study
Source
1
Source
2
Source
1000
Houston
MAPR
Core
Cluster
Time to insight (48 hrs)
Manual Process
Before Edge
Source
1
Source
2
Source
1000
Houston
MAPR
Core
Cluster
Time to insight (<2 hrs)
Automated Process
1000s of
Oil & Drill Sources
Will do Pre Processing locally +at Core
(Custom App + Down Sampling)
After Edge
© 2017 MapR TechnologiesMapR Confidential 14
Medical Device Company Case Study
Source
1
Source
2
Source
100
Houston
MAPR
Core
Cluster
Time to insight (12 hrs)
Manual, Scripts, File Sync SW Process
Source
1
Source
2
Source
1000
Houston
MAPR
Core
Cluster
Time to insight (<15 mins)
Automated Process100s of Medical
machines in each
hospital
Will do Pre Processing locally +at Core
(Custom App + Anonymizations)
Machine Learning
Diagnosis App
Machine Learning
Diagnosis App
Before Edge After Edge
100s of Hospitals
© 2017 MapR TechnologiesMapR Confidential 15
Car Company Test & Dev Case Study
Source
1
Source
2
Source
1000
Houston
MAPR
Core
Cluster
Time to insight (24 hrs)
Manual, Scripts,
Req High Bandwidth Network
Source
1
Source
2
Source
1000
Houston
MAPR
Core
Cluster
Time to insight (<5 mins)
Automated Process1000s Test Cars
running in various
Conditions 24/7
Can do Pre Processing locally + at Core
(Custom App + Test Feedback Loop)
Car Test Analysis App
Before Edge After Edge
All Analysis only
done at Core
1-5TB/Day
© 2017 MapR TechnologiesMapR Confidential 16
Reference IIoT Architecture
Send insights
back to edge
Aggregate, analyze
data at core
MAPR EDGE
MapR-FS
MapR Data Platform
ETL Processing
(Hadoop, Spark)
Interactive
Query Engine
(Drill)
MapR-DB
Central Data Lake
Data Warehouses
Offload / Reload
Data Marts
Data
Exploration
Visualization
Predictive
Analytics
Analytics
NFS
Streams
Open
TSDB
© 2017 MapR TechnologiesMapR Confidential 17
Additional Resources
O’Reilly report by Ted Dunning & Ellen Friedman © March 2017
http://bit.ly/mapr-geo-distribution-ebook-pdf
O’Reilly book by Ted Dunning & Ellen Friedman
© March 2016
Read free courtesy of MapR
https://mapr.com/streaming-architecture-using-
apache-kafka-mapr-streams/
Sandbox:
https://www.mapr.com/products/mapr-sandbox-hadoop
Scaling Time series Databases:
https://community.mapr.com/community/exchange/blog/2017/06/21/scaling-time-series-analysis-on-the-mapr-
converged-data-platform
© 2017 MapR TechnologiesMapR Confidential 18
Q&A
@mapr
rmahajan@mapr.com
ENGAGE WITH US

More Related Content

What's hot

Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
MapR Technologies
 
Michael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - ParstreamMichael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - Parstream
Business of Software Conference
 
Jawsug hpc@jaws festa2016
Jawsug hpc@jaws festa2016Jawsug hpc@jaws festa2016
Jawsug hpc@jaws festa2016
Hiroshi Kobayashi
 
ttec - ParStream
ttec - ParStreamttec - ParStream
ttec - ParStream
Marco van der Hart
 
Xanadu for Big Data + IoT + Deep Learning + Cloud Integration Strategy
Xanadu for Big Data + IoT + Deep Learning + Cloud Integration StrategyXanadu for Big Data + IoT + Deep Learning + Cloud Integration Strategy
Xanadu for Big Data + IoT + Deep Learning + Cloud Integration Strategy
Alex G. Lee, Ph.D. Esq. CLP
 
Apache Spark and future of advanced analytics
Apache Spark and future of advanced analyticsApache Spark and future of advanced analytics
Apache Spark and future of advanced analytics
Muralidhar Somisetty
 
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Carol McDonald
 
Liberating your Industrial Data
Liberating your Industrial DataLiberating your Industrial Data
Liberating your Industrial Data
Amazon Web Services
 
Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
MapR Technologies
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
 
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Matt Stubbs
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
MapR Technologies
 
AWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data AnalyticsAWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data Analytics
Amazon Web Services
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
MapR Technologies
 
The Synapse IoT Stack: Technology Trends in IOT and Big Data
The Synapse IoT Stack: Technology Trends in IOT and Big DataThe Synapse IoT Stack: Technology Trends in IOT and Big Data
The Synapse IoT Stack: Technology Trends in IOT and Big Data
InMobi Technology
 
Oil & Gas Big Data use cases
Oil & Gas Big Data use casesOil & Gas Big Data use cases
Oil & Gas Big Data use cases
elephantscale
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16Terence White
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
MapR Technologies
 
Connecting the Dots- Closing Keynote - AWS Summit SG 2017
Connecting the Dots- Closing Keynote - AWS Summit SG 2017Connecting the Dots- Closing Keynote - AWS Summit SG 2017
Connecting the Dots- Closing Keynote - AWS Summit SG 2017
Amazon Web Services
 
[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics
[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics
[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics
Infochimps, a CSC Big Data Business
 

What's hot (20)

Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
Michael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - ParstreamMichael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - Parstream
 
Jawsug hpc@jaws festa2016
Jawsug hpc@jaws festa2016Jawsug hpc@jaws festa2016
Jawsug hpc@jaws festa2016
 
ttec - ParStream
ttec - ParStreamttec - ParStream
ttec - ParStream
 
Xanadu for Big Data + IoT + Deep Learning + Cloud Integration Strategy
Xanadu for Big Data + IoT + Deep Learning + Cloud Integration StrategyXanadu for Big Data + IoT + Deep Learning + Cloud Integration Strategy
Xanadu for Big Data + IoT + Deep Learning + Cloud Integration Strategy
 
Apache Spark and future of advanced analytics
Apache Spark and future of advanced analyticsApache Spark and future of advanced analytics
Apache Spark and future of advanced analytics
 
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
 
Liberating your Industrial Data
Liberating your Industrial DataLiberating your Industrial Data
Liberating your Industrial Data
 
Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
AWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data AnalyticsAWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data Analytics
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
The Synapse IoT Stack: Technology Trends in IOT and Big Data
The Synapse IoT Stack: Technology Trends in IOT and Big DataThe Synapse IoT Stack: Technology Trends in IOT and Big Data
The Synapse IoT Stack: Technology Trends in IOT and Big Data
 
Oil & Gas Big Data use cases
Oil & Gas Big Data use casesOil & Gas Big Data use cases
Oil & Gas Big Data use cases
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
Connecting the Dots- Closing Keynote - AWS Summit SG 2017
Connecting the Dots- Closing Keynote - AWS Summit SG 2017Connecting the Dots- Closing Keynote - AWS Summit SG 2017
Connecting the Dots- Closing Keynote - AWS Summit SG 2017
 
[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics
[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics
[Webinar] Measure Twice, Build Once: Real-Time Predictive Analytics
 

Similar to MapR Edge : Act Locally Learn Globally

Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Matt Stubbs
 
Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures
Carol McDonald
 
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
WeAreEsynergy
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural Networks
Justin Brandenburg
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
Ian Downard
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
 
Big Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsBig Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business Solutions
Matt Stubbs
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
Carol McDonald
 
Real-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in ActionReal-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in Action
DataWorks Summit
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
Ted Dunning
 
Big Data for Mobile Network Operator
Big Data for Mobile Network OperatorBig Data for Mobile Network Operator
Big Data for Mobile Network Operator
Erkan Örün
 
How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications
MapR Technologies
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
MapR Technologies
 
Big data for MNO
Big data for MNOBig data for MNO
Big data for MNO
Christian Ferenz
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
MapR Technologies
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
DataWorks Summit
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
MapR Technologies
 
Map r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupMap r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetup
Alan Iovine
 
In memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGainIn memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGain
Data Con LA
 
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 20187 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
Ellen Friedman
 

Similar to MapR Edge : Act Locally Learn Globally (20)

Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
 
Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures
 
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural Networks
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
Big Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsBig Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business Solutions
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
 
Real-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in ActionReal-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in Action
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
 
Big Data for Mobile Network Operator
Big Data for Mobile Network OperatorBig Data for Mobile Network Operator
Big Data for Mobile Network Operator
 
How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
 
Big data for MNO
Big data for MNOBig data for MNO
Big data for MNO
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
 
Map r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupMap r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetup
 
In memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGainIn memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGain
 
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 20187 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
 

Recently uploaded

FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 

MapR Edge : Act Locally Learn Globally

  • 1. © 2017 MapR TechnologiesMapR Confidential 1 Act Locally, Learn Globally
  • 2. © 2017 MapR TechnologiesMapR Confidential 2 Interest in IIOT spans Across ALL industries Source: “The Internet Of Things Heat Map, 2016“ Forrester Research, Inc. January 14, 2016
  • 3. © 2017 MapR TechnologiesMapR Confidential 3 EXAMPLE OF INDUSTRIAL IoT – OIL & GAS / Manufacturing (Honeywell) Oil & Gas Sensors Measure Lots of Different Things: • Weight on Drill Line • Oil, Water, Fluid Pressure • Storage Tank Liquid Levels • Device Positions • …And Much More Oil & Gas Sensors Emit Lots of Data: • 18,900,000 Readings from 21,000 Devices per Day • 1.5TB per Day Challenges: • Where to Store all this Data? • Where to Aggregate Data from Other Oil Rigs? • How to Aggregate Data, Given Remote Locations and Limited Network Connectivity • How to Analyze this Data? Problem Statement: As crude oil prices remain historically low, oil & gas companies are looking for ways to cut production costs and streamline operations. Investments in IoT Make it Possible to: • Optimize well work by reducing site visits. • Better explore surface and subsurface for oil. • Improve equipment maintenance and safety, and avoid shutdowns. • Remotely monitor performance. • Predict oil production.
  • 4. © 2017 MapR TechnologiesMapR Confidential 4 • Highlights – High storage capacity & compression for time series data – Supports Millions of tags – Advanced calculations & analysis – Data visualization – Guaranteed data centralization (ex. PHD remote historical data recovery) Enterprise Historian • How does it fair in today’s world? – Only stores time series data without context. – Manual export involved to correlate and aggregate from all sites – Doesn't support unstructured or semi structured data – Doesn't support deploying predictive models – Poor Data visualization – Long time to insights OPC Server
  • 5. © 2017 MapR TechnologiesMapR Confidential 5 INDUSTRIAL IoT CAN BE MORE CHALLENGING IoT Devices/Sensors No Place to Store, Process IoT Data: • Physical Space Constraints • High Volume, Variety of Data Intermittent and/or Insecure Network Connections Central Platform is Not Readily-Available Similar Challenges at All IoT End Points ? ?
  • 6. © 2017 MapR TechnologiesMapR Confidential 6 How The Internet Works Server Cache Cache Gateway Switch Firewall c1 c2 Gateway Switch Firewall c1 c2 Switch Firewall c1 c2
  • 7. © 2017 MapR TechnologiesMapR Confidential 7 How IoT is turning it upside down Server Cache Cache GatewaySwitchController m4 m3 Gateway Switch Controller m6 m5 Switch Controllerm2 m1
  • 8. © 2017 MapR TechnologiesMapR Confidential 8 Geo-distribution is Key Challenge for IoT
  • 9. © 2017 MapR TechnologiesMapR Confidential 9 MapR Edge: What is It?
  • 10. © 2017 MapR TechnologiesMapR Confidential 10 MAPR EDGE FOR INTERNET-OF-THINGS GLOBAL DATA PLANE MAPR EDGE Small footprint at the edge MAPR CONVERGED ENTERPRISE EDITION [on-prem, hybrid, cloud]Send insights back to edge Aggregate, analyze data at core Reliable replication MAPR EDGE Computing power close to the data Convergence at the edge MAPR EDGE
  • 11. © 2017 MapR TechnologiesMapR Confidential 11 MAPR SOLVES BIG DATA AT THE EDGE DESIGNED FOR EDGE LOCATIONS Sites with slow or occasionally connected network access Data sources that create huge volumes of data E.g., oil rigs, hospitals, vehicles, remote offices, etc. INTEL NUC MINI PCS (8.3” X 4.6” X 1.1”) Space constrained locations requiring small footprints • 3-5 node cluster, storage capacity limits, minimum 16GB RAM • Optimized for mini PCs (e.g., Intel NUCs)
  • 12. © 2017 MapR TechnologiesMapR Confidential 12 “Act Locally, Learn Globally” with MapR Edge “Act Locally, Learn Globally” IoT applications leverage local data from numerous sources for constructing machine learning or deep learning models with global knowledge. These models are then rapidly deployed to the edge to enable real-time decisions based on local events.
  • 13. © 2017 MapR TechnologiesMapR Confidential 13 Oil Company Case Study Source 1 Source 2 Source 1000 Houston MAPR Core Cluster Time to insight (48 hrs) Manual Process Before Edge Source 1 Source 2 Source 1000 Houston MAPR Core Cluster Time to insight (<2 hrs) Automated Process 1000s of Oil & Drill Sources Will do Pre Processing locally +at Core (Custom App + Down Sampling) After Edge
  • 14. © 2017 MapR TechnologiesMapR Confidential 14 Medical Device Company Case Study Source 1 Source 2 Source 100 Houston MAPR Core Cluster Time to insight (12 hrs) Manual, Scripts, File Sync SW Process Source 1 Source 2 Source 1000 Houston MAPR Core Cluster Time to insight (<15 mins) Automated Process100s of Medical machines in each hospital Will do Pre Processing locally +at Core (Custom App + Anonymizations) Machine Learning Diagnosis App Machine Learning Diagnosis App Before Edge After Edge 100s of Hospitals
  • 15. © 2017 MapR TechnologiesMapR Confidential 15 Car Company Test & Dev Case Study Source 1 Source 2 Source 1000 Houston MAPR Core Cluster Time to insight (24 hrs) Manual, Scripts, Req High Bandwidth Network Source 1 Source 2 Source 1000 Houston MAPR Core Cluster Time to insight (<5 mins) Automated Process1000s Test Cars running in various Conditions 24/7 Can do Pre Processing locally + at Core (Custom App + Test Feedback Loop) Car Test Analysis App Before Edge After Edge All Analysis only done at Core 1-5TB/Day
  • 16. © 2017 MapR TechnologiesMapR Confidential 16 Reference IIoT Architecture Send insights back to edge Aggregate, analyze data at core MAPR EDGE MapR-FS MapR Data Platform ETL Processing (Hadoop, Spark) Interactive Query Engine (Drill) MapR-DB Central Data Lake Data Warehouses Offload / Reload Data Marts Data Exploration Visualization Predictive Analytics Analytics NFS Streams Open TSDB
  • 17. © 2017 MapR TechnologiesMapR Confidential 17 Additional Resources O’Reilly report by Ted Dunning & Ellen Friedman © March 2017 http://bit.ly/mapr-geo-distribution-ebook-pdf O’Reilly book by Ted Dunning & Ellen Friedman © March 2016 Read free courtesy of MapR https://mapr.com/streaming-architecture-using- apache-kafka-mapr-streams/ Sandbox: https://www.mapr.com/products/mapr-sandbox-hadoop Scaling Time series Databases: https://community.mapr.com/community/exchange/blog/2017/06/21/scaling-time-series-analysis-on-the-mapr- converged-data-platform
  • 18. © 2017 MapR TechnologiesMapR Confidential 18 Q&A @mapr rmahajan@mapr.com ENGAGE WITH US

Editor's Notes

  1. Explain numbers;
  2. Explain the numbers
  3. Explain Historian: Large database
  4. To deal with the devices that can create terabytes of data in short time windows, a new architecture is required. While the MapR Converged Data Platform has always been good for the consumer IoT model and for traditional big data use cases, the industrial IoT model now has an optimal approach with MapR Edge. MapR Edge is a small footprint edition of the MapR Converged Data Platform that addresses the need to capture, process, and analyze IoT data close to the source and is optimized to run on small-footprint, commodity hardware. MapR Edge puts computing power close to the data to enable processing in real-time. It works in conjunction with a core MapR deployment so you can perform local processing while also taking advantage of a larger, centralized cluster of aggregated data.
  5. What does it solve ? Whats the ideal environment for edge? Its ideal for dealing with the characteristics of edge locations in industrial IoT environments. Data sources that create huge volumes of data such as mining sites, blast sites, manufacturing plants are ideal environments for MapR Edge. Slow or occasionally connected sites can be managed by the bandwidth-aware replication in MapR. Space constrained locations that require a small computing footprint can be handled by MapR Edge, as it runs as a 3 to 5 node cluster on mini PCs such as the Intel NUCs, which are typically the size of a small book and can be bought off the shelf. Other vendors make similar PCs that are hardened for remote and even harsh locations as part of an IoT deployment.
  6. We describe the interaction of MapR Edge clusters with a centralized core cluster as “act locally, learn globally.” This architecture gives you processing power to immediately act on events close to the data source, while also delivering the data, or subsets of it, to the central cluster to gain globally-based insights on aggregated data. These insights, in the form of machine learning models, are then deployed to the edge clusters for more real-time processing. This architecture is just another way that MapR lets you operationalize your data.
  7. One of our oil company customers had a manual and expensive process for collecting data from their oil wells. They drive out to the remote locations, download the data on laptops, and drive to headquarters to upload that data. This is the only practical way for them to collect data since the amount of data that’s create overwhelms their connection bandwidth. With MapR, they can use edge nodes to monitor the wells and ingest data, as well as down-sample the collected data into a size that can be easily delivered over the Internet to the home cluster.
  8. One of the other example for edge is a large medical device company in Europe. They sell MRI Machines and other medical equipment to the hospitals. Traditionally, there was a manual process and manual file syncing of the data coming from the medical devices on a batch manner, and then sent to the central cluster where a machine learning diagnosis app will look at the data and send diagnosis was sent back to the hospital. With MapR edge in picture, they have started doing processing locally at the edge, and they anonymize the scan data for compliance purposes, send it over the Internet to the core cluster to do large-scale processing, and then return the results immediately. They not only get quick turnaround, but they can also store the private data at the medical center versus having it stored at the home cluster.
  9. MapR is also working with companies like Audi and Daimler for their connected car and self driving use cases. They perform long test runs on new cars that collect a huge amount of data from sensors. After a 24-hour test run, they swap out disks from the car with a new set of disks, and then are able to analyze the data taken from the car. This forces delays of up to 24 hours since there is no reasonable connectivity from the car to the home base during test runs. With MapR Edge in the trunk, they can monitor the car during the test run and respond immediately should an error occur. But more importantly, they can use the cluster to identify the most important data, which pertains to events surrounding a driving exception. These exceptions can be “critical interventions” for self-driving cars, in which the human driver had to take control of the car due to some unforeseeable condition.
  10. That leads us to a reference architecture slide. If you take mining as an example : In mining, self-driving vehicles, including mine cars and ore trucks, are helping to streamline operations and reduce costs. Using sensors to monitor the health of machinery in use, companies can shift to a condition-based maintenance model (maintaining equipment when there is an actual need through predictive analytics) rather than relying on a regular maintenance schedule or repairing equipment only when it breaks down. On the far left, We have small edge cluster on a worksite or manufacturing site ingesting time series data coming from drills, valves , and trucks. Once the raw or aggregated data is transferred to the central data platform, it is typically enriched with the work orders, alarms, meta data etc coming from your traditional SAP systems or asset management systems. This central data lake can be used to provide various dashobards to mining and process engineers to not only visualize, explore data coming from various work sites, but also ability to correlate it. It also becomes a central data lake for your analyts and data science team to work on building predictive models, which can then be deployed on the edge.