SlideShare a Scribd company logo
© 2017 MapR TechnologiesMapR Confidential 1
State of the Art Robot Predictive
Maintenance with Real-time
Sensor Data
Mateusz Dymczyk, Software Engineer @ h2o.ai
Mathieu Dumoulin, Data Engineer @ MapR
DataWorks Summit Sydney 2017
© 2017 MapR Technologies 2
Mathieu Dumoulin
• Data Engineer, MapR
Technologies
• From Montreal, Canada
• Living in Tokyo, Japan for
last 3 years
• Previous experience as
data scientist, search
engineer and data
engineer.
© 2017 MapR Technologies
Industry 4.0 is Now
source: PwC 2016 Global Industry 4.0 Survey
• End-to-end factory digitization
• 907B$/y investment until 2020
• Japan and Germany early
leaders
• Smart sensors everywhere
• Refine products based on
advanced data analysis
© 2017 MapR Technologies 4
Predictive Maintenance: Lower Cost, Greater Efficiency
• Less unplanned downtime
• Just-in-time order of
rare/expensive parts
• Less inspection downtime
• Increased operational
efficiency
© 2017 MapR Technologies 5
Requirements for a Predictive Maintenance System
Create Business Value!
• Limited time, budget and people
• Don’t impact normal operation
• Ok to miss failure sometimes
• “Perfect” is the enemy of the
“good enough”
• Improve the model over time
© 2017 MapR Technologies 6
Requirement: Know Robot State in 3 sec or less
© 2017 MapR Technologies 7
Requirement: Scale to 100’s of Robots
Tesla Factory photo by Paul Sakuma/AP
© 2017 MapR Technologies 8
Requirement: Low R&D Cost - Use Standard OSS
© 2017 MapR Technologies 9
We Made It!
• 4 Engineers, 2 Months (part-time)
• Real-time Anomaly Detection
• Scalable by default
• Standard Big Data software
• Minimized custom code
• Streaming Architecture
• Prediction Microservice
© 2017 MapR Technologies 10
Video of solution in action 2m
© 2017 MapR Technologies 11
How We Made it
• A clear goal: Real-time anomaly detection using data from
sensor on a robot
• Small team of 4 Engineers
• Existing hardware
• A Robot
– Wireless sensor LPMS-B2
– Augmented Reality (AR) headset
• Existing expertise
– Machine Learning
– Enterprise big data
– Sensor and AR hardware
• No additional dependencies
© 2017 MapR Technologies 12
Data Source: LPMS-B2 Measurement Unit
• Bluetooth wireless
• Lots of sensors:
– 3-axis gyroscope
– 3-axis accelerometer
– 3-axis magnetometer
– temperature, pressure and humidity
• Very noisy data
© 2017 MapR Technologies 13
Demo Pipeline
© 2017 MapR Technologies 14
Demo Pipeline – Normal State
© 2017 MapR Technologies 15
Demo Pipeline – Anomaly State
© 2017 MapR Technologies 16
Machine Learning
© 2017 MapR TechnologiesMapR Confidential 17
Machine Learning Project Flow
Explore and
Analyze
Choose
Algorithm
Build
Model
Evaluate
Model
Put into
production
Problem
evaluation &
definition
Data
preparation
© 2017 MapR Technologies 18
Problem definition
1. Problem:
1. Automatically deduce data patterns
describing the normal state of a machine
2. Create a model classifying machine’s current
state (normal/pre-failure)
2. Machine Learning goal/metrics:
1. Detect abnormal events > 90% accuracy
2. Avoid false positives
3. Decide output
Normal State (OK!)
PREDICT FAILURE
© 2017 MapR Technologies 19
Data preparation
1. Choose (after consultation with hardware engineers)
only linear acceleration data (X, Y, Z)
2. “Window” (concatenate into single record) data from
200ms worth of records
200ms window
Ref: 21 Great Articles and Tutorials on Time Series
© 2017 MapR Technologies 20
Algorithm selection
• Unlabeled data -> unsupervised learning
• Training data consists only of data
during “normal state” runs
– Only train on normal op. data
• Conclusion: anomaly detection
• Possible algorithms:
• Simple auto encoders
• LSTM auto encoders
• KNN, Local Outlier Factor Anomaly Detection
Get Ted Dunning’s Anomaly Dectection Book
Anomaly!
© 2017 MapR Technologies 21
ML – Anomaly Detection
• Unsupervised
• Anomaly detection
• H2O uses autoencoder
algorithm (deep learning)
• H2O’s R API for modeling
• Very productive API
• Good graphs
• Parameter tuning of
models
• See H2O’s training-book on GitHub
© 2017 MapR TechnologiesMapR Confidential
Anomaly detection with Autoencoders
• 1
• 2
• 3
• 4
• 5
© 2017 MapR Technologies 23
Tooling
© 2017 MapR Technologies 24
Training the Model
© 2017 MapR Technologies 25
ML – Results
Note: Time window: 200ms, Threshold: 1SD
© 2017 MapR Technologies 26
H2O - production
1. H2O REST endpoint
1. Out-of-the-box after modeling
2. Great for testing
3. Slow for production
2. Deployable WAR
1. Easy to deploy if an application server is running
2. Requires H2O.ai’s Steam
3. POJO/MOJO
1. Pure Java classes/serialized objects
2. Fast
3. Very easy to use
© 2017 MapR Technologies 27
Deploy to Production:
How to Make Real-time Predictions
© 2017 MapR Technologies 28
Real-time Predictions
© 2017 MapR TechnologiesMapR Confidential 29
Real-time Predictions - Multiple Outputs
© 2017 MapR TechnologiesMapR Confidential 30
Real-time Predictions – Scale Up
© 2017 MapR Technologies 31
Conclusion: You Can Do it Too!
© 2017 MapR Technologies 32
• OSS enterprise big data software is
much better than custom, closed
source systems for next gen AI
applications
• Don’t get stuck on Machine Learning
complexity
• Converged Platforms reduce
complexity (MapR)
Predictive Maintenance is Real and it’s Now
Poster by J. Howard Miller (1943)
© 2017 MapR Technologies 33
Q&A
ENGAGE WITH US
mateusz@h2o.ai
mathieu.dumoulin@mapr.com
PROJECT GITHUB:
github.com/mdymczyk/iot-pipeline
Our thanks to:
LP RESEARCH
www.lp-research.com
contact: Klaus Peterson
klaus@lp-research.com
© 2017 MapR Technologies 34
Thank you to LP-RESEARCH!
Hardware design and production
Expertise in Motion sensors
Gyroscope
Accelerometer
Magnetometer
Sensor fusion algorithm
development
Multi-platform application
development
See all our products: https://www.lp-research.com/products/
LPMS-B2 LPMS-CU2 LPMS-CANAL2 LPMS-USBAL2OEM also
available!

More Related Content

What's hot

Innovations for Asset Integrity Management
Innovations for Asset Integrity ManagementInnovations for Asset Integrity Management
Innovations for Asset Integrity ManagementSGS
 
Is Reliability Centered Maintenance (RCM) right for you?
Is Reliability Centered Maintenance (RCM) right for you?Is Reliability Centered Maintenance (RCM) right for you?
Is Reliability Centered Maintenance (RCM) right for you?Nancy Regan
 
Predictive Maintenance in the Industrial Internet of Things
Predictive Maintenance in the Industrial Internet of ThingsPredictive Maintenance in the Industrial Internet of Things
Predictive Maintenance in the Industrial Internet of ThingsTibbo
 
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and SimulinkApplying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and SimulinkGerardo Pardo-Castellote
 
Generic Vehicle Architecture – DDS at the Core.
Generic Vehicle Architecture – DDS at the Core.Generic Vehicle Architecture – DDS at the Core.
Generic Vehicle Architecture – DDS at the Core.Real-Time Innovations (RTI)
 
Predictive Maintenance
Predictive MaintenancePredictive Maintenance
Predictive MaintenanceSaama
 
Machine Learning & Predictive Maintenance
Machine Learning &  Predictive MaintenanceMachine Learning &  Predictive Maintenance
Machine Learning & Predictive MaintenanceArnab Biswas
 
How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...
How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...
How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...InfluxData
 
Predictive Maintenance
Predictive MaintenancePredictive Maintenance
Predictive Maintenancefljungbe
 
Unit 9 implementing the reliability strategy
Unit 9  implementing the reliability strategyUnit 9  implementing the reliability strategy
Unit 9 implementing the reliability strategyCharlton Inao
 
Maintenance module1 ppt number 3
Maintenance module1 ppt number 3Maintenance module1 ppt number 3
Maintenance module1 ppt number 3Dhanesh S
 
Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...
Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...
Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...Thorne & Derrick International
 
Predictive Maintenance Systems, Technologies & Equipment Management Softwares...
Predictive Maintenance Systems, Technologies & Equipment Management Softwares...Predictive Maintenance Systems, Technologies & Equipment Management Softwares...
Predictive Maintenance Systems, Technologies & Equipment Management Softwares...proudmehta
 
Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...
Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...
Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...crlima10
 

What's hot (20)

Innovations for Asset Integrity Management
Innovations for Asset Integrity ManagementInnovations for Asset Integrity Management
Innovations for Asset Integrity Management
 
Is Reliability Centered Maintenance (RCM) right for you?
Is Reliability Centered Maintenance (RCM) right for you?Is Reliability Centered Maintenance (RCM) right for you?
Is Reliability Centered Maintenance (RCM) right for you?
 
Predictive Maintenance in the Industrial Internet of Things
Predictive Maintenance in the Industrial Internet of ThingsPredictive Maintenance in the Industrial Internet of Things
Predictive Maintenance in the Industrial Internet of Things
 
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and SimulinkApplying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
 
Generic Vehicle Architecture – DDS at the Core.
Generic Vehicle Architecture – DDS at the Core.Generic Vehicle Architecture – DDS at the Core.
Generic Vehicle Architecture – DDS at the Core.
 
Predictive Maintenance
Predictive MaintenancePredictive Maintenance
Predictive Maintenance
 
Machine Learning & Predictive Maintenance
Machine Learning &  Predictive MaintenanceMachine Learning &  Predictive Maintenance
Machine Learning & Predictive Maintenance
 
How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...
How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...
How to Digitize Industrial Manufacturing with Azure IoT Edge, InfluxDB, and M...
 
Cost & Reliability
Cost & ReliabilityCost & Reliability
Cost & Reliability
 
Cmms
CmmsCmms
Cmms
 
DCIM
DCIMDCIM
DCIM
 
Predictive Maintenance
Predictive MaintenancePredictive Maintenance
Predictive Maintenance
 
Hima cyber security
Hima cyber securityHima cyber security
Hima cyber security
 
Unit 9 implementing the reliability strategy
Unit 9  implementing the reliability strategyUnit 9  implementing the reliability strategy
Unit 9 implementing the reliability strategy
 
Maintenance module1 ppt number 3
Maintenance module1 ppt number 3Maintenance module1 ppt number 3
Maintenance module1 ppt number 3
 
Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...
Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...
Elastimold - HV High Voltage Joints, Terminations, Loadbreak & Deadbreak Conn...
 
Predictive Maintenance Systems, Technologies & Equipment Management Softwares...
Predictive Maintenance Systems, Technologies & Equipment Management Softwares...Predictive Maintenance Systems, Technologies & Equipment Management Softwares...
Predictive Maintenance Systems, Technologies & Equipment Management Softwares...
 
Yokogawa DPharp EJX and EJA
Yokogawa DPharp EJX and EJAYokogawa DPharp EJX and EJA
Yokogawa DPharp EJX and EJA
 
Preventive maintenance
Preventive maintenance Preventive maintenance
Preventive maintenance
 
Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...
Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...
Energy IIoT - Industrial Internet of Things (IIoT) in Decentralized Digital O...
 

Similar to Real-Time Robot Predictive Maintenance in Action

State of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor DataState of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor DataMathieu Dumoulin
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksJustin Brandenburg
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Technologies
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningTed Dunning
 
MapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn GloballyMapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn Globallyridhav
 
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...Mathieu Dumoulin
 
MapR and Machine Learning Primer
MapR and Machine Learning PrimerMapR and Machine Learning Primer
MapR and Machine Learning PrimerMathieu Dumoulin
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsMapR Technologies
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMapR Technologies
 
Map r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupMap r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupAlan Iovine
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016Mathieu Dumoulin
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Carol McDonald
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...Carol McDonald
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleIan Downard
 
Progress for big data in Kubernetes
Progress for big data in KubernetesProgress for big data in Kubernetes
Progress for big data in KubernetesTed Dunning
 
Machine Learning Logistics
Machine Learning LogisticsMachine Learning Logistics
Machine Learning LogisticsTed Dunning
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...DataWorks Summit
 
Machine Learning logistics
Machine Learning logisticsMachine Learning logistics
Machine Learning logisticsTed Dunning
 
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...The Hive
 

Similar to Real-Time Robot Predictive Maintenance in Action (20)

State of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor DataState of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural Networks
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
 
MapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn GloballyMapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn Globally
 
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
 
MapR and Machine Learning Primer
MapR and Machine Learning PrimerMapR and Machine Learning Primer
MapR and Machine Learning Primer
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
Map r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupMap r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetup
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
 
Progress for big data in Kubernetes
Progress for big data in KubernetesProgress for big data in Kubernetes
Progress for big data in Kubernetes
 
Machine Learning Logistics
Machine Learning LogisticsMachine Learning Logistics
Machine Learning Logistics
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
 
Machine Learning logistics
Machine Learning logisticsMachine Learning logistics
Machine Learning logistics
 
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
 

More from DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...Product School
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...Elena Simperl
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...Sri Ambati
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaRTTS
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxAbida Shariff
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsExpeed Software
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Product School
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»QADay
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesThousandEyes
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backElena Simperl
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
 

Recently uploaded (20)

Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 

Real-Time Robot Predictive Maintenance in Action

  • 1. © 2017 MapR TechnologiesMapR Confidential 1 State of the Art Robot Predictive Maintenance with Real-time Sensor Data Mateusz Dymczyk, Software Engineer @ h2o.ai Mathieu Dumoulin, Data Engineer @ MapR DataWorks Summit Sydney 2017
  • 2. © 2017 MapR Technologies 2 Mathieu Dumoulin • Data Engineer, MapR Technologies • From Montreal, Canada • Living in Tokyo, Japan for last 3 years • Previous experience as data scientist, search engineer and data engineer.
  • 3. © 2017 MapR Technologies Industry 4.0 is Now source: PwC 2016 Global Industry 4.0 Survey • End-to-end factory digitization • 907B$/y investment until 2020 • Japan and Germany early leaders • Smart sensors everywhere • Refine products based on advanced data analysis
  • 4. © 2017 MapR Technologies 4 Predictive Maintenance: Lower Cost, Greater Efficiency • Less unplanned downtime • Just-in-time order of rare/expensive parts • Less inspection downtime • Increased operational efficiency
  • 5. © 2017 MapR Technologies 5 Requirements for a Predictive Maintenance System Create Business Value! • Limited time, budget and people • Don’t impact normal operation • Ok to miss failure sometimes • “Perfect” is the enemy of the “good enough” • Improve the model over time
  • 6. © 2017 MapR Technologies 6 Requirement: Know Robot State in 3 sec or less
  • 7. © 2017 MapR Technologies 7 Requirement: Scale to 100’s of Robots Tesla Factory photo by Paul Sakuma/AP
  • 8. © 2017 MapR Technologies 8 Requirement: Low R&D Cost - Use Standard OSS
  • 9. © 2017 MapR Technologies 9 We Made It! • 4 Engineers, 2 Months (part-time) • Real-time Anomaly Detection • Scalable by default • Standard Big Data software • Minimized custom code • Streaming Architecture • Prediction Microservice
  • 10. © 2017 MapR Technologies 10 Video of solution in action 2m
  • 11. © 2017 MapR Technologies 11 How We Made it • A clear goal: Real-time anomaly detection using data from sensor on a robot • Small team of 4 Engineers • Existing hardware • A Robot – Wireless sensor LPMS-B2 – Augmented Reality (AR) headset • Existing expertise – Machine Learning – Enterprise big data – Sensor and AR hardware • No additional dependencies
  • 12. © 2017 MapR Technologies 12 Data Source: LPMS-B2 Measurement Unit • Bluetooth wireless • Lots of sensors: – 3-axis gyroscope – 3-axis accelerometer – 3-axis magnetometer – temperature, pressure and humidity • Very noisy data
  • 13. © 2017 MapR Technologies 13 Demo Pipeline
  • 14. © 2017 MapR Technologies 14 Demo Pipeline – Normal State
  • 15. © 2017 MapR Technologies 15 Demo Pipeline – Anomaly State
  • 16. © 2017 MapR Technologies 16 Machine Learning
  • 17. © 2017 MapR TechnologiesMapR Confidential 17 Machine Learning Project Flow Explore and Analyze Choose Algorithm Build Model Evaluate Model Put into production Problem evaluation & definition Data preparation
  • 18. © 2017 MapR Technologies 18 Problem definition 1. Problem: 1. Automatically deduce data patterns describing the normal state of a machine 2. Create a model classifying machine’s current state (normal/pre-failure) 2. Machine Learning goal/metrics: 1. Detect abnormal events > 90% accuracy 2. Avoid false positives 3. Decide output Normal State (OK!) PREDICT FAILURE
  • 19. © 2017 MapR Technologies 19 Data preparation 1. Choose (after consultation with hardware engineers) only linear acceleration data (X, Y, Z) 2. “Window” (concatenate into single record) data from 200ms worth of records 200ms window Ref: 21 Great Articles and Tutorials on Time Series
  • 20. © 2017 MapR Technologies 20 Algorithm selection • Unlabeled data -> unsupervised learning • Training data consists only of data during “normal state” runs – Only train on normal op. data • Conclusion: anomaly detection • Possible algorithms: • Simple auto encoders • LSTM auto encoders • KNN, Local Outlier Factor Anomaly Detection Get Ted Dunning’s Anomaly Dectection Book Anomaly!
  • 21. © 2017 MapR Technologies 21 ML – Anomaly Detection • Unsupervised • Anomaly detection • H2O uses autoencoder algorithm (deep learning) • H2O’s R API for modeling • Very productive API • Good graphs • Parameter tuning of models • See H2O’s training-book on GitHub
  • 22. © 2017 MapR TechnologiesMapR Confidential Anomaly detection with Autoencoders • 1 • 2 • 3 • 4 • 5
  • 23. © 2017 MapR Technologies 23 Tooling
  • 24. © 2017 MapR Technologies 24 Training the Model
  • 25. © 2017 MapR Technologies 25 ML – Results Note: Time window: 200ms, Threshold: 1SD
  • 26. © 2017 MapR Technologies 26 H2O - production 1. H2O REST endpoint 1. Out-of-the-box after modeling 2. Great for testing 3. Slow for production 2. Deployable WAR 1. Easy to deploy if an application server is running 2. Requires H2O.ai’s Steam 3. POJO/MOJO 1. Pure Java classes/serialized objects 2. Fast 3. Very easy to use
  • 27. © 2017 MapR Technologies 27 Deploy to Production: How to Make Real-time Predictions
  • 28. © 2017 MapR Technologies 28 Real-time Predictions
  • 29. © 2017 MapR TechnologiesMapR Confidential 29 Real-time Predictions - Multiple Outputs
  • 30. © 2017 MapR TechnologiesMapR Confidential 30 Real-time Predictions – Scale Up
  • 31. © 2017 MapR Technologies 31 Conclusion: You Can Do it Too!
  • 32. © 2017 MapR Technologies 32 • OSS enterprise big data software is much better than custom, closed source systems for next gen AI applications • Don’t get stuck on Machine Learning complexity • Converged Platforms reduce complexity (MapR) Predictive Maintenance is Real and it’s Now Poster by J. Howard Miller (1943)
  • 33. © 2017 MapR Technologies 33 Q&A ENGAGE WITH US mateusz@h2o.ai mathieu.dumoulin@mapr.com PROJECT GITHUB: github.com/mdymczyk/iot-pipeline Our thanks to: LP RESEARCH www.lp-research.com contact: Klaus Peterson klaus@lp-research.com
  • 34. © 2017 MapR Technologies 34 Thank you to LP-RESEARCH! Hardware design and production Expertise in Motion sensors Gyroscope Accelerometer Magnetometer Sensor fusion algorithm development Multi-platform application development See all our products: https://www.lp-research.com/products/ LPMS-B2 LPMS-CU2 LPMS-CANAL2 LPMS-USBAL2OEM also available!

Editor's Notes

  1. Industry 4.0 is all about digitization of the factory. Sensors everywhere. All this data makes possible new opportunities for automation, cost savings, higher productivity and higher quality. Our talk will focus on Data & Analytics for improving the efficiency of operations of factories with lots of industrial robots. We combine Smart sensors, DB Analytics (ML), Cloud computing and AR to power a real-world, state of the art predictive analytics system.
  2. Predictive Maintenance generates value from the following
  3. Requirements for such a system start with a clear view of business value before any work is done. Need to have an image of the impact of a successful system on the business.
  4. Based on known real-world requirement of state of the art Japanese car-parts manufacturers.
  5. Scale with number of sensors, robots and factories. GB a day quickly become many GB per hour or even minutes. This is comfortably on moderate sized clusters (5-25 nodes) using current big data platforms used by attendees of Strata.
  6. Standard big data OSS has come a long way over the past 5 years and is now at a point where a state of the art project like this can be constructed by judicious assembly of projects: Distributed storage Distributed streaming Distributed stream processing Distributed machine learning
  7. ありがとうございました もう少し、デモについて詳しく説明します。 ロボットに動作検知をするためのセンサーがついています。 センサーはマシンの振動(しんどう)やノイズを検知しています。 このデータは Raspberry Pi に無線で送信されます。 Raspberry Pi がデータを収集して、MapRに送信します。 アナリティカルパートでモデルを作ります。 オペレーショナルなパートで作成したモデルで、robotの状態を可視化システムでoperatorがreal-timeで監視できます。 これは先ほどお話しした通りですね。
  8. 異常がない場合、ご覧いただいた通り緑のマークが表示されます。
  9. 異常がある場合は、赤いマークが表示されます。 その後、フレンドリーなジャーマニーエンジニアを呼ぶ必要があるとわかります。
  10. What do we even want?! I.E.: Data gathering Feature selection, extraction, engineering and data transformation 3) Pick all potential algorithms 4) Build a model using your library/tool of choice 5) Evaluate according to previously defined metrics 6) If not good enough then either try a different approach, features or method parameters 7) Otherwise extract the model and put it into production!
  11. Mention why we are doing it with machine learning at all! No rules, automatically learn the best parameters for each application without new coding and not based on supervised techniques. Especially good when we don’t know what we are looking for: machines can break in a variety of ways.
  12. Mention why we are doing it with machine learning at all! No rules, automatically learn the best parameters for each application without new coding and not based on supervised techniques. Especially good when we don’t know what we are looking for: machines can break in a variety of ways.
  13. Mention why we are doing it with machine learning at all! No rules, automatically learn the best parameters for each application without new coding and not based on supervised techniques. Especially good when we don’t know what we are looking for: machines can break in a variety of ways. Peeking: ML modeling mistake where some data is used to train a model includes information about the answer
  14. 教師なし学習 => unsupervised learning 異常認識 => anomaly detection The real data is very noisy Why use ML at all? We don’t want to use rules for every type of robot and every situation Don’t mention threshold, just say we did some parameter tuning of the ML algorithms or something /
  15. Keep?
  16. Mention why we are doing it with machine learning at all! No rules, automatically learn the best parameters for each application without new coding and not based on supervised techniques. Especially good when we don’t know what we are looking for: machines can break in a variety of ways.
  17. 閾値 (shiki-ichi) => threshold 標準偏差 (hyoujun-hensa) => SD
  18. Mention why we are doing it with machine learning at all! No rules, automatically learn the best parameters for each application without new coding and not based on supervised techniques. Especially good when we don’t know what we are looking for: machines can break in a variety of ways.
  19. Amazing evolution in last few years Lots of learning material, Experts are out there A minimal working model is easy to make (H2O) Get more value: Improve it over time Contract with experts as needed Only one cluster, little configuration, ”it just works”