SlideShare a Scribd company logo
1 of 17
Hortonworks Industrial Data Platform
IIoT & Predictive Analytics in the Energy
Industry
Kenneth Smith – General Manager, Energy
@KennethSmith99
2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Internet of Things Data Sources
 User Generated Content (Web & Mobile)
– Twitter, Facebook, Snapchat, YouTube
– Clickstream, Ads, User Engagement
– Payments: Paypal, Venmo
 Internet of Anything (IoAT)
– Wind Turbines, Oil Rigs, Cars
– Weather Stations, Smart Grids
– RFID Tags, Beacons, Wearables
What generates data?
3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Industrial IoT Market Opportunity Estimates
“In other words, the industrial internet will be worth more than twice the consumer internet”
https://www.forbes.com/sites/louiscolumbus/2016/11/27/roundup-of-
internet-of-things-forecasts-and-market-estimates-2016/#ad68a67292d5
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Why Hortonworks for IIoT??
 Technology to deliver the only end-to-end OPEN SOURCE IoT data platform for “industrials”.
 It’s not just about time-series data; it’s the ability to collect, manage, and analyze all
pertinent structured & unstructured data sets related to an industrial asset, operation,
process, piece of equipment, etc. in in addition to time-series.
 Open Connected Data Platforms enables OT/IT/ET convergence to build descriptive,
predictive, & prescriptive applications.
 An open source IIoT platforms allow operators to maintain control over their data and
analytics vs. a ”closed” OEM’s IIoT product telling them when their own equipment needs
replacing.
 An open IIoT platform is applicable across all asset intensive industries with “moving metal”;
oil & gas, utilities, mining, manufacturing, automotive, transportation, agriculture, etc.
 “Data is not a competitive advantage. It’s the algorithms you build to analyze your data
that will differentiate you from your competitors.”
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Is the Energy Industry Ready to Embrace an Open Model?
http://www.lockheedmartin.com/us/news/press-releases/2016/january/160114-mst-us-exxonmobil-awards-
lockheed-martin-next-generation-refining-and-chemical-facility-automation-system-contract.html
ExxonMobil representatives
express frustration when
observing step change
improvements in adjacent
industries enabled by open
technologies. Those adjacent
industries have deployed
significantly higher function
software that have lowered
lifecycle cost and delivered
higher return on investment.
The explosive growth of technologies driven by the Internet of Things (IoT) including
cloud computing, mobile computing, embedded computing, and consumer electronics
makes it obvious that the mainstream industrial automation industry can deliver
more value with the adoption of an open, multi-vendor platform approach.
http://www.automation.com/automati
on-news/article/exxonmobil-to-build-
next-generation-multi-vendor-
automation-architecture
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Industrial Processes Create Large Amounts of Data
Always on, always connected devices generate a
constant stream of data related to the operations of
industrial businesses
These datasets contain:
• What happened?
• Why something happened or not?
• Quantification of events
These datasets go by many names:
• “SCADA Data”
• “Control System Data”
• “Historian Data”
• “Machine Data”
• “Measurement Logs”
How are my …
People?
Processes?
Equipment?
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Why Collect this Data with Hadoop?
• Scale and Flexibility
– Linear & low cost scale out of storage &
processing
– Bring compute to data
– Multi tenant environment that allows multiple
modes of simultaneous ingestion & interaction
• Increased value of data
– Reduce the friction of data access “everything is
accessed in one place”
– Simplified or new analytic applications
• Democratize data
– Single point of access
– Simplified access & security controls
DATASYSTEMSOURCES
SCADA ERP EPM
Governance
&Integration
Security
Operations
Data Access
Data Management
APPLICATIONS
Business
Analytics
Advanced Process
Control
Operations
Planning Suites
AG. Image source "© Siemens AG 2015, All rights reserved"
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
My Industrial Dataset is in Hadoop! Now what?
Uses for Industrial Datasets
• Condition Based Monitoring
• Single View of an Asset
• Dashboards & Mobile Applications
• Statistics & Predictive Analytics
• Event Based Surveillance
• Remote Operations Support
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Field Data Capture Office or Datacenter
Hortonworks Industrial Data Analytics Platform – In Practice
OPC UA/DA, WITSML
Video, Audio
Commodity Market
Weather, Environmental
Social Media
IoT, Machine Data, Historians
Central HDP Cluster
Hive
Central HDF Cluster
NiFi
Kafka
Storm
Streaming
Options
HBase Solr
YARN
HDFS
Location 1
NiFi
Location n
NiFi
Data Center
Data Ingestion Framework
End users
DATA IN MOTION – H DF DATA AT R EST – H DP
HDF Edge (MiNiFi + NiFi)
 Reliable collection
 Small footprint
 Edge processing
 Data provenance
 Integrates with core
policies
HDF Core (NiFi with Streaming)
 Processing at larger scale
 Distributed stream processing
HDP
 Security and data governance
 Monitoring, management, operations
 Applications
 Analytics
Structured / Unstructured Data Sets
10 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hortonworks Connected Data Platforms in Energy & Utilities
Source: https://www.cm-collaborative-tech.com/wp-content/uploads/2016/11/Smart-grid-A-1.jpg
Predictive MaintenanceFraud DetectionExternal Sources
(Weather, Social
Media, GPS, etc.)
Single View of Customer
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Time Series Analytics for Power Generation Anomaly Detection
 Two week engagement – no direct knowledge of existing systems
 Two days were able to isolate problem down from 5000 potential
causes to 19 using standard data science algorithms
 Company investigated findings and found a valve was installed
backwards causing plant to shutdown
 Plant failure hasn’t occurred since, saving millions of dollars in
unplanned shutdowns
 VP of Engineering – “I never thought we would see a solution like
this”
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Vertically Integrated Utility’s Data Journey
Accelerating Revenue Protection with an Open Analytics Platform
 One of the largest electric power holding companies in the US that supplies electricity to approximately 7.4 million
customers and operates natural gas distribution services serving more than 1.5 million customers.
 Revenue Protection Use Case: Protect revenue from theft, malfunctioning meters, and misconfigured meters.
 Why HDP: The only cost effective platform able to do parallel / multi-node analytics on large data sets.
 Currently have loaded 200 Billion rows of meter data across 80 nodes of HDP growing to 1.4 Trillion by 2020 from all of
their service areas.
 Previous energy theft data science process: Predictive model was run on a laptop 1x per week for 10K accounts at a time
and produced 100 leads weekly for investigation. At that rate, it would have taken them 6 months to process one state’s
data (all states/enterprise data would take much longer)
 Current process: Leveraging HDP the run time to analyze one state’s data has been reduced from 6 months to less than
an hour, producing theft leads from the entire data set in minutes.
 Expected realized business value from the Revenue Protection use-case to be tens of millions of dollars by 2020.
 Other use-case include predictive equipment maintenance on a time-series data ingested from OSIsoft Pi and a “Next
Best Action” program for cross-selling opportunities on goods and services.
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Utility Big Data Journey – Crawl, Walk, Run
Arrears & Credit
Collections*
•Better identify customers
prior to going into
Arrears to start payment
plans
Revenue Protection*
•More quickly identify theft,
malfunctioning meters and
misconfigured meters across
entire customer base with HDP
•Estimated business value –
millions of dollars in previously
unrecognized revenue
360 Degree View of
Customer*
•Aggregate customer data
across enterprise: usage,
billing, profile info,
surveys, call center logs,
order history, social media
sentiment, etc.
•Develop customer
segmentation models &
KPI’s to improve customer
service, reduce call center
volumes/times, feed next
best action programs, etc.
Predictive
Maintenance*
•Ingest time-series data
from control systems
and previous
maintenance records to
identify patterns in
malfunctioning
equipment
•Shift from time-based
maintenance to
condition-based to
prioritize and optimize
maintenance resources
and operations.
Outage Detection
& Prevention*
•Identify outages in
real-time, notify
customers of outages
and reduce time to
resolution
•Better forecasting
models = lower service
costs, reduced truck
rolls, increased
revenue, and higher
customer satisfaction
• Start Small, Think Big
• Improve Top-line and bottom-line revenue
• Develop In-house talent
*AMI data is the foundation for both T&D and customer-focused use-cases.
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Manufacturing Data Lake for Global Operations
Capabilities
• Capture new and breakdown existing
operational data silos
• Democratize data access to a wider audience
• Flexible architecture to incorporate the latest
Apache open source/3rd party/customer
innovations
• Foster community
“Not an ops historian but a enterprise
historian of ALL PROCESS DATA”
Design
• Embedded analytics and visualizations
• Embedded open source graphical data ingest
• Proven at scale – 1M tags / minute
• Comply with existing security, governance and
operations
• Built for extension
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Develop: Using HDP and HDF for Industrial IOT
Hortonworks Customer
• Schlumberger Drilling Technology
Application Area
• Real Time Drilling Data Delivery - WITSML
A Few Requirements
• Provenance – knowing where the data came
from is crucial (and often missing) to real time
decision making especially when dealing with
3,000 wells per month
• Visualization – the ability to visualize the data
flow at a granular level aids in troubleshooting
and operational understanding
• Reduced overhead leveraging NiFi vs. previously
built custom-coded solution
http://www.slideshare.net/HadoopSummit/from-zero-to-data-flow-in-hours-with-apache-nifi-64032731
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Real-time Remote Surveillance
Requirement – A New Business Model:
• Fluid and flexible data platforms that can quickly integrate raw data
and deliver actionable intelligence to people and processes
• Ability to operate when network connectivity with a data center or
the shore is intermittent, latent and provide minimal bandwidth
• Analysis of large volumes of data and avoid data being stranded and
out of reach for analysts and support teams.
• Move from an operations posture of reacting and suffering from
unnecessary downtime, equipment failures, efficiency losses, and
safety risks
• Increases the collective expertise available to support safer and
more efficient operations
Solution and Outcomes – New Sources of Value:
• HDF aggregates, prioritizes, compresses and encrypts control system
data before sending it over a 64 KB/sec satellite link to the data
center in real-time
• Data from top drives, BOPs and other equipment is in HDP and every
data consumer from data scientist to BI users can be serviced from
their tool of choice
• Key data consumption patterns enabled include KPI dashboards,
condition-based monitoring and maintenance, event-based
surveillance, and traditional BI reporting; ensuring safer more
efficient operations
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Open source is a way to enable a group of
collaborative people to further their
individual interests while contributing back to
the community for the common good.
Open source

More Related Content

What's hot

Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks
 
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersDataWorks Summit
 
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power SystemsDelivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power SystemsHortonworks
 
Powering Big Data Success On-Prem and in the Cloud
Powering Big Data Success On-Prem and in the CloudPowering Big Data Success On-Prem and in the Cloud
Powering Big Data Success On-Prem and in the CloudHortonworks
 
Intro to Spark with Zeppelin
Intro to Spark with ZeppelinIntro to Spark with Zeppelin
Intro to Spark with ZeppelinHortonworks
 
Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Hortonworks
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash CourseDataWorks Summit
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash CourseDataWorks Summit
 
Edw Optimization Solution
Edw Optimization Solution Edw Optimization Solution
Edw Optimization Solution Hortonworks
 
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFiIntelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFiDataWorks Summit
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsHortonworks
 
Design a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFDesign a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFHortonworks
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseHortonworks
 
What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?DataWorks Summit
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionDataWorks Summit/Hadoop Summit
 

What's hot (20)

Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
 
Why is my Hadoop* job slow?
Why is my Hadoop* job slow?Why is my Hadoop* job slow?
Why is my Hadoop* job slow?
 
Apache Hadoop Crash Course
Apache Hadoop Crash CourseApache Hadoop Crash Course
Apache Hadoop Crash Course
 
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against Disasters
 
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power SystemsDelivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
 
Powering Big Data Success On-Prem and in the Cloud
Powering Big Data Success On-Prem and in the CloudPowering Big Data Success On-Prem and in the Cloud
Powering Big Data Success On-Prem and in the Cloud
 
Intro to Spark with Zeppelin
Intro to Spark with ZeppelinIntro to Spark with Zeppelin
Intro to Spark with Zeppelin
 
Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
Edw Optimization Solution
Edw Optimization Solution Edw Optimization Solution
Edw Optimization Solution
 
Modernise your EDW - Data Lake
Modernise your EDW - Data LakeModernise your EDW - Data Lake
Modernise your EDW - Data Lake
 
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFiIntelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
 
Design a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFDesign a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDF
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
 
What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?
 
Apache Atlas: Governance for your Data
Apache Atlas: Governance for your DataApache Atlas: Governance for your Data
Apache Atlas: Governance for your Data
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in Production
 

Similar to Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Analytics for the Energy Industry

IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...DataWorks Summit
 
Hortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data ScienceHortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data ScienceThiago Santiago
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationDenodo
 
Powering the Future of Data  
Powering the Future of Data	   Powering the Future of Data	   
Powering the Future of Data  Bilot
 
Achieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturingAchieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturingDataWorks Summit
 
Risk listening: monitoring for profitable growth
Risk listening: monitoring for profitable growthRisk listening: monitoring for profitable growth
Risk listening: monitoring for profitable growthDataWorks Summit
 
HP Communications and Media | Solutions IoT Platform
HP Communications and Media | Solutions IoT Platform HP Communications and Media | Solutions IoT Platform
HP Communications and Media | Solutions IoT Platform Norberto Enomoto
 
Achieving a 360-degree view of manufacturing via open source industrial data ...
Achieving a 360-degree view of manufacturing via open source industrial data ...Achieving a 360-degree view of manufacturing via open source industrial data ...
Achieving a 360-degree view of manufacturing via open source industrial data ...DataWorks Summit
 
Hortonworks - How Hadoop makes the successful Retailer.
Hortonworks - How Hadoop makes the successful Retailer. Hortonworks - How Hadoop makes the successful Retailer.
Hortonworks - How Hadoop makes the successful Retailer. Mats Johansson
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATAHortonworks
 
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...Hortonworks
 
Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product ManagersPentaho
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHortonworks
 
Hortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with HadoopHortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with HadoopMats Johansson
 
HP Iot platform and solution plans
HP Iot platform and solution plansHP Iot platform and solution plans
HP Iot platform and solution plansJeff Edlund
 
Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...
Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...
Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...DataWorks Summit
 
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...COIICV
 
The Platform for the Industrial Internet of Things (IIoT)
The Platform for the Industrial Internet of Things (IIoT)The Platform for the Industrial Internet of Things (IIoT)
The Platform for the Industrial Internet of Things (IIoT)Gerardo Pardo-Castellote
 
Crossing the performance chasm with open power - IBM
Crossing the performance chasm with open power - IBMCrossing the performance chasm with open power - IBM
Crossing the performance chasm with open power - IBMDiego Alberto Tamayo
 

Similar to Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Analytics for the Energy Industry (20)

IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
 
Hortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data ScienceHortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data Science
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital Transformation
 
Powering the Future of Data  
Powering the Future of Data	   Powering the Future of Data	   
Powering the Future of Data  
 
Achieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturingAchieving a 360 degree view of manufacturing
Achieving a 360 degree view of manufacturing
 
Risk listening: monitoring for profitable growth
Risk listening: monitoring for profitable growthRisk listening: monitoring for profitable growth
Risk listening: monitoring for profitable growth
 
HP Communications and Media | Solutions IoT Platform
HP Communications and Media | Solutions IoT Platform HP Communications and Media | Solutions IoT Platform
HP Communications and Media | Solutions IoT Platform
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Achieving a 360-degree view of manufacturing via open source industrial data ...
Achieving a 360-degree view of manufacturing via open source industrial data ...Achieving a 360-degree view of manufacturing via open source industrial data ...
Achieving a 360-degree view of manufacturing via open source industrial data ...
 
Hortonworks - How Hadoop makes the successful Retailer.
Hortonworks - How Hadoop makes the successful Retailer. Hortonworks - How Hadoop makes the successful Retailer.
Hortonworks - How Hadoop makes the successful Retailer.
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
 
Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product Managers
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
 
Hortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with HadoopHortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with Hadoop
 
HP Iot platform and solution plans
HP Iot platform and solution plansHP Iot platform and solution plans
HP Iot platform and solution plans
 
Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...
Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...
Connecting Home/Building, Life and Car..The Importance of Insurance Risk Moni...
 
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
 
The Platform for the Industrial Internet of Things (IIoT)
The Platform for the Industrial Internet of Things (IIoT)The Platform for the Industrial Internet of Things (IIoT)
The Platform for the Industrial Internet of Things (IIoT)
 
Crossing the performance chasm with open power - IBM
Crossing the performance chasm with open power - IBMCrossing the performance chasm with open power - IBM
Crossing the performance chasm with open power - IBM
 

More from DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformWSO2
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightSafe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governanceWSO2
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 

Recently uploaded (20)

Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 

Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Analytics for the Energy Industry

  • 1. Hortonworks Industrial Data Platform IIoT & Predictive Analytics in the Energy Industry Kenneth Smith – General Manager, Energy @KennethSmith99
  • 2. 2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Internet of Things Data Sources  User Generated Content (Web & Mobile) – Twitter, Facebook, Snapchat, YouTube – Clickstream, Ads, User Engagement – Payments: Paypal, Venmo  Internet of Anything (IoAT) – Wind Turbines, Oil Rigs, Cars – Weather Stations, Smart Grids – RFID Tags, Beacons, Wearables What generates data?
  • 3. 3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Industrial IoT Market Opportunity Estimates “In other words, the industrial internet will be worth more than twice the consumer internet” https://www.forbes.com/sites/louiscolumbus/2016/11/27/roundup-of- internet-of-things-forecasts-and-market-estimates-2016/#ad68a67292d5
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Why Hortonworks for IIoT??  Technology to deliver the only end-to-end OPEN SOURCE IoT data platform for “industrials”.  It’s not just about time-series data; it’s the ability to collect, manage, and analyze all pertinent structured & unstructured data sets related to an industrial asset, operation, process, piece of equipment, etc. in in addition to time-series.  Open Connected Data Platforms enables OT/IT/ET convergence to build descriptive, predictive, & prescriptive applications.  An open source IIoT platforms allow operators to maintain control over their data and analytics vs. a ”closed” OEM’s IIoT product telling them when their own equipment needs replacing.  An open IIoT platform is applicable across all asset intensive industries with “moving metal”; oil & gas, utilities, mining, manufacturing, automotive, transportation, agriculture, etc.  “Data is not a competitive advantage. It’s the algorithms you build to analyze your data that will differentiate you from your competitors.”
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Is the Energy Industry Ready to Embrace an Open Model? http://www.lockheedmartin.com/us/news/press-releases/2016/january/160114-mst-us-exxonmobil-awards- lockheed-martin-next-generation-refining-and-chemical-facility-automation-system-contract.html ExxonMobil representatives express frustration when observing step change improvements in adjacent industries enabled by open technologies. Those adjacent industries have deployed significantly higher function software that have lowered lifecycle cost and delivered higher return on investment. The explosive growth of technologies driven by the Internet of Things (IoT) including cloud computing, mobile computing, embedded computing, and consumer electronics makes it obvious that the mainstream industrial automation industry can deliver more value with the adoption of an open, multi-vendor platform approach. http://www.automation.com/automati on-news/article/exxonmobil-to-build- next-generation-multi-vendor- automation-architecture
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Industrial Processes Create Large Amounts of Data Always on, always connected devices generate a constant stream of data related to the operations of industrial businesses These datasets contain: • What happened? • Why something happened or not? • Quantification of events These datasets go by many names: • “SCADA Data” • “Control System Data” • “Historian Data” • “Machine Data” • “Measurement Logs” How are my … People? Processes? Equipment?
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Why Collect this Data with Hadoop? • Scale and Flexibility – Linear & low cost scale out of storage & processing – Bring compute to data – Multi tenant environment that allows multiple modes of simultaneous ingestion & interaction • Increased value of data – Reduce the friction of data access “everything is accessed in one place” – Simplified or new analytic applications • Democratize data – Single point of access – Simplified access & security controls DATASYSTEMSOURCES SCADA ERP EPM Governance &Integration Security Operations Data Access Data Management APPLICATIONS Business Analytics Advanced Process Control Operations Planning Suites AG. Image source "© Siemens AG 2015, All rights reserved"
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved My Industrial Dataset is in Hadoop! Now what? Uses for Industrial Datasets • Condition Based Monitoring • Single View of an Asset • Dashboards & Mobile Applications • Statistics & Predictive Analytics • Event Based Surveillance • Remote Operations Support
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Field Data Capture Office or Datacenter Hortonworks Industrial Data Analytics Platform – In Practice OPC UA/DA, WITSML Video, Audio Commodity Market Weather, Environmental Social Media IoT, Machine Data, Historians Central HDP Cluster Hive Central HDF Cluster NiFi Kafka Storm Streaming Options HBase Solr YARN HDFS Location 1 NiFi Location n NiFi Data Center Data Ingestion Framework End users DATA IN MOTION – H DF DATA AT R EST – H DP HDF Edge (MiNiFi + NiFi)  Reliable collection  Small footprint  Edge processing  Data provenance  Integrates with core policies HDF Core (NiFi with Streaming)  Processing at larger scale  Distributed stream processing HDP  Security and data governance  Monitoring, management, operations  Applications  Analytics Structured / Unstructured Data Sets
  • 10. 10 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hortonworks Connected Data Platforms in Energy & Utilities Source: https://www.cm-collaborative-tech.com/wp-content/uploads/2016/11/Smart-grid-A-1.jpg Predictive MaintenanceFraud DetectionExternal Sources (Weather, Social Media, GPS, etc.) Single View of Customer
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Time Series Analytics for Power Generation Anomaly Detection  Two week engagement – no direct knowledge of existing systems  Two days were able to isolate problem down from 5000 potential causes to 19 using standard data science algorithms  Company investigated findings and found a valve was installed backwards causing plant to shutdown  Plant failure hasn’t occurred since, saving millions of dollars in unplanned shutdowns  VP of Engineering – “I never thought we would see a solution like this”
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Vertically Integrated Utility’s Data Journey Accelerating Revenue Protection with an Open Analytics Platform  One of the largest electric power holding companies in the US that supplies electricity to approximately 7.4 million customers and operates natural gas distribution services serving more than 1.5 million customers.  Revenue Protection Use Case: Protect revenue from theft, malfunctioning meters, and misconfigured meters.  Why HDP: The only cost effective platform able to do parallel / multi-node analytics on large data sets.  Currently have loaded 200 Billion rows of meter data across 80 nodes of HDP growing to 1.4 Trillion by 2020 from all of their service areas.  Previous energy theft data science process: Predictive model was run on a laptop 1x per week for 10K accounts at a time and produced 100 leads weekly for investigation. At that rate, it would have taken them 6 months to process one state’s data (all states/enterprise data would take much longer)  Current process: Leveraging HDP the run time to analyze one state’s data has been reduced from 6 months to less than an hour, producing theft leads from the entire data set in minutes.  Expected realized business value from the Revenue Protection use-case to be tens of millions of dollars by 2020.  Other use-case include predictive equipment maintenance on a time-series data ingested from OSIsoft Pi and a “Next Best Action” program for cross-selling opportunities on goods and services.
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Utility Big Data Journey – Crawl, Walk, Run Arrears & Credit Collections* •Better identify customers prior to going into Arrears to start payment plans Revenue Protection* •More quickly identify theft, malfunctioning meters and misconfigured meters across entire customer base with HDP •Estimated business value – millions of dollars in previously unrecognized revenue 360 Degree View of Customer* •Aggregate customer data across enterprise: usage, billing, profile info, surveys, call center logs, order history, social media sentiment, etc. •Develop customer segmentation models & KPI’s to improve customer service, reduce call center volumes/times, feed next best action programs, etc. Predictive Maintenance* •Ingest time-series data from control systems and previous maintenance records to identify patterns in malfunctioning equipment •Shift from time-based maintenance to condition-based to prioritize and optimize maintenance resources and operations. Outage Detection & Prevention* •Identify outages in real-time, notify customers of outages and reduce time to resolution •Better forecasting models = lower service costs, reduced truck rolls, increased revenue, and higher customer satisfaction • Start Small, Think Big • Improve Top-line and bottom-line revenue • Develop In-house talent *AMI data is the foundation for both T&D and customer-focused use-cases.
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Manufacturing Data Lake for Global Operations Capabilities • Capture new and breakdown existing operational data silos • Democratize data access to a wider audience • Flexible architecture to incorporate the latest Apache open source/3rd party/customer innovations • Foster community “Not an ops historian but a enterprise historian of ALL PROCESS DATA” Design • Embedded analytics and visualizations • Embedded open source graphical data ingest • Proven at scale – 1M tags / minute • Comply with existing security, governance and operations • Built for extension
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Develop: Using HDP and HDF for Industrial IOT Hortonworks Customer • Schlumberger Drilling Technology Application Area • Real Time Drilling Data Delivery - WITSML A Few Requirements • Provenance – knowing where the data came from is crucial (and often missing) to real time decision making especially when dealing with 3,000 wells per month • Visualization – the ability to visualize the data flow at a granular level aids in troubleshooting and operational understanding • Reduced overhead leveraging NiFi vs. previously built custom-coded solution http://www.slideshare.net/HadoopSummit/from-zero-to-data-flow-in-hours-with-apache-nifi-64032731
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time Remote Surveillance Requirement – A New Business Model: • Fluid and flexible data platforms that can quickly integrate raw data and deliver actionable intelligence to people and processes • Ability to operate when network connectivity with a data center or the shore is intermittent, latent and provide minimal bandwidth • Analysis of large volumes of data and avoid data being stranded and out of reach for analysts and support teams. • Move from an operations posture of reacting and suffering from unnecessary downtime, equipment failures, efficiency losses, and safety risks • Increases the collective expertise available to support safer and more efficient operations Solution and Outcomes – New Sources of Value: • HDF aggregates, prioritizes, compresses and encrypts control system data before sending it over a 64 KB/sec satellite link to the data center in real-time • Data from top drives, BOPs and other equipment is in HDP and every data consumer from data scientist to BI users can be serviced from their tool of choice • Key data consumption patterns enabled include KPI dashboards, condition-based monitoring and maintenance, event-based surveillance, and traditional BI reporting; ensuring safer more efficient operations
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Open source is a way to enable a group of collaborative people to further their individual interests while contributing back to the community for the common good. Open source