Turning an idea into a Data-Driven Production System: An Energy Load Forecasting Case Study by Lucas García

BIG DATA SPAIN 2016 1
BIG DATA SPAIN 2016
© 2015 The MathWorks, Inc.
Turning an idea into a Data-Driven
Production System
An Energy Load Forecasting Case Study
Lucas García
Senior Application Engineer
MathWorks

What is Energy Forecasting?
From Wikipedia:
Energy forecasting is a broad term that refers to
"forecasting in the energy industry".
It includes - but is not limited to - forecasting demand
(load) and price of electricity, fossil fuels (natural
gas, oil, coal) and renewable energy sources (RES;
hydro, wind, solar).

What is Data Analytics?
• What happened?Descriptive
• Why did it happen?Diagnostics
• What will happen?Predictive
• What should be done?Prescriptive
Turn large volumes of complex data into actionable information
Data Decisions

Data Analytics – Using Data to Make Better Decisions
Develop Predictive
Models
Access and Explore
Data
Preprocess Data
Integrate Analytics with
Systems

Goal:
 Implement a tool for easy and accurate computation of day-ahead system load forecast
Requirements:
 Acquire and clean data from multiple
sources
 Accurate predictive model
 Easily deploy to production environment
Case Study: Day-Ahead Energy Load Forecasting

The Data
mis.nyiso.com/public/
NYISO Energy Load Data
cdo.ncdc.noaa.gov/qclcd_ascii/
National Climatic Data Center Weather Data

Data Analytics Workflow
Systems
Desktop Apps
Enterprise Scale
Systems
Embedded Devices
and Hardware
Files
Databases
Sensors
Access and Explore
Data
Develop Predictive
Models
Model Creation e.g.
Machine Learning
Model
Validation
Parameter
Optimization
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction

Systems
Desktop Apps
Enterprise Scale
Systems
Embedded Devices
and Hardware
Files
Databases
Sensors
Access and Explore
Data
Develop Predictive
Models
Model Creation e.g.
Machine Learning
Model
Validation
Parameter
Optimization
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction
1

Files
Databases
Sensors
Access and Explore
Data
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction
 Repositories – SQL, NoSQL, etc.
 File I/O – Text, Spreadsheet, etc.
 Web Sources – RESTful, JSON, etc.
Business and Transactional Data
Engineering, Scientific and Field Data
 Real-Time Sources – Sensors, GPS, etc.
 File I/O – Image, Audio, etc.
 Communication Protocols – OPC (OLE for
Process Control), CAN (Controller Area
Network), etc.

Files
Databases
Sensors
Access and Explore
Data
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction
 Data aggregation
– Different sources (files, web, etc.)
– Different types (images, text, audio, etc.)
 Data clean up
– Poorly formatted files
– Irregularly sampled data
– Redundant data, outliers, missing data etc.
 Data specific processing
– Signals: Smoothing, resampling, denoising,
Wavelet transforms, etc.
– Images: Image registration, morphological
filtering, deblurring, etc.
 Dealing with out of memory data (big data)
Challenges

Files
Databases
Sensors
Access and Explore
Data
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction
 Point and click tools to access
variety of data sources
 High-performance environment
for big data
Files
Signals
Databases
Images
 Built-in algorithms for data
preprocessing including sensor,
image, audio, video and other
real-time data
MATLAB Analytics work
with business and
engineering data
1

Systems
Desktop Apps
Enterprise Scale
Systems
Embedded Devices
and Hardware
Files
Databases
Sensors
Access and Explore
Data
Develop Predictive
Models
Model Creation e.g.
Machine Learning
Model
Validation
Parameter
Optimization
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction
1 2

Develop Predictive
Models
Model Creation e.g.
Machine Learning
Model
Validation
Parameter
Optimization
Challenges
 Lack of data science expertise
 Feature Extraction – How to transform
data to best represent the system?
– Requires subject matter expertise
– No right way of designing features
 Feature Selection – What attributes or
subset of data to use?
– Entails a lot of iteration – Trial and error
– Difficult to evaluate features
 Model Development
– Many different models
– Model Validation and Tuning
 Time required to conduct the analysis
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction

Develop Predictive
Models
Model Creation e.g.
Machine Learning
Model
Validation
Parameter
Optimization
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction
MATLAB enables
domain experts to
do Data Science
2
Apps Language
 Easy to use apps
 Wide breadth of tools to facilitate
domain specific analysis
 Examples/videos to get started
 Automatic MATLAB code
generation
 High speed processing of large
data sets

Systems
Desktop Apps
Enterprise Scale
Systems
Embedded Devices
and Hardware
Files
Databases
Sensors
Access and Explore
Data
Develop Predictive
Models
Model Creation e.g.
Machine Learning
Model
Validation
Parameter
Optimization
Preprocess Data
Working with
Messy Data
Data Reduction/
Transformation
Feature
Extraction
1 2 3

Systems
Desktop Apps
Enterprise Scale
Systems
Embedded Devices
and Hardware
Develop Predictive
Models
Model Creation e.g.
Machine Learning
Model
Validation
Parameter
Optimization
 End user: Operators, Analysts,
Administrative Staff, customers etc.
 Different target platforms:
– Cluster or Cloud environment
– Standalone desktop applications
– Server based Web and enterprise systems
– Embedded hardware
 Different Interfaces: C++, Java, Python,
.NET etc.
 Need to translate analytics to production
environment
Challenges

Integrate analytics with systems
MATLAB
Runtime
C, C++ HDL PLC
Embedded Hardware
C/C++ ++
Excel
Add-in Java
Hadoop/
Spark
.NET
MATLAB
Production
Server
Standalone
Application
Enterprise Systems
Python
MATLAB Analytics
run anywhere
3

MATLAB
Desktop
Deployed Analytics
MATLAB Production Server
MATLAB
Production
Server
Web
Application
Server
MATLAB
Production Server
RequestBroker
CTF
Apache Tomcat
Web Server/
Webservice
Weather
Data
Energy
Data
Predictive
Models
Train in
MATLAB

Key Takeaways
 Utilize all of your data
 Apply advanced analytics techniques
 Operationalize analytics to enterprise
systems and embedded devices
MATLAB Analytics work
with business and
engineering data
1
MATLAB enables
domain experts to do
Data Science
2
3MATLAB Analytics
run anywhere

Thank you!
Stay tuned: Twitter: @MATLAB | LinkedIn: https://www.linkedin.com/company/the-mathworks_2
% Send me your feedback:
% lucas.garcia@mathworks.com
% Twitter: @mathinking

Turning an idea into a Data-Driven Production System: An Energy Load Forecasting Case Study by Lucas García

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (8)

Similar to Turning an idea into a Data-Driven Production System: An Energy Load Forecasting Case Study by Lucas García

Similar to Turning an idea into a Data-Driven Production System: An Energy Load Forecasting Case Study by Lucas García (20)

More from Big Data Spain

More from Big Data Spain (20)

Recently uploaded

Recently uploaded (20)

Turning an idea into a Data-Driven Production System: An Energy Load Forecasting Case Study by Lucas García