SlideShare a Scribd company logo
New Clustering-based Forecasting Method for
Disaggregated End-consumer Electricity Load
Using Smart Grid Data
Peter Laurinec, and Mária Lucká
14.11.2017
Slovak University of Technology in Bratislava
Motivation
More accurate forecast of electricity consumption is needed due to:
• Optimization of electricity consumption.
• Distribution (utility) companies. Deregulation of the market.
Purchase and sale of electricity.
• Ecological factors.
However, it is very difficult task for individual end-consumers due to:
• Stochastic behaviour (processes).
• Many factors influencing the consumption:
• Seasonality
• Weather
• Holidays
• Market
1
Example of consumers electricity load
20
40
60
0 250 500 750 1000
Time (3 weeks)
Load(kW)
2
Example of consumers electricity load
3.0
3.5
4.0
4.5
0 250 500 750 1000
Time (3 weeks)
Load(kW)
3
Example of consumers electricity load
0.0
2.5
5.0
7.5
10.0
12.5
0 250 500 750 1000
Time (3 weeks)
Load(kW)
4
Example of consumers electricity load - residential
0
1
2
3
0 250 500 750 1000
Time (3 weeks)
Load(kW)
5
Classical vs. our approach
The classical way is to train a model for every consumer
separately (drawbacks).
Our approach uses data from all consumers in a smart grid to
overcome stochastic changes and noisy character of data (time
series).
Solution: clustering of all consumers.
6
Our method
We will suppose that N is a number of consumers, the length of the training set is 21
days (3 weeks) whereby in every day we will consider 24 × 2 = 48 measurements, and
we will execute one hour ahead forecasts.
1. Starting with iteration iter = 0.
2. Creating of time series for each consumer of the lengths of three weeks.
3. Normalisation of each time series by z-score (keeping a mean and a standard
deviation in memory for every time series).
4. Computation of representations of each time series.
5. K-means clustering of representations and an optimal number of clusters is
computed.
6. The extraction of K centroids and using them as training set to any forecasting
method.
7. The denormalisation of K forecasts using the stored mean and standard
deviation to produce N forecasts.
8. iter = iter + 1. If iter is divisible by 24 (iter mod 24 = 0 mod 24) then steps 4) and
5) are performed otherwise they are skipped and the stored centroids are used.
7
Representation of time series
After normalisation -> computation of representations of time
series.
We conducted from our previous works 1 that clustering
model-based representations significantly improves accuracy
of the forecast of the global (aggregate) consumption.
For a representation, regression coefficients from the multiple
linear regression is used. The linear model is composed of
daily and weekly seasonal parameters.
xt = βd1utd1 + · · · + βdsutds + βw1utw1 + · · · + βw6utw6 + εt
1
Laurinec et al., WCECS (2016) and ICDMW (2016)
8
Representation of time series
−1
0
1
2
3
0 250 500 750 1000
Length
NormalizedLoad
Original Time Series
Daily Period
Weekly
Period
−1
0
1
0 20 40
Length
RegressionCoefficients
Final Representation of Time Series
9
Clustering
17 18 19 20
13 14 15 16
9 10 11 12
5 6 7 8
1 2 3 4
0 20 40 0 20 40 0 20 40 0 20 40
−1
0
1
2
3
−2
−1
0
1
2
3
−2
0
2
−2
0
2
−2
−1
0
1
2
3
−1
0
1
2
−3
−2
−1
0
1
2
−2
0
2
−2
−1
0
1
2
0
2
4
−2
0
2
4
−1
0
1
2
−2
0
2
4
−2
0
2
−2
−1
0
1
2
3
−1
0
1
2
−1
0
1
2
3
0
2
4
−2
−1
0
1
2
−2
−1
0
1
2
Length
RegressionCoefficients
10
Final centroids
17 18 19 20
13 14 15 16
9 10 11 12
5 6 7 8
1 2 3 4
0 250 500 750 1000 0 250 500 750 1000 0 250 500 750 1000 0 250 500 750 1000
−0.5
0.0
0.5
1.0
1.5
−1.5
−1.0
−0.5
0.0
0.5
1.0
−0.50
−0.25
0.00
0.25
−1
0
1
−1.0
−0.5
0.0
0.5
−1.0
−0.5
0.0
0.5
1.0
−1.0
−0.5
0.0
0.5
−1.0
−0.5
0.0
0.5
1.0
−1.0
−0.5
0.0
0.5
1.0
−0.5
0.0
0.5
1.0
0
1
0
1
2
−0.5
0.0
0.5
−1.0
−0.5
0.0
0.5
1.0
−0.5
0.0
0.5
1.0
1.5
0
1
0
1
2
0
1
2
3
4
5
−1.0
−0.5
0.0
0.5
1.0
−1
0
1
Time
NormalizedLoad
11
Forecasting methods
Four methods were implemented
• Seasonal naive method (SNAIVE)
• Multiple Linear Regression (MLR)
• Random Forest (RF)
• Triple exponential smoothing (ES)
MAE (Mean Absolute Error):
1
n
n∑
t=1
|xt − xt|,
where xt is a real consumption, xt is the forecasted load and n
is a length of data.
12
Scaling forecasts
Denormalising K centroid-based forecasts by stored mean and
standard deviation from every consumer (N).
13
Data for experiments
We used two different datasets consisting of a large number of
variable patterns that were gathered from smart meters. This
measurement data includes Irish and Slovak electricity load
data.
For the Irish residential testing dataset (3639 consumers) the
data measurements from 1.2.2010 to 21.2.2010.
For the Slovak factories testing dataset (3607 consumers) the
data measurements from 10.2.2014 to 2.3.2014.
14
Evaluation
MAE Ireland dataset
Mean Median Max
SNAIVE_DisAgg 0.3807 ± 0.203 0.1928 ± 0.147 3.014 ± 1.3
SNAIVE_Clust 0.3373 ± 0.178 0.235 ± 0.143 2.6605 ± 1.192
MLR_Clust 0.3403 ± 0.18 0.2394 ± 0.146 2.6453 ± 1.187
RF_Clust 0.3394 ± 0.18 0.2425 ± 0.147 2.675 ± 1.192
ES_Clust 0.3359 ± 0.177 0.2387 ± 0.144 2.6629 ± 1.189
MAE Slovak dataset
Mean Median Max
SNAIVE_DisAgg 2.6903 ± 2.854 1.769 ± 2.27 16.1599 ± 14.621
SNAIVE_Clust 2.7873 ± 2.858 2.1479 ± 2.452 14.0958 ± 12.711
MLR_Clust 2.9326 ± 2.984 2.3109 ± 2.612 14.0306 ± 12.673
RF_Clust 2.7639 ± 2.836 2.0765 ± 2.388 14.4476 ± 13.081
ES_Clust 2.6752 ± 2.771 2.0283 ± 2.357 14.1695 ± 12.816
15
Ireland dataset results
0
1
2
3
4
5
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
Hour
MAE
Method
ES_Clust
MLR_Clust
RF_Clust
SNAIVE_Clust
SNAIVE_DisAgg
16
Slovak dataset results
0
20
40
60
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
Hour
MAE
Method
ES_Clust
MLR_Clust
RF_Clust
SNAIVE_Clust
SNAIVE_DisAgg
17
Conclusion
• Newly proposed clustering-based forecasting method for end-consumer load
using all data from a smart grid.
• We proved that our clustering-based method decreases the forecasting error in
the meaning of an average and the maximum (high rates of error).
• However, the error rates did not decrease with respect to the median because
of the nature of smart meter data.
• Our method needs to train only K models (in our case about 28) instead of N
models (thousands) that is leading to a huge decrease of the computational
load.
Future work:
• More experiments to find the number of optimal clusters.
• Other centroid-based clustering methods like K-medians, K-medoids and Fuzzy
C-means can be also used.
18

More Related Content

Similar to New Clustering-based Forecasting Method for Disaggregated End-consumer Electricity Load Using Smart Grid Data

Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...
Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...
Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...
GapData Institute
 
IPPTChap010.pptx
IPPTChap010.pptxIPPTChap010.pptx
IPPTChap010.pptx
QuangLong44
 
Statr session 25 and 26
Statr session 25 and 26Statr session 25 and 26
Statr session 25 and 26
Ruru Chowdhury
 
A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn...
 A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn... A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn...
A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn...
Pooyan Jamshidi
 
Wavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression modelsWavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression modelsVinit Sehgal
 
Forecasting Assignment Help
Forecasting Assignment HelpForecasting Assignment Help
Forecasting Assignment Help
HelpWithAssignment.com
 
DigitalGlobe Overview
DigitalGlobe OverviewDigitalGlobe Overview
DigitalGlobe Overview
CIMMYT
 
Writing Sample
Writing SampleWriting Sample
Writing SampleYiqun Li
 
Statistical Inference.pdf
Statistical Inference.pdfStatistical Inference.pdf
Statistical Inference.pdf
MuhammadFaizan389
 
Inverted Energy: Diesel Generator Set Replacement
Inverted Energy: Diesel Generator Set Replacement Inverted Energy: Diesel Generator Set Replacement
Inverted Energy: Diesel Generator Set Replacement
Inverted2019
 
Projectproposalon10mwsolarpvpowerplant 160312085340 (2)
Projectproposalon10mwsolarpvpowerplant 160312085340 (2)Projectproposalon10mwsolarpvpowerplant 160312085340 (2)
Projectproposalon10mwsolarpvpowerplant 160312085340 (2)
Muzaffar Ahmad
 
Project Proposal on 10 MW Solar PV Power Plant
Project Proposal on 10 MW Solar PV Power PlantProject Proposal on 10 MW Solar PV Power Plant
Project Proposal on 10 MW Solar PV Power Plant
Vignesh Sekar
 
T2K status in 2017
T2K status in 2017T2K status in 2017
T2K status in 2017
Son Cao
 
Statistical quality control
Statistical quality controlStatistical quality control
Statistical quality control
Sai Datri Arige
 
Push And Pull Production Systems Chap7 Ppt)
Push And Pull Production Systems   Chap7 Ppt)Push And Pull Production Systems   Chap7 Ppt)
Push And Pull Production Systems Chap7 Ppt)3abooodi
 
Application of ibearugbulem’s model for optimizing granite concrete mix
Application of ibearugbulem’s model for optimizing granite concrete mixApplication of ibearugbulem’s model for optimizing granite concrete mix
Application of ibearugbulem’s model for optimizing granite concrete mix
eSAT Publishing House
 
Web trafic time series forecasting
Web trafic time series forecastingWeb trafic time series forecasting
Web trafic time series forecasting
Korivi Sravan Kumar
 

Similar to New Clustering-based Forecasting Method for Disaggregated End-consumer Electricity Load Using Smart Grid Data (20)

Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...
Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...
Peter Laurinec: Clustering Electricity Consumers & Prediction, PyData Bratisl...
 
IPPTChap010.pptx
IPPTChap010.pptxIPPTChap010.pptx
IPPTChap010.pptx
 
Project 1
Project 1Project 1
Project 1
 
Statr session 25 and 26
Statr session 25 and 26Statr session 25 and 26
Statr session 25 and 26
 
A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn...
 A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn... A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn...
A Framework for Robust Control of Uncertainty in Self-Adaptive Software Conn...
 
Wavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression modelsWavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression models
 
Forecasting Assignment Help
Forecasting Assignment HelpForecasting Assignment Help
Forecasting Assignment Help
 
DigitalGlobe Overview
DigitalGlobe OverviewDigitalGlobe Overview
DigitalGlobe Overview
 
Writing Sample
Writing SampleWriting Sample
Writing Sample
 
Statistical Inference.pdf
Statistical Inference.pdfStatistical Inference.pdf
Statistical Inference.pdf
 
Inverted Energy: Diesel Generator Set Replacement
Inverted Energy: Diesel Generator Set Replacement Inverted Energy: Diesel Generator Set Replacement
Inverted Energy: Diesel Generator Set Replacement
 
Unit Commitment
Unit CommitmentUnit Commitment
Unit Commitment
 
Projectproposalon10mwsolarpvpowerplant 160312085340 (2)
Projectproposalon10mwsolarpvpowerplant 160312085340 (2)Projectproposalon10mwsolarpvpowerplant 160312085340 (2)
Projectproposalon10mwsolarpvpowerplant 160312085340 (2)
 
Project Proposal on 10 MW Solar PV Power Plant
Project Proposal on 10 MW Solar PV Power PlantProject Proposal on 10 MW Solar PV Power Plant
Project Proposal on 10 MW Solar PV Power Plant
 
T2K status in 2017
T2K status in 2017T2K status in 2017
T2K status in 2017
 
Presentation
PresentationPresentation
Presentation
 
Statistical quality control
Statistical quality controlStatistical quality control
Statistical quality control
 
Push And Pull Production Systems Chap7 Ppt)
Push And Pull Production Systems   Chap7 Ppt)Push And Pull Production Systems   Chap7 Ppt)
Push And Pull Production Systems Chap7 Ppt)
 
Application of ibearugbulem’s model for optimizing granite concrete mix
Application of ibearugbulem’s model for optimizing granite concrete mixApplication of ibearugbulem’s model for optimizing granite concrete mix
Application of ibearugbulem’s model for optimizing granite concrete mix
 
Web trafic time series forecasting
Web trafic time series forecastingWeb trafic time series forecasting
Web trafic time series forecasting
 

Recently uploaded

一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 

Recently uploaded (20)

一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 

New Clustering-based Forecasting Method for Disaggregated End-consumer Electricity Load Using Smart Grid Data

  • 1. New Clustering-based Forecasting Method for Disaggregated End-consumer Electricity Load Using Smart Grid Data Peter Laurinec, and Mária Lucká 14.11.2017 Slovak University of Technology in Bratislava
  • 2. Motivation More accurate forecast of electricity consumption is needed due to: • Optimization of electricity consumption. • Distribution (utility) companies. Deregulation of the market. Purchase and sale of electricity. • Ecological factors. However, it is very difficult task for individual end-consumers due to: • Stochastic behaviour (processes). • Many factors influencing the consumption: • Seasonality • Weather • Holidays • Market 1
  • 3. Example of consumers electricity load 20 40 60 0 250 500 750 1000 Time (3 weeks) Load(kW) 2
  • 4. Example of consumers electricity load 3.0 3.5 4.0 4.5 0 250 500 750 1000 Time (3 weeks) Load(kW) 3
  • 5. Example of consumers electricity load 0.0 2.5 5.0 7.5 10.0 12.5 0 250 500 750 1000 Time (3 weeks) Load(kW) 4
  • 6. Example of consumers electricity load - residential 0 1 2 3 0 250 500 750 1000 Time (3 weeks) Load(kW) 5
  • 7. Classical vs. our approach The classical way is to train a model for every consumer separately (drawbacks). Our approach uses data from all consumers in a smart grid to overcome stochastic changes and noisy character of data (time series). Solution: clustering of all consumers. 6
  • 8. Our method We will suppose that N is a number of consumers, the length of the training set is 21 days (3 weeks) whereby in every day we will consider 24 × 2 = 48 measurements, and we will execute one hour ahead forecasts. 1. Starting with iteration iter = 0. 2. Creating of time series for each consumer of the lengths of three weeks. 3. Normalisation of each time series by z-score (keeping a mean and a standard deviation in memory for every time series). 4. Computation of representations of each time series. 5. K-means clustering of representations and an optimal number of clusters is computed. 6. The extraction of K centroids and using them as training set to any forecasting method. 7. The denormalisation of K forecasts using the stored mean and standard deviation to produce N forecasts. 8. iter = iter + 1. If iter is divisible by 24 (iter mod 24 = 0 mod 24) then steps 4) and 5) are performed otherwise they are skipped and the stored centroids are used. 7
  • 9. Representation of time series After normalisation -> computation of representations of time series. We conducted from our previous works 1 that clustering model-based representations significantly improves accuracy of the forecast of the global (aggregate) consumption. For a representation, regression coefficients from the multiple linear regression is used. The linear model is composed of daily and weekly seasonal parameters. xt = βd1utd1 + · · · + βdsutds + βw1utw1 + · · · + βw6utw6 + εt 1 Laurinec et al., WCECS (2016) and ICDMW (2016) 8
  • 10. Representation of time series −1 0 1 2 3 0 250 500 750 1000 Length NormalizedLoad Original Time Series Daily Period Weekly Period −1 0 1 0 20 40 Length RegressionCoefficients Final Representation of Time Series 9
  • 11. Clustering 17 18 19 20 13 14 15 16 9 10 11 12 5 6 7 8 1 2 3 4 0 20 40 0 20 40 0 20 40 0 20 40 −1 0 1 2 3 −2 −1 0 1 2 3 −2 0 2 −2 0 2 −2 −1 0 1 2 3 −1 0 1 2 −3 −2 −1 0 1 2 −2 0 2 −2 −1 0 1 2 0 2 4 −2 0 2 4 −1 0 1 2 −2 0 2 4 −2 0 2 −2 −1 0 1 2 3 −1 0 1 2 −1 0 1 2 3 0 2 4 −2 −1 0 1 2 −2 −1 0 1 2 Length RegressionCoefficients 10
  • 12. Final centroids 17 18 19 20 13 14 15 16 9 10 11 12 5 6 7 8 1 2 3 4 0 250 500 750 1000 0 250 500 750 1000 0 250 500 750 1000 0 250 500 750 1000 −0.5 0.0 0.5 1.0 1.5 −1.5 −1.0 −0.5 0.0 0.5 1.0 −0.50 −0.25 0.00 0.25 −1 0 1 −1.0 −0.5 0.0 0.5 −1.0 −0.5 0.0 0.5 1.0 −1.0 −0.5 0.0 0.5 −1.0 −0.5 0.0 0.5 1.0 −1.0 −0.5 0.0 0.5 1.0 −0.5 0.0 0.5 1.0 0 1 0 1 2 −0.5 0.0 0.5 −1.0 −0.5 0.0 0.5 1.0 −0.5 0.0 0.5 1.0 1.5 0 1 0 1 2 0 1 2 3 4 5 −1.0 −0.5 0.0 0.5 1.0 −1 0 1 Time NormalizedLoad 11
  • 13. Forecasting methods Four methods were implemented • Seasonal naive method (SNAIVE) • Multiple Linear Regression (MLR) • Random Forest (RF) • Triple exponential smoothing (ES) MAE (Mean Absolute Error): 1 n n∑ t=1 |xt − xt|, where xt is a real consumption, xt is the forecasted load and n is a length of data. 12
  • 14. Scaling forecasts Denormalising K centroid-based forecasts by stored mean and standard deviation from every consumer (N). 13
  • 15. Data for experiments We used two different datasets consisting of a large number of variable patterns that were gathered from smart meters. This measurement data includes Irish and Slovak electricity load data. For the Irish residential testing dataset (3639 consumers) the data measurements from 1.2.2010 to 21.2.2010. For the Slovak factories testing dataset (3607 consumers) the data measurements from 10.2.2014 to 2.3.2014. 14
  • 16. Evaluation MAE Ireland dataset Mean Median Max SNAIVE_DisAgg 0.3807 ± 0.203 0.1928 ± 0.147 3.014 ± 1.3 SNAIVE_Clust 0.3373 ± 0.178 0.235 ± 0.143 2.6605 ± 1.192 MLR_Clust 0.3403 ± 0.18 0.2394 ± 0.146 2.6453 ± 1.187 RF_Clust 0.3394 ± 0.18 0.2425 ± 0.147 2.675 ± 1.192 ES_Clust 0.3359 ± 0.177 0.2387 ± 0.144 2.6629 ± 1.189 MAE Slovak dataset Mean Median Max SNAIVE_DisAgg 2.6903 ± 2.854 1.769 ± 2.27 16.1599 ± 14.621 SNAIVE_Clust 2.7873 ± 2.858 2.1479 ± 2.452 14.0958 ± 12.711 MLR_Clust 2.9326 ± 2.984 2.3109 ± 2.612 14.0306 ± 12.673 RF_Clust 2.7639 ± 2.836 2.0765 ± 2.388 14.4476 ± 13.081 ES_Clust 2.6752 ± 2.771 2.0283 ± 2.357 14.1695 ± 12.816 15
  • 17. Ireland dataset results 0 1 2 3 4 5 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Hour MAE Method ES_Clust MLR_Clust RF_Clust SNAIVE_Clust SNAIVE_DisAgg 16
  • 18. Slovak dataset results 0 20 40 60 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Hour MAE Method ES_Clust MLR_Clust RF_Clust SNAIVE_Clust SNAIVE_DisAgg 17
  • 19. Conclusion • Newly proposed clustering-based forecasting method for end-consumer load using all data from a smart grid. • We proved that our clustering-based method decreases the forecasting error in the meaning of an average and the maximum (high rates of error). • However, the error rates did not decrease with respect to the median because of the nature of smart meter data. • Our method needs to train only K models (in our case about 28) instead of N models (thousands) that is leading to a huge decrease of the computational load. Future work: • More experiments to find the number of optimal clusters. • Other centroid-based clustering methods like K-medians, K-medoids and Fuzzy C-means can be also used. 18