SlideShare a Scribd company logo
1 of 28
Download to read offline
Is Unsupervised Ensemble Learning Useful for
Aggregated or Clustered Load Forecasting?
Peter Laurinec, and Mária Lucká
22.9.2017
Slovak University of Technology in Bratislava
Motivation n.1
More accurate forecast of electricity consumption is needed
due to:
• Optimization of electricity consumption.
• Production of electricity. Overvoltage in grid.
• Distribution (utility) companies. Deregulation
of the market. Purchase and sale of electricity.
1
Forecasting methods
Factors influencing electricity load:
• Seasonality (daily, weekly, …)
• Weather (temperature, humidity, …)
• Holidays
• Random effects
What we can use for it:
• Time series analysis methods (ARIMA, ES)
• Linear regression methods (MLR, RLM, GAM)
• AI regression methods (Trees, Neural Networks, SVR)
• Time series data mining + clustering - Creating more predictable groups of
consumers1.
1
Laurinec et al., ICMLDA and ICDMW, 2016
2
Motivation n.2 and n.3
Lot of methods are possible to use. Which one to choose?
Solution: ensemble learning.
Dynamic clustering (actually offline batch clustering) -
Aggregated data from electricity consumers (clusters) changes
in each sliding window.
3
Clusters
13 14 15
9 10 11 12
5 6 7 8
1 2 3 4
0 250 500 750 1000 0 250 500 750 1000 0 250 500 750 1000
0 250 500 750 1000
2000
4000
6000
1500
1750
2000
2250
2500
1000
1500
2000
2500
3000
3500
1000
1500
2000
2500
1500
2000
2500
3000
3500
1000
1250
1500
1750
2000
400
800
1200
1600
2000
2500
5000
7500
10000
12500
0
25
50
75
2000
3000
4000
1500
2000
2500
3000
3500
1000
2000
3000
2000
3000
4000
750
1000
1250
1500
2000
2500
3000
Time
Load(kW)
4
Solution to the problem and a target
• Clustering (generating lot of time series) + ensemble
learning → classical weighting of models (based on test
set) is not possible and could be slow (lot of clusters, etc.).
• Weighting of models based on training (or validation) set
is possible, but it would be overfitted.
• Unsupervised ensemble learning can be promising
approach.
Target and contribution: Evaluate various unsupervised
ensemble learning methods combined with clustered and
also simply aggregated electricity load.
5
Base forecasting methods
Two most important factors (assumptions) that have to be
satisfied for used forecasting methods:
• Very fast to compute (i.e. week learners)
• Parametrically adaptable for various time series created
by clustering of consumers.
These assumptions satisfies methods:
• CART (RPART) tree
• CTREE - Conditional inference trees
• ARIMA
• Exponential smoothing
6
Detailed description of forecasting methods
RPART
• Simple daily and weekly attributes
CTREE.lag
• Lag feature (denoised); daily and weekly attributes in sinus and
cosine form.
CTREE.dft
• Daily and weekly attributes in form of Fourier transformation (6
and 12 pairs).
STL+ARIMA, STL+ES, ES
• STL decomposition in combination with ARIMA and ES; ES
standalone.
7
Unsupervised ensemble learning
The 1st possibility is to do ensemble learning on just base
forecasting methods.
Bootstrapping time series:
• Classical sampling from training set (bagging) - for
regression (RPART, CTREE)
• Moving block bootstrap (mbb) - for time series analysis
methods (ARIMA, ES)
Evaluated was the median from each of a bootstrapped
method.
8
Bootstrapping
For regression tree methods:
• The sample ratio was randomly sampled in the range of
0.7 − 0.9.
• Sampled was also maximal depth, complexity parameter,
minimal criterion
For time series analysis methods:
• Box-Cox transformation, STL decomposition, remainder is
bootstrapped by mbb, inverse Box-Cox 2
2
Bergmeir, Hyndman, Benítez, Int. J. of Forecasting, 2016
9
Moving block bootstrapping
Ireland
Slovakia
100
200
300
2000
2500
3000
3500
0 250 500 750 1000
Time
Load(kW)
10
Results of Bagging
4. STL+ARIMA 5. STL+ES 6. ES
1. CART (RPART) 2. CTREE+LAG 3. CTREE+FOURIER
0 10 20 30 40 50 0 10 20 30 40 50 0 10 20 30 40 50
2000
3000
1500
2000
2500
3000
3500
1500
2000
2500
3000
1500
2000
2500
3000
3500
1500
2000
2500
3000
1500
2000
2500
3000
3500
Time
Load(kW)
11
Unsupervised ensemble learning
The 2nd possibility is to combine all bootstrapped forecasts
from all methods.
Unsupervised (structure-based) possibilities for aggregating
bootstrapped forecasts:
• Simple mean or median
• Averaging by methods
• Clustering
Aggregating from all created bootstrappings:
a) Simple aggregation based - average and median
b) Naive cluster based - average of medians of methods
c) Cluster-based - K-means-based, DBSCAN-based and
OPTICS-based
12
Simple aggregation based methods
STL+ARIMA
STL+ARIMA CTREE_FOUR
ES
ES
STL+ARIMA
ES
ES
STL+ES
STL+ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
RPART
STL+ARIMA
STL+ARIMA
CTREE_FOUR
ES
STL+ARIMA
ES
CTREE_FOUR
RPART
RPART
STL+ARIMA
CTREE_FOUR
ES
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
CTREE_LAG
STL+ES
CTREE_LAG
STL+ES CTREE_FOUR
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
CTREE_FOUR
RPART
STL+ES
RPART
CTREE_FOUR
CTREE_FOUR
ES
CTREE_LAG
ES
STL+ES
CTREE_LAG
STL+ES
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
STL+ES
CTREE_LAG
RPART
ES
RPART
STL+ES
CTREE_LAGCTREE_LAG
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_LAG
ES
ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
ES
ES
RPART
CTREE_LAG
RPART
ES
CTREE_FOUR
RPART
STL+ES
CTREE_LAG
ES
STL+ARIMA
CTREE_LAG
ES CTREE_LAG
STL+ES
STL+ARIMA
RPART
STL+ES
ES
RPART
CTREE_FOUR
STL+ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_FOUR
CTREE_FOUR
ES
ES
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
RPART
CTREE_LAG
RPART
STL+ARIMA
STL+ES
CTREE_FOUR
ES
STL+ARIMA
STL+ES
RPART
STL+ES
RPART
STL+ES
STL+ARIMA
CTREE_LAG
RPART
ES
ES
CTREE_FOUR
ES
RPART
CTREE_FOUR
RPART
ES
CTREE_FOUR
ES
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
STL+ARIMA
RPART
ES
STL+ES
CTREE_FOUR
CTREE_FOURSTL+ARIMA
RPART
STL+ES
RPARTCTREE_FOUR
CTREE_FOUR
STL+ES
STL+ARIMA
ES
STL+ES
ES
CTREE_LAG
STL+ARIMA
RPARTCTREE_LAG
RPART
CTREE_FOUR
CTREE_LAG
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
CTREE_FOUR
STL+ES
CTREE_LAG
ES
CTREE_LAG
ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_LAG
RPART
CTREE_FOUR
RPART
RPART
RPART
STL+ARIMA
CTREE_FOUR
CTREE_LAG
STL+ARIMA
@X
−4
0
4
8
−10 0 10
PC1
PC2
Class
@
X
Mean
Median
13
Naive cluster based method
STL+ARIMA
STL+ARIMA CTREE_FOUR
ES
ES
STL+ARIMA
ES
ES
STL+ES
STL+ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
RPART
STL+ARIMA
STL+ARIMA
CTREE_FOUR
ES
STL+ARIMA
ES
CTREE_FOUR
RPART
RPART
STL+ARIMA
CTREE_FOUR
ES
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
CTREE_LAG
STL+ES
CTREE_LAG
STL+ES CTREE_FOUR
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
CTREE_FOUR
RPART
STL+ES
RPART
CTREE_FOUR
CTREE_FOUR
ES
CTREE_LAG
ES
STL+ES
CTREE_LAG
STL+ES
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
STL+ES
CTREE_LAG
RPART
ES
RPART
STL+ES
CTREE_LAGCTREE_LAG
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_LAG
ES
ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
ES
ES
RPART
CTREE_LAG
RPART
ES
CTREE_FOUR
RPART
STL+ES
CTREE_LAG
ES
STL+ARIMA
CTREE_LAG
ES CTREE_LAG
STL+ES
STL+ARIMA
RPART
STL+ES
ES
RPART
CTREE_FOUR
STL+ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_FOUR
CTREE_FOUR
ES
ES
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
RPART
CTREE_LAG
RPART
STL+ARIMA
STL+ES
CTREE_FOUR
ES
STL+ARIMA
STL+ES
RPART
STL+ES
RPART
STL+ES
STL+ARIMA
CTREE_LAG
RPART
ES
ES
CTREE_FOUR
ES
RPART
CTREE_FOUR
RPART
ES
CTREE_FOUR
ES
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
STL+ARIMA
RPART
ES
STL+ES
CTREE_FOUR
CTREE_FOURSTL+ARIMA
RPART
STL+ES
RPARTCTREE_FOUR
CTREE_FOUR
STL+ES
STL+ARIMA
ES
STL+ES
ES
CTREE_LAG
STL+ARIMA
RPARTCTREE_LAG
RPART
CTREE_FOUR
CTREE_LAG
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
CTREE_FOUR
STL+ES
CTREE_LAG
ES
CTREE_LAG
ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_LAG
RPART
CTREE_FOUR
RPART
RPART
RPART
STL+ARIMA
CTREE_FOUR
CTREE_LAG
STL+ARIMA
$ %+
&
@
#
X
−4
0
4
8
−10 0 10
PC1
PC2
Method
@
#
%
$
&
+
X
RPART
CTREE_LAG
CTREE_FOUR
STL+ARIMA
STL+ES
ES
Final
14
Cluster-based approaches
PCA is used to extract three principal components (from 48 dim.) in order to
reduce noise.
K-means-based:
• K-means clustering with K-means++ centroid initialization.
• Optimal number of clusters in the range of 3 − 8 by DB index.
• Centroids were averaged to the final ensemble forecast.
DBSCAN-based:
• requires two parameters: ϵ (set to 1.45) and minPts (set to 8).
• Ensemble forecast is created by the average of medians of clusters.
OPTICS-based:
• Automatic ξ-cluster procedure. ξ defines the degree of steepness (set
to 0.045), which is applied in the so-called reachability plot of
distances.
• The final ensemble forecast is the median of medians of clusters. 15
K-means-based method
STL+ARIMA
STL+ARIMA CTREE_FOUR
ES
ES
STL+ARIMA
ES
ES
STL+ES
STL+ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
RPART
STL+ARIMA
STL+ARIMA
CTREE_FOUR
ES
STL+ARIMA
ES
CTREE_FOUR
RPART
RPART
STL+ARIMA
CTREE_FOUR
ES
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
CTREE_LAG
STL+ES
CTREE_LAG
STL+ES CTREE_FOUR
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
CTREE_FOUR
RPART
STL+ES
RPART
CTREE_FOUR
CTREE_FOUR
ES
CTREE_LAG
ES
STL+ES
CTREE_LAG
STL+ES
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
STL+ES
CTREE_LAG
RPART
ES
RPART
STL+ES
CTREE_LAGCTREE_LAG
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_LAG
ES
ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
ES
ES
RPART
CTREE_LAG
RPART
ES
CTREE_FOUR
RPART
STL+ES
CTREE_LAG
ES
STL+ARIMA
CTREE_LAG
ES CTREE_LAG
STL+ES
STL+ARIMA
RPART
STL+ES
ES
RPART
CTREE_FOUR
STL+ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_FOUR
CTREE_FOUR
ES
ES
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
RPART
CTREE_LAG
RPART
STL+ARIMA
STL+ES
CTREE_FOUR
ES
STL+ARIMA
STL+ES
RPART
STL+ES
RPART
STL+ES
STL+ARIMA
CTREE_LAG
RPART
ES
ES
CTREE_FOUR
ES
RPART
CTREE_FOUR
RPART
ES
CTREE_FOUR
ES
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
STL+ARIMA
RPART
ES
STL+ES
CTREE_FOUR
CTREE_FOURSTL+ARIMA
RPART
STL+ES
RPARTCTREE_FOUR
CTREE_FOUR
STL+ES
STL+ARIMA
ES
STL+ES
ES
CTREE_LAG
STL+ARIMA
RPARTCTREE_LAG
RPART
CTREE_FOUR
CTREE_LAG
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
CTREE_FOUR
STL+ES
CTREE_LAG
ES
CTREE_LAG
ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_LAG
RPART
CTREE_FOUR
RPART
RPART
RPART
STL+ARIMA
CTREE_FOUR
CTREE_LAG
STL+ARIMA
+
y
@
m
#
$
%
&
X
−4
0
4
8
−10 0 10
PC1
PC2
Class
@
#
%
$
&
+
m
y
X
1
2
3
4
5
6
7
8
Final
16
DBSCAN-based method
STL+ARIMA
STL+ARIMA CTREE_FOUR
ES
ES
STL+ARIMA
ES
ES
STL+ES
STL+ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
RPART
STL+ARIMA
STL+ARIMA
CTREE_FOUR
ES
STL+ARIMA
ES
CTREE_FOUR
RPART
RPART
STL+ARIMA
CTREE_FOUR
ES
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
CTREE_LAG
STL+ES
CTREE_LAG
STL+ES CTREE_FOUR
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
CTREE_FOUR
RPART
STL+ES
RPART
CTREE_FOUR
CTREE_FOUR
ES
CTREE_LAG
ES
STL+ES
CTREE_LAG
STL+ES
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
STL+ES
CTREE_LAG
RPART
ES
RPART
STL+ES
CTREE_LAGCTREE_LAG
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_LAG
ES
ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
ES
ES
RPART
CTREE_LAG
RPART
ES
CTREE_FOUR
RPART
STL+ES
CTREE_LAG
ES
STL+ARIMA
CTREE_LAG
ES CTREE_LAG
STL+ES
STL+ARIMA
RPART
STL+ES
ES
RPART
CTREE_FOUR
STL+ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_FOUR
CTREE_FOUR
ES
ES
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
RPART
CTREE_LAG
RPART
STL+ARIMA
STL+ES
CTREE_FOUR
ES
STL+ARIMA
STL+ES
RPART
STL+ES
RPART
STL+ES
STL+ARIMA
CTREE_LAG
RPART
ES
ES
CTREE_FOUR
ES
RPART
CTREE_FOUR
RPART
ES
CTREE_FOUR
ES
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
STL+ARIMA
RPART
ES
STL+ES
CTREE_FOUR
CTREE_FOURSTL+ARIMA
RPART
STL+ES
RPARTCTREE_FOUR
CTREE_FOUR
STL+ES
STL+ARIMA
ES
STL+ES
ES
CTREE_LAG
STL+ARIMA
RPARTCTREE_LAG
RPART
CTREE_FOUR
CTREE_LAG
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
CTREE_FOUR
STL+ES
CTREE_LAG
ES
CTREE_LAG
ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_LAG
RPART
CTREE_FOUR
RPART
RPART
RPART
STL+ARIMA
CTREE_FOUR
CTREE_LAG
STL+ARIMA
X
−4
0
4
8
−10 0 10
PC1
PC2
Class
a
a
a
0
1
Final
Class
X Final
17
OPTICS-based method
STL+ARIMA
STL+ARIMA CTREE_FOUR
ES
ES
STL+ARIMA
ES
ES
STL+ES
STL+ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
RPART
STL+ARIMA
STL+ARIMA
CTREE_FOUR
ES
STL+ARIMA
ES
CTREE_FOUR
RPART
RPART
STL+ARIMA
CTREE_FOUR
ES
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
CTREE_LAG
STL+ES
CTREE_LAG
STL+ES CTREE_FOUR
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
CTREE_FOUR
RPART
STL+ES
RPART
CTREE_FOUR
CTREE_FOUR
ES
CTREE_LAG
ES
STL+ES
CTREE_LAG
STL+ES
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_FOUR
STL+ES
CTREE_LAG
RPART
ES
RPART
STL+ES
CTREE_LAGCTREE_LAG
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
RPART
CTREE_LAG
CTREE_LAG
ES
CTREE_LAG
ES
ES
STL+ARIMA
STL+ARIMA
STL+ARIMA
STL+ES
CTREE_FOUR
CTREE_FOUR
CTREE_LAG
ES
ES
RPART
CTREE_LAG
RPART
ES
CTREE_FOUR
RPART
STL+ES
CTREE_LAG
ES
STL+ARIMA
CTREE_LAG
ES CTREE_LAG
STL+ES
STL+ARIMA
RPART
STL+ES
ES
RPART
CTREE_FOUR
STL+ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_FOUR
CTREE_FOUR
ES
ES
STL+ARIMA
RPART
CTREE_LAG
CTREE_LAG
STL+ARIMA
RPART
CTREE_LAG
RPART
STL+ARIMA
STL+ES
CTREE_FOUR
ES
STL+ARIMA
STL+ES
RPART
STL+ES
RPART
STL+ES
STL+ARIMA
CTREE_LAG
RPART
ES
ES
CTREE_FOUR
ES
RPART
CTREE_FOUR
RPART
ES
CTREE_FOUR
ES
CTREE_LAG
CTREE_LAG
STL+ARIMA
CTREE_LAG
STL+ARIMA
RPART
ES
STL+ES
CTREE_FOUR
CTREE_FOURSTL+ARIMA
RPART
STL+ES
RPARTCTREE_FOUR
CTREE_FOUR
STL+ES
STL+ARIMA
ES
STL+ES
ES
CTREE_LAG
STL+ARIMA
RPARTCTREE_LAG
RPART
CTREE_FOUR
CTREE_LAG
CTREE_FOUR
CTREE_FOUR
STL+ARIMA
CTREE_FOUR
STL+ES
CTREE_LAG
ES
CTREE_LAG
ES
RPARTSTL+ARIMA
CTREE_FOUR
CTREE_LAG
RPART
CTREE_FOUR
RPART
RPART
RPART
STL+ARIMA
CTREE_FOUR
CTREE_LAG
STL+ARIMA
@
+
$
%
&
y
m
#
X
−4
0
4
8
−10 0 10
PC1
PC2
Class
@
#
%
$
&
+
m
y
X
1
2
3
4
5
6
7
8
Final
18
Clustering consumers
Aggregation with clustering
1. Set of time series of electricity consumption of the length of two weeks
2. Normalization (z-score)
3. Computation of representations of time series by MLR (extraction of D.
and W. reg. coeff.)
4. Automatic determination of optimal number of clusters K (DB-index)
5. Clustering of representations (K-means with centroids initialization by
K-means++)
6. Summation of K time series by found clusters
7. Training of K forecast models and the following forecast
8. Summation of forecasts and evaluation
9. Remove first day and add new one to the training window (sliding
window approach), go to step 1
19
Data from smart meters
Ireland residences
• 3639 consumers. Residences.
• 48 measurements per day.
• Test set from three months of year 2010 (February, May
and September).
0
1
2
3
4
0 250 500 750 1000
Time
Load(kW)
20
Data from smart meters
Slovak factories
• 3630 consumers. Factories and enterprises.
• 96 measurements per day. Aggregated to 48.
• Test set from three months of years 2013 and 2014
(September, February and March, June).
20
40
60
0 250 500 750 1000
Time
Load(kW)
21
Evaluation of forecast
The accuracy of the forecast of electricity consumption was
measured by MAPE (Mean Absolute Percentage Error).
MAPE = 100 ×
1
n
n∑
t=1
|xt − xt|
xt
,
where xt is a real consumption, xt is a forecasted load and n is
a length of the time series.
22
Results - ensembles
Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov.
CART.bagg 3.7908 3.7964 3.1561 3.0993
CTREE.bagg.lag 3.8081 3.7599 2.9568 2.8730
CTREE.bagg.dft 3.6746 3.7103 3.0080 2.9341
STL+ARIMA.mbb 3.9344 3.9085 3.0325 2.9993
STL+ES.mbb 3.9901 4.0221 3.0306 3.0021
ES.mbb 4.0565 4.0723 2.9760 2.9446
Average 3.7034 3.6970 2.8312 2.8086
Median 3.6103 3.6046 2.8329 2.7980
AveMedians 3.6704 3.6771 2.8179 2.7901
K-means 4.3018 4.0189 2.9715 3.0916
DBSCAN 3.9752 3.7985 2.9352 2.7532
OPTICS 3.7482 3.7239 2.9253 2.7982
Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov.
0.0011 <0.0001 0.1379 0.2415
23
Results - base vs. ensemble
Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov.
CART 3.8570 3.8502 3.1921 3.1416
CTREE.lag 3.9203 3.7523 2.9950 2.8954
CTREE.dft 4.0849 3.9214 3.1944 3.0096
STL+ARIMA 4.0718 3.8943 2.7567 2.7404
STL+ES 4.2750 4.1866 2.6887 2.6424
ES 4.8000 4.2219 2.3957 2.4672
Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov.
CART.bagg 0.1113 0.0760 0.0075 0.0002
CTREE.bagg.lag 0.0001 0.4807 0.0004 0.0036
CTREE.bagg.dft <0.0001 <0.0001 <0.0001 0.0002
STL+ARIMA.mbb 0.0178 0.0592 0.9875 0.9996
STL+ES.mbb 0.0380 0.0330 0.9980 0.9995
ES.mbb <0.0001 0.2559 1.0000 1.0000
24
When time series bootstrapping fails
1500
2000
2500
0 250 500 750 1000
Time
Load(kW)
25
When time series bootstrapping fails
1000
2000
3000
4000
5000
0 250 500 750 1000
Time
Load(kW)
26
Conclusion
• The simple median aggregation of bootstrapped forecasts
is very good approach.
• Clustering-based ensembles is not always the best
approach.
• The bagging using STL decomposition and Box-Cox
transformation fails when data are noisy.
• Exponential smoothing state space models - follow it, use
it! Can be combined with the Temporal Hierarchical
Forecasting as well.
Future work:
• Develop more robust bagging methods for time series.
• Develop more automatic techniques for density-based
clustering. 27

More Related Content

What's hot

Clustering techniques
Clustering techniquesClustering techniques
Clustering techniquestalktoharry
 
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...Salford Systems
 
WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...
WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...
WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...Journal For Research
 
Classical force fields as physics-based neural networks
Classical force fields as physics-based neural networksClassical force fields as physics-based neural networks
Classical force fields as physics-based neural networksaimsnist
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFIan Foster
 
Short Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkShort Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkIJERA Editor
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...aimsnist
 
Mining Regional Knowledge in Spatial Dataset
Mining Regional Knowledge in Spatial DatasetMining Regional Knowledge in Spatial Dataset
Mining Regional Knowledge in Spatial Datasetbutest
 
Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료taeseon ryu
 
Dear - 딥러닝 논문읽기 모임 김창연님
Dear - 딥러닝 논문읽기 모임 김창연님Dear - 딥러닝 논문읽기 모임 김창연님
Dear - 딥러닝 논문읽기 모임 김창연님taeseon ryu
 
RUCK 2017 MxNet과 R을 연동한 딥러닝 소개
RUCK 2017 MxNet과 R을 연동한 딥러닝 소개RUCK 2017 MxNet과 R을 연동한 딥러닝 소개
RUCK 2017 MxNet과 R을 연동한 딥러닝 소개r-kor
 
Noha danms13 talk_final
Noha danms13 talk_finalNoha danms13 talk_final
Noha danms13 talk_finalNoha Elprince
 
Scaling out logistic regression with Spark
Scaling out logistic regression with SparkScaling out logistic regression with Spark
Scaling out logistic regression with SparkBarak Gitsis
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...MLconf
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
Overview of DuraMat software tool development (poster version)
Overview of DuraMat software tool development(poster version)Overview of DuraMat software tool development(poster version)
Overview of DuraMat software tool development (poster version)Anubhav Jain
 
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017MLconf
 
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...WiMLDSMontreal
 
Multinomial Logistic Regression with Apache Spark
Multinomial Logistic Regression with Apache SparkMultinomial Logistic Regression with Apache Spark
Multinomial Logistic Regression with Apache SparkDB Tsai
 

What's hot (20)

Clustering techniques
Clustering techniquesClustering techniques
Clustering techniques
 
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
 
WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...
WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...
WIND SPEED & POWER FORECASTING USING ARTIFICIAL NEURAL NETWORK (NARX) FOR NEW...
 
Classical force fields as physics-based neural networks
Classical force fields as physics-based neural networksClassical force fields as physics-based neural networks
Classical force fields as physics-based neural networks
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCF
 
Short Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkShort Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural Network
 
D0931621
D0931621D0931621
D0931621
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...
 
Mining Regional Knowledge in Spatial Dataset
Mining Regional Knowledge in Spatial DatasetMining Regional Knowledge in Spatial Dataset
Mining Regional Knowledge in Spatial Dataset
 
Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료
 
Dear - 딥러닝 논문읽기 모임 김창연님
Dear - 딥러닝 논문읽기 모임 김창연님Dear - 딥러닝 논문읽기 모임 김창연님
Dear - 딥러닝 논문읽기 모임 김창연님
 
RUCK 2017 MxNet과 R을 연동한 딥러닝 소개
RUCK 2017 MxNet과 R을 연동한 딥러닝 소개RUCK 2017 MxNet과 R을 연동한 딥러닝 소개
RUCK 2017 MxNet과 R을 연동한 딥러닝 소개
 
Noha danms13 talk_final
Noha danms13 talk_finalNoha danms13 talk_final
Noha danms13 talk_final
 
Scaling out logistic regression with Spark
Scaling out logistic regression with SparkScaling out logistic regression with Spark
Scaling out logistic regression with Spark
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
Overview of DuraMat software tool development (poster version)
Overview of DuraMat software tool development(poster version)Overview of DuraMat software tool development(poster version)
Overview of DuraMat software tool development (poster version)
 
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
 
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
 
Multinomial Logistic Regression with Apache Spark
Multinomial Logistic Regression with Apache SparkMultinomial Logistic Regression with Apache Spark
Multinomial Logistic Regression with Apache Spark
 

Similar to Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load Forecasting?

FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETScsandit
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Pedro Lopes
 
Machine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sMachine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sSiddharth Chakravarty
 
Short-Term Load Forecasting of Australian National Electricity Market by Hier...
Short-Term Load Forecasting of Australian National Electricity Market by Hier...Short-Term Load Forecasting of Australian National Electricity Market by Hier...
Short-Term Load Forecasting of Australian National Electricity Market by Hier...JEE HYUN PARK
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?Anubhav Jain
 
ES_SAA_OG_PF_ECCTD_Pos
ES_SAA_OG_PF_ECCTD_PosES_SAA_OG_PF_ECCTD_Pos
ES_SAA_OG_PF_ECCTD_PosSyed Asad Alam
 
ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)
ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)
ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)Minia University, Egypt
 
Target Response Electrical usage Profile Clustering using Big Data
Target Response Electrical usage Profile Clustering using Big DataTarget Response Electrical usage Profile Clustering using Big Data
Target Response Electrical usage Profile Clustering using Big DataIRJET Journal
 
Tutorial-on-DNN-09A-Co-design-Sparsity.pdf
Tutorial-on-DNN-09A-Co-design-Sparsity.pdfTutorial-on-DNN-09A-Co-design-Sparsity.pdf
Tutorial-on-DNN-09A-Co-design-Sparsity.pdfDuy-Hieu Bui
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGijcsit
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGijcsit
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGAIRCC Publishing Corporation
 

Similar to Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load Forecasting? (20)

RSlovakia #1 meetup
RSlovakia #1 meetupRSlovakia #1 meetup
RSlovakia #1 meetup
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013
 
Prediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source toolsPrediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source tools
 
Energy saving policies final
Energy saving policies finalEnergy saving policies final
Energy saving policies final
 
L14.pdf
L14.pdfL14.pdf
L14.pdf
 
Machine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sMachine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’s
 
Short-Term Load Forecasting of Australian National Electricity Market by Hier...
Short-Term Load Forecasting of Australian National Electricity Market by Hier...Short-Term Load Forecasting of Australian National Electricity Market by Hier...
Short-Term Load Forecasting of Australian National Electricity Market by Hier...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
H235055
H235055H235055
H235055
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?
 
ES_SAA_OG_PF_ECCTD_Pos
ES_SAA_OG_PF_ECCTD_PosES_SAA_OG_PF_ECCTD_Pos
ES_SAA_OG_PF_ECCTD_Pos
 
Time series
Time seriesTime series
Time series
 
ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)
ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)
ES-Rank: Evolutionary Strategy Learning to Rank Approach (Presentation)
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis Presentation
 
Target Response Electrical usage Profile Clustering using Big Data
Target Response Electrical usage Profile Clustering using Big DataTarget Response Electrical usage Profile Clustering using Big Data
Target Response Electrical usage Profile Clustering using Big Data
 
Tutorial-on-DNN-09A-Co-design-Sparsity.pdf
Tutorial-on-DNN-09A-Co-design-Sparsity.pdfTutorial-on-DNN-09A-Co-design-Sparsity.pdf
Tutorial-on-DNN-09A-Co-design-Sparsity.pdf
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
 

Recently uploaded

Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 

Recently uploaded (20)

Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 

Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load Forecasting?

  • 1. Is Unsupervised Ensemble Learning Useful for Aggregated or Clustered Load Forecasting? Peter Laurinec, and Mária Lucká 22.9.2017 Slovak University of Technology in Bratislava
  • 2. Motivation n.1 More accurate forecast of electricity consumption is needed due to: • Optimization of electricity consumption. • Production of electricity. Overvoltage in grid. • Distribution (utility) companies. Deregulation of the market. Purchase and sale of electricity. 1
  • 3. Forecasting methods Factors influencing electricity load: • Seasonality (daily, weekly, …) • Weather (temperature, humidity, …) • Holidays • Random effects What we can use for it: • Time series analysis methods (ARIMA, ES) • Linear regression methods (MLR, RLM, GAM) • AI regression methods (Trees, Neural Networks, SVR) • Time series data mining + clustering - Creating more predictable groups of consumers1. 1 Laurinec et al., ICMLDA and ICDMW, 2016 2
  • 4. Motivation n.2 and n.3 Lot of methods are possible to use. Which one to choose? Solution: ensemble learning. Dynamic clustering (actually offline batch clustering) - Aggregated data from electricity consumers (clusters) changes in each sliding window. 3
  • 5. Clusters 13 14 15 9 10 11 12 5 6 7 8 1 2 3 4 0 250 500 750 1000 0 250 500 750 1000 0 250 500 750 1000 0 250 500 750 1000 2000 4000 6000 1500 1750 2000 2250 2500 1000 1500 2000 2500 3000 3500 1000 1500 2000 2500 1500 2000 2500 3000 3500 1000 1250 1500 1750 2000 400 800 1200 1600 2000 2500 5000 7500 10000 12500 0 25 50 75 2000 3000 4000 1500 2000 2500 3000 3500 1000 2000 3000 2000 3000 4000 750 1000 1250 1500 2000 2500 3000 Time Load(kW) 4
  • 6. Solution to the problem and a target • Clustering (generating lot of time series) + ensemble learning → classical weighting of models (based on test set) is not possible and could be slow (lot of clusters, etc.). • Weighting of models based on training (or validation) set is possible, but it would be overfitted. • Unsupervised ensemble learning can be promising approach. Target and contribution: Evaluate various unsupervised ensemble learning methods combined with clustered and also simply aggregated electricity load. 5
  • 7. Base forecasting methods Two most important factors (assumptions) that have to be satisfied for used forecasting methods: • Very fast to compute (i.e. week learners) • Parametrically adaptable for various time series created by clustering of consumers. These assumptions satisfies methods: • CART (RPART) tree • CTREE - Conditional inference trees • ARIMA • Exponential smoothing 6
  • 8. Detailed description of forecasting methods RPART • Simple daily and weekly attributes CTREE.lag • Lag feature (denoised); daily and weekly attributes in sinus and cosine form. CTREE.dft • Daily and weekly attributes in form of Fourier transformation (6 and 12 pairs). STL+ARIMA, STL+ES, ES • STL decomposition in combination with ARIMA and ES; ES standalone. 7
  • 9. Unsupervised ensemble learning The 1st possibility is to do ensemble learning on just base forecasting methods. Bootstrapping time series: • Classical sampling from training set (bagging) - for regression (RPART, CTREE) • Moving block bootstrap (mbb) - for time series analysis methods (ARIMA, ES) Evaluated was the median from each of a bootstrapped method. 8
  • 10. Bootstrapping For regression tree methods: • The sample ratio was randomly sampled in the range of 0.7 − 0.9. • Sampled was also maximal depth, complexity parameter, minimal criterion For time series analysis methods: • Box-Cox transformation, STL decomposition, remainder is bootstrapped by mbb, inverse Box-Cox 2 2 Bergmeir, Hyndman, Benítez, Int. J. of Forecasting, 2016 9
  • 12. Results of Bagging 4. STL+ARIMA 5. STL+ES 6. ES 1. CART (RPART) 2. CTREE+LAG 3. CTREE+FOURIER 0 10 20 30 40 50 0 10 20 30 40 50 0 10 20 30 40 50 2000 3000 1500 2000 2500 3000 3500 1500 2000 2500 3000 1500 2000 2500 3000 3500 1500 2000 2500 3000 1500 2000 2500 3000 3500 Time Load(kW) 11
  • 13. Unsupervised ensemble learning The 2nd possibility is to combine all bootstrapped forecasts from all methods. Unsupervised (structure-based) possibilities for aggregating bootstrapped forecasts: • Simple mean or median • Averaging by methods • Clustering Aggregating from all created bootstrappings: a) Simple aggregation based - average and median b) Naive cluster based - average of medians of methods c) Cluster-based - K-means-based, DBSCAN-based and OPTICS-based 12
  • 14. Simple aggregation based methods STL+ARIMA STL+ARIMA CTREE_FOUR ES ES STL+ARIMA ES ES STL+ES STL+ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES RPART STL+ARIMA STL+ARIMA CTREE_FOUR ES STL+ARIMA ES CTREE_FOUR RPART RPART STL+ARIMA CTREE_FOUR ES CTREE_FOUR CTREE_FOUR STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG CTREE_LAG STL+ES CTREE_LAG STL+ES CTREE_FOUR CTREE_LAG CTREE_LAG ES CTREE_FOUR CTREE_FOUR RPART STL+ES RPART CTREE_FOUR CTREE_FOUR ES CTREE_LAG ES STL+ES CTREE_LAG STL+ES CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_FOUR STL+ES CTREE_LAG RPART ES RPART STL+ES CTREE_LAGCTREE_LAG STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_LAG ES ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG ES ES RPART CTREE_LAG RPART ES CTREE_FOUR RPART STL+ES CTREE_LAG ES STL+ARIMA CTREE_LAG ES CTREE_LAG STL+ES STL+ARIMA RPART STL+ES ES RPART CTREE_FOUR STL+ES RPARTSTL+ARIMA CTREE_FOUR CTREE_FOUR CTREE_FOUR ES ES STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA RPART CTREE_LAG RPART STL+ARIMA STL+ES CTREE_FOUR ES STL+ARIMA STL+ES RPART STL+ES RPART STL+ES STL+ARIMA CTREE_LAG RPART ES ES CTREE_FOUR ES RPART CTREE_FOUR RPART ES CTREE_FOUR ES CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG STL+ARIMA RPART ES STL+ES CTREE_FOUR CTREE_FOURSTL+ARIMA RPART STL+ES RPARTCTREE_FOUR CTREE_FOUR STL+ES STL+ARIMA ES STL+ES ES CTREE_LAG STL+ARIMA RPARTCTREE_LAG RPART CTREE_FOUR CTREE_LAG CTREE_FOUR CTREE_FOUR STL+ARIMA CTREE_FOUR STL+ES CTREE_LAG ES CTREE_LAG ES RPARTSTL+ARIMA CTREE_FOUR CTREE_LAG RPART CTREE_FOUR RPART RPART RPART STL+ARIMA CTREE_FOUR CTREE_LAG STL+ARIMA @X −4 0 4 8 −10 0 10 PC1 PC2 Class @ X Mean Median 13
  • 15. Naive cluster based method STL+ARIMA STL+ARIMA CTREE_FOUR ES ES STL+ARIMA ES ES STL+ES STL+ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES RPART STL+ARIMA STL+ARIMA CTREE_FOUR ES STL+ARIMA ES CTREE_FOUR RPART RPART STL+ARIMA CTREE_FOUR ES CTREE_FOUR CTREE_FOUR STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG CTREE_LAG STL+ES CTREE_LAG STL+ES CTREE_FOUR CTREE_LAG CTREE_LAG ES CTREE_FOUR CTREE_FOUR RPART STL+ES RPART CTREE_FOUR CTREE_FOUR ES CTREE_LAG ES STL+ES CTREE_LAG STL+ES CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_FOUR STL+ES CTREE_LAG RPART ES RPART STL+ES CTREE_LAGCTREE_LAG STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_LAG ES ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG ES ES RPART CTREE_LAG RPART ES CTREE_FOUR RPART STL+ES CTREE_LAG ES STL+ARIMA CTREE_LAG ES CTREE_LAG STL+ES STL+ARIMA RPART STL+ES ES RPART CTREE_FOUR STL+ES RPARTSTL+ARIMA CTREE_FOUR CTREE_FOUR CTREE_FOUR ES ES STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA RPART CTREE_LAG RPART STL+ARIMA STL+ES CTREE_FOUR ES STL+ARIMA STL+ES RPART STL+ES RPART STL+ES STL+ARIMA CTREE_LAG RPART ES ES CTREE_FOUR ES RPART CTREE_FOUR RPART ES CTREE_FOUR ES CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG STL+ARIMA RPART ES STL+ES CTREE_FOUR CTREE_FOURSTL+ARIMA RPART STL+ES RPARTCTREE_FOUR CTREE_FOUR STL+ES STL+ARIMA ES STL+ES ES CTREE_LAG STL+ARIMA RPARTCTREE_LAG RPART CTREE_FOUR CTREE_LAG CTREE_FOUR CTREE_FOUR STL+ARIMA CTREE_FOUR STL+ES CTREE_LAG ES CTREE_LAG ES RPARTSTL+ARIMA CTREE_FOUR CTREE_LAG RPART CTREE_FOUR RPART RPART RPART STL+ARIMA CTREE_FOUR CTREE_LAG STL+ARIMA $ %+ & @ # X −4 0 4 8 −10 0 10 PC1 PC2 Method @ # % $ & + X RPART CTREE_LAG CTREE_FOUR STL+ARIMA STL+ES ES Final 14
  • 16. Cluster-based approaches PCA is used to extract three principal components (from 48 dim.) in order to reduce noise. K-means-based: • K-means clustering with K-means++ centroid initialization. • Optimal number of clusters in the range of 3 − 8 by DB index. • Centroids were averaged to the final ensemble forecast. DBSCAN-based: • requires two parameters: ϵ (set to 1.45) and minPts (set to 8). • Ensemble forecast is created by the average of medians of clusters. OPTICS-based: • Automatic ξ-cluster procedure. ξ defines the degree of steepness (set to 0.045), which is applied in the so-called reachability plot of distances. • The final ensemble forecast is the median of medians of clusters. 15
  • 17. K-means-based method STL+ARIMA STL+ARIMA CTREE_FOUR ES ES STL+ARIMA ES ES STL+ES STL+ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES RPART STL+ARIMA STL+ARIMA CTREE_FOUR ES STL+ARIMA ES CTREE_FOUR RPART RPART STL+ARIMA CTREE_FOUR ES CTREE_FOUR CTREE_FOUR STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG CTREE_LAG STL+ES CTREE_LAG STL+ES CTREE_FOUR CTREE_LAG CTREE_LAG ES CTREE_FOUR CTREE_FOUR RPART STL+ES RPART CTREE_FOUR CTREE_FOUR ES CTREE_LAG ES STL+ES CTREE_LAG STL+ES CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_FOUR STL+ES CTREE_LAG RPART ES RPART STL+ES CTREE_LAGCTREE_LAG STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_LAG ES ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG ES ES RPART CTREE_LAG RPART ES CTREE_FOUR RPART STL+ES CTREE_LAG ES STL+ARIMA CTREE_LAG ES CTREE_LAG STL+ES STL+ARIMA RPART STL+ES ES RPART CTREE_FOUR STL+ES RPARTSTL+ARIMA CTREE_FOUR CTREE_FOUR CTREE_FOUR ES ES STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA RPART CTREE_LAG RPART STL+ARIMA STL+ES CTREE_FOUR ES STL+ARIMA STL+ES RPART STL+ES RPART STL+ES STL+ARIMA CTREE_LAG RPART ES ES CTREE_FOUR ES RPART CTREE_FOUR RPART ES CTREE_FOUR ES CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG STL+ARIMA RPART ES STL+ES CTREE_FOUR CTREE_FOURSTL+ARIMA RPART STL+ES RPARTCTREE_FOUR CTREE_FOUR STL+ES STL+ARIMA ES STL+ES ES CTREE_LAG STL+ARIMA RPARTCTREE_LAG RPART CTREE_FOUR CTREE_LAG CTREE_FOUR CTREE_FOUR STL+ARIMA CTREE_FOUR STL+ES CTREE_LAG ES CTREE_LAG ES RPARTSTL+ARIMA CTREE_FOUR CTREE_LAG RPART CTREE_FOUR RPART RPART RPART STL+ARIMA CTREE_FOUR CTREE_LAG STL+ARIMA + y @ m # $ % & X −4 0 4 8 −10 0 10 PC1 PC2 Class @ # % $ & + m y X 1 2 3 4 5 6 7 8 Final 16
  • 18. DBSCAN-based method STL+ARIMA STL+ARIMA CTREE_FOUR ES ES STL+ARIMA ES ES STL+ES STL+ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES RPART STL+ARIMA STL+ARIMA CTREE_FOUR ES STL+ARIMA ES CTREE_FOUR RPART RPART STL+ARIMA CTREE_FOUR ES CTREE_FOUR CTREE_FOUR STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG CTREE_LAG STL+ES CTREE_LAG STL+ES CTREE_FOUR CTREE_LAG CTREE_LAG ES CTREE_FOUR CTREE_FOUR RPART STL+ES RPART CTREE_FOUR CTREE_FOUR ES CTREE_LAG ES STL+ES CTREE_LAG STL+ES CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_FOUR STL+ES CTREE_LAG RPART ES RPART STL+ES CTREE_LAGCTREE_LAG STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_LAG ES ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG ES ES RPART CTREE_LAG RPART ES CTREE_FOUR RPART STL+ES CTREE_LAG ES STL+ARIMA CTREE_LAG ES CTREE_LAG STL+ES STL+ARIMA RPART STL+ES ES RPART CTREE_FOUR STL+ES RPARTSTL+ARIMA CTREE_FOUR CTREE_FOUR CTREE_FOUR ES ES STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA RPART CTREE_LAG RPART STL+ARIMA STL+ES CTREE_FOUR ES STL+ARIMA STL+ES RPART STL+ES RPART STL+ES STL+ARIMA CTREE_LAG RPART ES ES CTREE_FOUR ES RPART CTREE_FOUR RPART ES CTREE_FOUR ES CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG STL+ARIMA RPART ES STL+ES CTREE_FOUR CTREE_FOURSTL+ARIMA RPART STL+ES RPARTCTREE_FOUR CTREE_FOUR STL+ES STL+ARIMA ES STL+ES ES CTREE_LAG STL+ARIMA RPARTCTREE_LAG RPART CTREE_FOUR CTREE_LAG CTREE_FOUR CTREE_FOUR STL+ARIMA CTREE_FOUR STL+ES CTREE_LAG ES CTREE_LAG ES RPARTSTL+ARIMA CTREE_FOUR CTREE_LAG RPART CTREE_FOUR RPART RPART RPART STL+ARIMA CTREE_FOUR CTREE_LAG STL+ARIMA X −4 0 4 8 −10 0 10 PC1 PC2 Class a a a 0 1 Final Class X Final 17
  • 19. OPTICS-based method STL+ARIMA STL+ARIMA CTREE_FOUR ES ES STL+ARIMA ES ES STL+ES STL+ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES RPART STL+ARIMA STL+ARIMA CTREE_FOUR ES STL+ARIMA ES CTREE_FOUR RPART RPART STL+ARIMA CTREE_FOUR ES CTREE_FOUR CTREE_FOUR STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG CTREE_LAG STL+ES CTREE_LAG STL+ES CTREE_FOUR CTREE_LAG CTREE_LAG ES CTREE_FOUR CTREE_FOUR RPART STL+ES RPART CTREE_FOUR CTREE_FOUR ES CTREE_LAG ES STL+ES CTREE_LAG STL+ES CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_FOUR STL+ES CTREE_LAG RPART ES RPART STL+ES CTREE_LAGCTREE_LAG STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG RPART CTREE_LAG CTREE_LAG ES CTREE_LAG ES ES STL+ARIMA STL+ARIMA STL+ARIMA STL+ES CTREE_FOUR CTREE_FOUR CTREE_LAG ES ES RPART CTREE_LAG RPART ES CTREE_FOUR RPART STL+ES CTREE_LAG ES STL+ARIMA CTREE_LAG ES CTREE_LAG STL+ES STL+ARIMA RPART STL+ES ES RPART CTREE_FOUR STL+ES RPARTSTL+ARIMA CTREE_FOUR CTREE_FOUR CTREE_FOUR ES ES STL+ARIMA RPART CTREE_LAG CTREE_LAG STL+ARIMA RPART CTREE_LAG RPART STL+ARIMA STL+ES CTREE_FOUR ES STL+ARIMA STL+ES RPART STL+ES RPART STL+ES STL+ARIMA CTREE_LAG RPART ES ES CTREE_FOUR ES RPART CTREE_FOUR RPART ES CTREE_FOUR ES CTREE_LAG CTREE_LAG STL+ARIMA CTREE_LAG STL+ARIMA RPART ES STL+ES CTREE_FOUR CTREE_FOURSTL+ARIMA RPART STL+ES RPARTCTREE_FOUR CTREE_FOUR STL+ES STL+ARIMA ES STL+ES ES CTREE_LAG STL+ARIMA RPARTCTREE_LAG RPART CTREE_FOUR CTREE_LAG CTREE_FOUR CTREE_FOUR STL+ARIMA CTREE_FOUR STL+ES CTREE_LAG ES CTREE_LAG ES RPARTSTL+ARIMA CTREE_FOUR CTREE_LAG RPART CTREE_FOUR RPART RPART RPART STL+ARIMA CTREE_FOUR CTREE_LAG STL+ARIMA @ + $ % & y m # X −4 0 4 8 −10 0 10 PC1 PC2 Class @ # % $ & + m y X 1 2 3 4 5 6 7 8 Final 18
  • 20. Clustering consumers Aggregation with clustering 1. Set of time series of electricity consumption of the length of two weeks 2. Normalization (z-score) 3. Computation of representations of time series by MLR (extraction of D. and W. reg. coeff.) 4. Automatic determination of optimal number of clusters K (DB-index) 5. Clustering of representations (K-means with centroids initialization by K-means++) 6. Summation of K time series by found clusters 7. Training of K forecast models and the following forecast 8. Summation of forecasts and evaluation 9. Remove first day and add new one to the training window (sliding window approach), go to step 1 19
  • 21. Data from smart meters Ireland residences • 3639 consumers. Residences. • 48 measurements per day. • Test set from three months of year 2010 (February, May and September). 0 1 2 3 4 0 250 500 750 1000 Time Load(kW) 20
  • 22. Data from smart meters Slovak factories • 3630 consumers. Factories and enterprises. • 96 measurements per day. Aggregated to 48. • Test set from three months of years 2013 and 2014 (September, February and March, June). 20 40 60 0 250 500 750 1000 Time Load(kW) 21
  • 23. Evaluation of forecast The accuracy of the forecast of electricity consumption was measured by MAPE (Mean Absolute Percentage Error). MAPE = 100 × 1 n n∑ t=1 |xt − xt| xt , where xt is a real consumption, xt is a forecasted load and n is a length of the time series. 22
  • 24. Results - ensembles Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov. CART.bagg 3.7908 3.7964 3.1561 3.0993 CTREE.bagg.lag 3.8081 3.7599 2.9568 2.8730 CTREE.bagg.dft 3.6746 3.7103 3.0080 2.9341 STL+ARIMA.mbb 3.9344 3.9085 3.0325 2.9993 STL+ES.mbb 3.9901 4.0221 3.0306 3.0021 ES.mbb 4.0565 4.0723 2.9760 2.9446 Average 3.7034 3.6970 2.8312 2.8086 Median 3.6103 3.6046 2.8329 2.7980 AveMedians 3.6704 3.6771 2.8179 2.7901 K-means 4.3018 4.0189 2.9715 3.0916 DBSCAN 3.9752 3.7985 2.9352 2.7532 OPTICS 3.7482 3.7239 2.9253 2.7982 Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov. 0.0011 <0.0001 0.1379 0.2415 23
  • 25. Results - base vs. ensemble Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov. CART 3.8570 3.8502 3.1921 3.1416 CTREE.lag 3.9203 3.7523 2.9950 2.8954 CTREE.dft 4.0849 3.9214 3.1944 3.0096 STL+ARIMA 4.0718 3.8943 2.7567 2.7404 STL+ES 4.2750 4.1866 2.6887 2.6424 ES 4.8000 4.2219 2.3957 2.4672 Agg.-Irel. Clust.-Irel. Agg.-Slov. Clust.-Slov. CART.bagg 0.1113 0.0760 0.0075 0.0002 CTREE.bagg.lag 0.0001 0.4807 0.0004 0.0036 CTREE.bagg.dft <0.0001 <0.0001 <0.0001 0.0002 STL+ARIMA.mbb 0.0178 0.0592 0.9875 0.9996 STL+ES.mbb 0.0380 0.0330 0.9980 0.9995 ES.mbb <0.0001 0.2559 1.0000 1.0000 24
  • 26. When time series bootstrapping fails 1500 2000 2500 0 250 500 750 1000 Time Load(kW) 25
  • 27. When time series bootstrapping fails 1000 2000 3000 4000 5000 0 250 500 750 1000 Time Load(kW) 26
  • 28. Conclusion • The simple median aggregation of bootstrapped forecasts is very good approach. • Clustering-based ensembles is not always the best approach. • The bagging using STL decomposition and Box-Cox transformation fails when data are noisy. • Exponential smoothing state space models - follow it, use it! Can be combined with the Temporal Hierarchical Forecasting as well. Future work: • Develop more robust bagging methods for time series. • Develop more automatic techniques for density-based clustering. 27