This talk was recorded in London on October 30, 2018 and can be viewed here: https://youtu.be/EGVY7-Spv8E
Time series is a unique field in predictive modelling, where standard feature engineering techniques and models must be adapted to get the most accurate results. In this session we will examine some of the most important features of Driverless AI's newest Time Series recipe. The session will cover validation strategies, feature engineering, feature selection and modelling, and the capabilities will be showcased through several use cases.
Bio: Marios Michailidis is a Competitive Data Scientist at H2O.ai. He holds a BSc in Accounting and Finance from the University of Macedonia in Greece and an MSc in Risk Management from the University of Southampton. He has also nearly finished his PhD in machine learning at University College London (UCL), with a focus on ensemble modelling. He has worked in both the marketing and credit sectors in the UK market and has led many analytics projects with various themes, including acquisition, retention, recommenders, uplift modelling, fraud detection, portfolio optimization and more.
He is the creator of KazAnova, a freeware GUI for credit scoring and data mining made 100% in Java, and is also the creator of the StackNet Meta-Modelling Framework. In his spare time he loves competing in data science challenges, and he was ranked 1st out of 500,000 members on the popular Kaggle.com data competition platform. Here is a blog about Marios reaching the top rank on Kaggle and sharing his tricks and ideas.
Finally, Marios' LinkedIn profile can be found here, with more information about his current and past projects.
https://www.linkedin.com/in/mariosmichailidis/
Bio: A Kaggle Grandmaster and a Data Scientist at H2O.ai, Mathias Müller holds an AI- and ML-focused diploma (eq. M.Sc.) in computer science from Humboldt University in Berlin. During his studies, he worked keenly on computer vision in the context of bio-inspired visual navigation of autonomous flying quadrocopters. Prior to H2O.ai, he was a machine learning engineer at FSD Fahrzeugsystemdaten GmbH in the automotive sector. His stint with Kaggle was a chance encounter: he stumbled upon the data competition platform while looking for a more ML-focused alternative to TopCoder. This is where he entered his first predictive modeling competition and climbed the ladder to become a Grandmaster. He is an active contributor to XGBoost and is working on Driverless AI at H2O.ai.
Linkedin: https://www.linkedin.com/in/muellermat/
11. Date       Sales   Lag1   Lag2   Moving Average
    1/1/2018   100     -      -      -
    2/1/2018   150     100    -      100
    3/1/2018   160     150    100    125
    4/1/2018   200     160    150    155
    5/1/2018   210     200    160    180
    6/1/2018   150     210    200    205
    7/1/2018   160     150    210    180
    8/1/2018   120     160    150    155
    9/1/2018   80      120    160    140
    10/1/2018  70      80     120    100
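As a minimal sketch (using pandas, not Driverless AI's internal code), the lag and moving-average columns in the table above can be reproduced as follows; the column names are mine:

```python
import pandas as pd

# Monthly sales series from the slide (dates are day/month/year)
df = pd.DataFrame({
    "Date": pd.date_range("2018-01-01", periods=10, freq="MS"),
    "Sales": [100, 150, 160, 200, 210, 150, 160, 120, 80, 70],
})

# Lag features: the target value 1 and 2 steps in the past
df["Lag1"] = df["Sales"].shift(1)
df["Lag2"] = df["Sales"].shift(2)

# Moving average of the two most recent lags
# (NaNs are skipped, matching the slide's second row)
df["MovingAverage"] = df[["Lag1", "Lag2"]].mean(axis=1)

print(df)
```

Shifting by `k` naturally produces missing values for the first `k` rows, which is exactly the "-" pattern shown in the table.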
Feature Engineering (cont.)
• Exponentially Weighted Moving Averages (EWMA) of n-th order differentiated lags
• Aggregation of lags (mean, std, sums, etc.)
• Interactions of lags (e.g. Lag2 - Lag1)
• Linear regression on lags (taking slope and/or intercept as new feature)
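The bullet points above can be illustrated with a hedged sketch in pandas/numpy (parameter choices such as the EWMA span and the window length `k` are mine, not Driverless AI's):

```python
import numpy as np
import pandas as pd

sales = pd.Series([100, 150, 160, 200, 210, 150, 160, 120, 80, 70])

# EWMA of first-order differentiated lags (n = 1; span is an assumption)
ewma_diff = sales.diff(1).ewm(span=3, adjust=False).mean()

# Aggregations over a window of recent lags (mean, std, sums, etc.)
window = sales.rolling(3)
agg_mean, agg_std, agg_sum = window.mean(), window.std(), window.sum()

# Interaction of lags, e.g. Lag2 - Lag1
lag_interaction = sales.shift(2) - sales.shift(1)

def lag_slope(values, k=4):
    """Slope of a linear regression fitted on the last k lags,
    usable as a trend feature (intercept could be taken as well)."""
    y = values[-k:]
    x = np.arange(k)
    slope, intercept = np.polyfit(x, y, deg=1)
    return slope

print(lag_slope(sales.values))  # negative: the series trends down at the end
```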
12. Candidates for Lag-Sizes
• Ranking based on autocorrelation
• Pre-defined intervals (based on estimated frequency)
Daily data
• [7, 14, 21, …]
• [14, 28, 32, …]
• …
Weekly data
• [2, 4, 6, 8, …]
• [4, 8, 12, 16, …]
• …
…
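Both ideas can be sketched in a few lines of numpy (my own helper names, not Driverless AI's API): pre-defined intervals are just multiples of the estimated frequency, and candidates can then be ranked by the strength of the autocorrelation at each lag.

```python
import numpy as np

def candidate_lags(frequency, multiples=4):
    """Pre-defined lag-size intervals as multiples of the estimated
    frequency, e.g. frequency=7 for daily data with a weekly cycle."""
    return [frequency * i for i in range(1, multiples + 1)]

def rank_lags_by_autocorrelation(series, candidates):
    """Rank candidate lag sizes by absolute autocorrelation at that lag."""
    series = np.asarray(series, dtype=float)
    scores = {}
    for lag in candidates:
        if lag >= len(series):
            continue
        a, b = series[:-lag], series[lag:]
        scores[lag] = abs(np.corrcoef(a, b)[0, 1])
    return sorted(scores, key=scores.get, reverse=True)

daily = candidate_lags(7)   # [7, 14, 21, 28]
weekly = candidate_lags(2)  # [2, 4, 6, 8]

# A toy signal with period 7: lag 7 should rank first
periodic = np.sin(2 * np.pi * np.arange(70) / 7)
best = rank_lags_by_autocorrelation(periodic, [3, 7, 10])[0]
```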
13. Regularization of Lag-Features
• Lower bound for considered lag sizes
• Dropout
• Random replacement of actual lag values by "n.a."
• Align the frequency of available lag information between train and validation/test
• Target binning
• Decreases the number of possible splits a GBM can perform
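A hedged sketch of the two regularization ideas above (function names and parameters are mine; Driverless AI's implementation is not public in this form): lag dropout randomly blanks out lag values so the model cannot over-rely on any single lag, and target binning quantizes the target so a GBM has fewer distinct split points to exploit.

```python
import numpy as np
import pandas as pd

def lag_dropout(lag_frame, rate=0.1, seed=0):
    """Randomly replace actual lag values with NaN ("n.a."); this also
    mimics lags that may be unavailable at prediction time."""
    rng = np.random.default_rng(seed)
    mask = rng.random(lag_frame.shape) < rate
    return lag_frame.mask(mask)

lags = pd.DataFrame({
    "Lag1": [100.0, 150.0, 160.0, 200.0],
    "Lag2": [np.nan, 100.0, 150.0, 160.0],
})
regularized = lag_dropout(lags, rate=0.5)

# Target binning: quantize the target into a few quantile bins,
# limiting the split points a GBM can use
target = pd.Series([100, 150, 160, 200, 210, 150, 160, 120, 80, 70])
binned = pd.qcut(target, q=4, labels=False)
```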
15. Time Series Roadmap
• Data augmentation at time of prediction
• Signal processing and classification
16. Signal Processing and Classification

Original Frame:
Time  TG  Value  Target
1     A   A(1)   0
2     A   A(2)   0
3     A   A(3)   0
4     A   A(4)   0
5     A   A(5)   0
1     B   B(1)   1
2     B   B(2)   1
3     B   B(3)   1
4     B   B(4)   1
5     B   B(5)   1

Transformed Frame:
Feature 1  ..  Feature n  Target
F1(A)      ..  Fn(A)      0
F1(B)      ..  Fn(B)      1

• One label associated with each individual time series (signal)
• Feature engineering w.r.t. time (frequency-domain features, sliding-window aggregates, zero crossings, peak patterns, etc.)
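The frame transformation can be sketched as a groupby-aggregate in pandas (the example values, feature choices, and helper names are mine): each series identified by `TG` collapses to one row of features plus its single label.

```python
import numpy as np
import pandas as pd

# Original frame: one row per time step, two series identified by "TG"
original = pd.DataFrame({
    "Time":   [1, 2, 3, 4, 5, 1, 2, 3, 4, 5],
    "TG":     list("AAAAABBBBB"),
    "Value":  [0.1, 0.5, -0.2, 0.3, 0.0, 1.0, -1.0, 1.0, -1.0, 1.0],
    "Target": [0, 0, 0, 0, 0, 1, 1, 1, 1, 1],
})

def zero_crossings(x):
    """Number of sign changes in the signal."""
    x = np.asarray(x)
    return int(np.sum(np.diff(np.sign(x)) != 0))

# Transformed frame: one row per series, features computed w.r.t. time
transformed = original.groupby("TG").agg(
    mean_value=("Value", "mean"),
    std_value=("Value", "std"),
    crossings=("Value", zero_crossings),
    Target=("Target", "first"),
).reset_index()
```

After this transformation, any standard classifier can be trained on the per-series rows.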
17. Time Series Roadmap
• Data augmentation at time of prediction
• Signal Processing and Classification
• Additional algorithms (Prophet, "classical" methods, …)
• Ensembling
• Iterative learning (propagation of predictions to create lag-features)
• …