A Recurrent Neural Network (RNN) is used for the short-term prediction of weather radar images. The aim is to predict future rain fields.
The model structure has been carefully tailored to the problem to guide the model and make learning more efficient.
1. Recurrent Neural Network tailored for Weather Radar Nowcasting
Andreas Scheidegger
Eawag: Swiss Federal Institute of Aquatic Science and Technology
June 23, 2016
2. Andreas Scheidegger – Eawag
Weather Radar Nowcasting
[Figure: observed radar images t−4 ... t0 are fed to the nowcast model, which predicts the images t+1 ... t+5]
Available inputs: every pixel of the previous images
Desired outputs: every pixel of the next n images
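To make the input/output structure concrete, here is a minimal Python sketch of one nowcasting sample; the frame counts, image size, and the persistence placeholder model are illustrative assumptions, not values from the talk.

```python
import numpy as np

# Hypothetical nowcasting sample: map the last m radar images to the next n.
m, n = 5, 5            # number of input / output frames (assumed)
H, W = 128, 128        # pixels per radar image (assumed)

past   = np.random.rand(m, H, W)   # observed images t-4 ... t0
future = np.random.rand(n, H, W)   # target images t+1 ... t+5

def nowcast(past_images, n_steps):
    """Placeholder model: persistence forecast (repeat the last image)."""
    return np.repeat(past_images[-1][None], n_steps, axis=0)

prediction = nowcast(past, n)
assert prediction.shape == future.shape   # every pixel of the next n images
```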
4. Andreas Scheidegger – Eawag
Recurrent ANN
[Figure (built up over slides 4–6): the last observed image P_t and hidden state H_t enter the ANN, which returns the prediction P̂_t+1 and the updated state H_t+1; feeding each prediction and state back into the ANN yields P̂_t+2, P̂_t+3, P̂_t+4, ...]
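A minimal sketch of this unrolled recurrence; the `ann` step function, the state update, and all sizes are placeholder assumptions, not the model from the talk.

```python
import numpy as np

def ann(p, h):
    """Hypothetical one-step model: maps (image, hidden state) to
    (predicted next image, updated hidden state)."""
    h_new = np.tanh(h + p.mean())   # placeholder state update
    p_hat = p                       # placeholder prediction (persistence)
    return p_hat, h_new

def unroll(p_t, h_t, n_steps):
    """Feed each prediction back in: P̂_t+k+1, H_t+k+1 = ANN(P̂_t+k, H_t+k)."""
    predictions = []
    p, h = p_t, h_t
    for _ in range(n_steps):
        p, h = ann(p, h)
        predictions.append(p)
    return predictions

p_t = np.random.rand(128, 128)           # last observed image (size assumed)
h_t = np.zeros(64)                       # initial hidden state (size assumed)
p_hats = unroll(p_t, h_t, n_steps=4)     # P̂_t+1 ... P̂_t+4
```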
7. Andreas Scheidegger – Eawag
Model structure
[Figure: one ANN step mapping the image P_t and state H_t to the prediction P̂_t+1 and state H_t+1]
Traditional Artificial Neural Networks (ANN) are (nested) non-linear regressions:
v_j = σ(∑_k w_jk u_k + b_j)
How can we do better?
8. Andreas Scheidegger – Eawag
Model structure
How can we do better? → Tailor the model structure to the problem.
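Reading the formula above as code: each output v_j is a sigmoid of a weighted sum of all inputs u_k plus a bias. A minimal numpy sketch (the layer sizes are illustrative):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def dense_layer(u, W, b):
    """v_j = sigma(sum_k w_jk * u_k + b_j), computed for all j at once."""
    return sigmoid(W @ u + b)

rng = np.random.default_rng(0)
u = rng.normal(size=16)        # input vector u_k (size assumed)
W = rng.normal(size=(8, 16))   # weights w_jk
b = np.zeros(8)                # biases b_j
v = dense_layer(u, W, b)       # 8 outputs, each squashed into (0, 1)
```

Stacking several such layers, each feeding its outputs v into the next layer's inputs u, gives the nested non-linear regression mentioned above.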
12. Andreas Scheidegger – Eawag
Spatial Transformer
[Figure: input image and its spatially transformed output]
Jaderberg, M., Simonyan, K., Zisserman, A., and Kavukcuoglu, K. (2015). Spatial Transformer Networks. arXiv:1506.02025 [cs].
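The core idea of the Spatial Transformer module is a differentiable warp of the input by a learned transformation. A rough illustrative sketch of an affine warp with nearest-neighbour sampling (the actual module of Jaderberg et al. uses bilinear sampling and predicts the transform parameters with a sub-network; everything below is a simplified assumption):

```python
import numpy as np

def affine_warp(image, theta):
    """Sample `image` at locations given by the 2x3 affine matrix `theta`,
    mapping normalized output coordinates to input coordinates."""
    H, W = image.shape
    out = np.zeros_like(image)
    ys, xs = np.meshgrid(np.linspace(-1, 1, H), np.linspace(-1, 1, W),
                         indexing="ij")
    coords = np.stack([xs.ravel(), ys.ravel(), np.ones(H * W)])  # homogeneous
    x_src, y_src = theta @ coords                                # source coords
    # back to pixel indices, nearest-neighbour rounding
    i = np.clip(np.round((y_src + 1) / 2 * (H - 1)).astype(int), 0, H - 1)
    j = np.clip(np.round((x_src + 1) / 2 * (W - 1)).astype(int), 0, W - 1)
    out.flat[:] = image[i, j]
    return out

# identity plus a small translation to the right (illustrative)
theta = np.array([[1.0, 0.0, 0.2],
                  [0.0, 1.0, 0.0]])
warped = affine_warp(np.random.rand(64, 64), theta)
```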
27.–34. Andreas Scheidegger – Eawag
Conclusions and Outlook
● Deep learning may be a hype – but it's a useful tool nevertheless!
● Combine domain knowledge with data-driven modeling: v_j = σ(∑_k w_jk u_k + b_j)
● Online parameter adaptation: θ_t+1 = θ_t − λ ∇L(θ_t) (toy sketch below)
● Other objective functions? L(θ)
● Include other inputs
● Predict the prediction uncertainty
Interested in collaboration? andreas.scheidegger@eawag.ch
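A toy sketch of the online parameter adaptation bullet: one gradient step per newly arriving observation. The linear model, squared-error loss, and learning rate are placeholder assumptions.

```python
import numpy as np

def grad(theta, x, y):
    """Gradient of a placeholder squared-error loss of a linear toy model."""
    return 2 * x.T @ (x @ theta - y)

def online_update(theta, x, y, lam=1e-3):
    """theta_t+1 = theta_t - lambda * grad L(theta_t), one step per sample."""
    return theta - lam * grad(theta, x, y)

rng = np.random.default_rng(0)
theta = np.zeros(4)
for _ in range(100):                 # stand-in for a stream of new radar data
    x = rng.normal(size=(1, 4))      # one new input
    y = rng.normal(size=1)           # one new observation
    theta = online_update(theta, x, y)
```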
36. Andreas Scheidegger – Eawag
Training and Implementation
Training:
● Loss function: L(θ) = ∫₀^∞ RMS(τ; θ) p(τ) dτ (discretization sketched below)
● Stochastic Gradient Descent: θ_new = θ_old − λ ∇L(θ_old)
● Scheduled learning (Bengio et al., 2015)
● Regularisation: drop-out (Srivastava et al., 2014)
Implementation:
● Based on the Python library Chainer (Tokui et al., 2015)
● Runs on CUDA-compatible GPUs
Bengio, et al. (2015). Scheduled sampling for sequence prediction with recurrent neural networks. arXiv preprint arXiv:1506.03099.
Srivastava, et al. (2014). Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research 15, 1929–1958.
Tokui, et al. (2015). Chainer: a Next-Generation Open Source Framework for Deep Learning.
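A sketch of how the loss above might be evaluated: the integral over lead times τ is approximated by a weighted sum over the discrete forecast steps. The weights p and the toy data are assumptions for illustration.

```python
import numpy as np

def rms(pred, obs):
    """Root-mean-square error over all pixels of one image."""
    return np.sqrt(np.mean((pred - obs) ** 2))

def loss(preds, obs, p):
    """Discretized L(theta) = sum_tau RMS(tau; theta) * p(tau): the error at
    each lead time is weighted by an assumed importance density p."""
    return sum(rms(pr, ob) * w for pr, ob, w in zip(preds, obs, p))

# toy example: 5 predicted vs. observed images, earlier lead times weighted more
preds = [np.random.rand(64, 64) for _ in range(5)]
obs   = [np.random.rand(64, 64) for _ in range(5)]
p     = np.array([0.35, 0.25, 0.2, 0.12, 0.08])   # assumed weights, sum to 1
print(loss(preds, obs, p))
```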
37. Andreas Scheidegger – Eawag
Training
[Figure: the unrolled ANN during training; each step receives the observed image (P_t, P_t+1, P_t+2, ...) and the hidden state, and emits a training prediction P̂_t+1 ... P̂_t+4. Legend: observed images in, predicted images out.]
38. Andreas Scheidegger – Eawag
Scheduled Training
[Figure: as in the training figure, but each step receives either the observed image P_t+k or the model's own prediction P̂_t+k, chosen at random. Legend: observed or predicted images in, predicted images out.]
Bengio, et al. (2015). Scheduled sampling for sequence prediction with recurrent neural networks. arXiv preprint arXiv:1506.03099.
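A minimal sketch of the scheduled-sampling idea from Bengio et al.: with some probability each unrolled step consumes the model's own previous prediction instead of the observed image, and that probability is increased over training. The step function and schedule below are placeholder assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def ann_step(p, h):
    """Hypothetical one-step model, as in the earlier sketch."""
    return p, np.tanh(h + p.mean())

def scheduled_rollout(observed, h, eps):
    """Unroll over a training sequence; each step consumes the true image
    with probability (1 - eps), else the model's own previous prediction."""
    p_in, preds = observed[0], []
    for obs_next in observed[1:]:
        p_hat, h = ann_step(p_in, h)
        preds.append(p_hat)
        use_prediction = rng.random() < eps
        p_in = p_hat if use_prediction else obs_next
    return preds

observed = [np.random.rand(32, 32) for _ in range(6)]
for epoch in range(10):
    eps = epoch / 10        # anneal from teacher forcing to free running
    preds = scheduled_rollout(observed, np.zeros(8), eps)
```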
42. Andreas Scheidegger – Eawag
Traditional model structures
Estimate a velocity field and translate the last radar image (optical flow; toy sketch below)
→ seems to be the most "natural" approach
Identify rain cells, then track and extrapolate them
→ can model the growth and decay of cells
● Models can be difficult to tune
● Local features (e.g. a mountain range) cannot be modeled
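For intuition on the optical-flow approach above, a toy sketch: simply shift the last image by a fixed displacement per step. Real methods estimate a spatially varying velocity field from consecutive frames; the displacement and data here are assumptions.

```python
import numpy as np

def translate(image, dy, dx):
    """Shift an image by an integer displacement, padding with zeros
    (i.e. rain moving out of the frame disappears)."""
    out = np.zeros_like(image)
    H, W = image.shape
    ys  = slice(max(dy, 0), H + min(dy, 0))
    xs  = slice(max(dx, 0), W + min(dx, 0))
    ys0 = slice(max(-dy, 0), H + min(-dy, 0))
    xs0 = slice(max(-dx, 0), W + min(-dx, 0))
    out[ys, xs] = image[ys0, xs0]
    return out

last = np.random.rand(64, 64)   # last observed radar image (toy data)
dy, dx = 2, -1                  # displacement per time step (assumed constant)
forecast = [last]
for _ in range(5):              # extrapolate 5 steps ahead
    forecast.append(translate(forecast[-1], dy, dx))
```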