SlideShare a Scribd company logo
1 of 9
Download to read offline
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
DOI: 10.5121/ijdkp.2019.9602 13
DEEP LEARNING BASED MULTIPLE REGRESSION
TO PREDICT TOTAL COLUMN WATER VAPOR
(TCWV) FROM PHYSICAL PARAMETERS IN WEST
AFRICA BY USING KERAS LIBRARY.
Daouda DIOUF1
, Awa Niang1
and Sylvie Thiria2
1
Laboratoire de Traitement de l’Information (LTI) – ESP – Université Cheikh Anta Diop
de Dakar BP : 5085 Dakar-Fann (Sénégal)
2
Laboratoire d’Océanographie et du Climat : Expérimentations et Approches Numériques
(IPSL / LOCEAN) – Université Pierre et Marie Curie, 75252 Paris (France)
ABSTRACT
Total column water vapor is an important factor for the weather and climate. This study apply
deep learning based multiple regression to map the TCWV with elements that can improve
spatiotemporal prediction. In this study, we predict the TCWV with the use of ERA5 that is the
fifth generation ECMWF atmospheric reanalysis of the global climate. We use an appropriate
deep learning based multiple regression algorithm using Keras library to improve nonlinear
prediction between Total Column water vapor and predictors as Mean sea level pressure, Surface
pressure, Sea surface temperature, 100 metre U wind component, 100 metre V wind component,
10 metre U wind component, 10 metre V wind component, 2 metre dew point temperature, 2
metre temperature.
The results obtained permit to build a predictor which modelling TCWV with a mean abs error 
(MAE) equal to 3.60 kg/m2
and a coefficient of determination R2
 equal to 0.90.
1. INTRODUCTION
Water vapor is the most abundant greenhouse gas and is a good factor for the weather and climate
[1][2]. The heating rate and circulation of the atmosphere depend greatly to the TCWV through
the condensation of the latter into clouds. The atmospheric composition can be affected also by
the TCWV via the photochemical reactions. A good prediction and monitoring of weather,
climate and a better understanding atmospheric physics and chemistry go through a better
knowledge of the TCWV that is highly variable in space and time.
At present, TCWV, also known as TPW (Total Precipitable Water), is retrieved from various
imager remote sensing as AMSU on board the POES and METOP polar-orbiting satellites, SSM/I
on board the DMSP F-13 satellite, from the sounders as GOES and ground-based Global
Positioning System (GPS) equipment [3], [4].
The aim of this paper is to predict the Total Column water vapor(TCWV) from climate
parameters in West Africa(Figure 1). The following variables were used: Mean sea level
pressure, Surface pressure, Sea surface temperature, 100 metre U wind component, 100 metre V
wind component, 10 metre U wind component, 10 metre V wind component, 2 metre dewpoint
temperature, 2 metre temperature.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
14
Figure1: Area of study and transect (in yellow) on latitude 0°N, 15°N and 30°N
Due to its high computing power, machine learning has shown a particular interest in processing
and understanding of large and multifunctional data [5]. In the case of environmental data, these
are often complex and highly non-linear. From this nonlinearity and complexity of data, we aim
to build a deep learning model able to model the TCWV parameter from other parameters.
2. DATASET
The dataset we used are from European Centre for Medium-Range Weather Forecasts, ERA5
Reanalysis. These dataset are taken in an area of the West Africa, between -5°N and 34°N and -
34°W and 35°W. These measurements extend the period of January 2004 to October 2018.
The learning dataset describes nine (09) and is concerned with modeling the Total column
watervapor. These 09 parameters are noted by x and the TCWV by y .
To avoid the over fitting, we random the data.
Then 36 370 741 of pixel taken from 2004 to 2006 are random. From these random value, we
take the 1% for train data and 0,5 % for test data.
The model is trained to predict the outputs and generalize to other non-trained data. Test data is
used to test the accuracy of the model. It is to build, by learning, a neural model able to find the
TCWV from input data.
For comparison we also used monthly observations data on GOME-2 instrument on board of the
MetOp-A satellite.
3. NEURAL NETWORK MODEL
A deep learning based multiple regression network that consist an input layer, a multi-hidden
layer with more the one hidden layer and an output layer. The nodes are fully connected. The
number of input layer nodes is equal to the number of features of the input data. The more hidden
layers, the higher the number of features needed to reduce the influence of under fitting or over
fitting. Each hidden layer node is composed of neurons. The neurons contain both rectifier
activation and aggregation function, when constructing the deep learning multiple regression
model, the activation function in the default neuron is the Rectified Linear activation function,
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
15
making the deep learning network neurons have sparse characteristics, which reduces the
influence of overfitting while increasing the depth of the network, improving the training speed of
the model, and effectively overcoming the problem of gradient disappearance. This function that
we must define is responsible for creating the neural network model to be evaluated [6].
3.1 Deep Learning Based Regression
A deep learning estimator is essentially based on the distributed representation, this mean that an
output data is due to the interactions of various component sat different levels [7]. In this study,
the deep learning estimator is organized in two training procedures, with a pre-learning and
tuning with respect to the target TCWV.
3.2 Neural network model
We train the neural network by defining a sequential keras model. We are using the 09 inputs
variables as Mean sea level pressure, Surface pressure, Sea surface temperature, 100 metre U
wind component, 100 metre V wind component, 10 metre U wind component, 10 metre V wind
component, 2 metre dewpoint temperature, 2 metre temperature. These 09 input features are fully
connected to a first dense hidden layer of 64 (L1), this one fully connected to a second hidden
layer of 32 neurons (L2), and finally using the activation function, the Rectified Linear Unit
(ReLU), to process the output (Total column water vapor). ReLU are defined as f(x) = max(0,x)
and are used with minibatch size of 64.
The workflow for training the model is simple.
We want to estimate a )(xgy  function )( RyetRx p
 but by knowing only certain
realizations of this function:     Nnyx nn ...1,  . This set is called learning set. The purpose
of the learning is to estimate the weights of the network so that the output function noted F best
approaches the realizations of g . It is therefore a question of minimizing the following function
so-called cost function:
 
n
nn wxFywJ
2
),()( where w is the set of weights.
Since the cost function is the sum over all the realizations nn yx , , the gradient must be
calculated for each of the realizations. Note n
J the partial cost function corresponding to the
realization n:
2
)( nn
n
xFyJ 
Let the error observed n
J for the output neuron j and the training data n. The gradient with
respect to the output yj of the neuron is:
Indeed, knowing the gradient with respect to the outputs of all the neurons of a layer k makes it
possible to calculate the gradients with respect to the outputs of the neurons of the antecedent
layer k-1:
  





 


j j
k
j
k
ij
k
jk
i
k
j
k
j
n
k
i
vfw
y
y
y
J
)('1
1
1

But it is easy to know the gradient of the cost function with respect to the output neuron. In our
case, the quadratic cost function is:
j
n
j
y
J



International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
16
).(2
)( 2
Yy
Y
Yy
Y
J
n
n
n






And so by backward propagation, first in the output layer, then in the hidden layers, we can
calculate the gradient n
J with respect to each of the weights of the network.
3.3Tune the neural network
We have specified 140 epochs for our model. For this deep learning model, we choose Adam as
an optimization algorithm [8]. Adam is an optimization algorithm that can used instead of the
classical stochastic gradient [9] descent procedure to update network weights iterative based in
training data. Adam is combining the advantages of two other extensions of stochastic gradient
descent, specifically the Adaptive Gradient Algorithm (AdaGrad) and Root Mean Square
Propagation (RMSProp).
4. RESULTS
Test data is used to test the prediction accuracy of the model. This model is used to predict
TCWV from dependent or independent variables.
The accuracy on the learning set is 90.47% and the validation accuracy is 90.23%. The learning
mean abs error is 3.60 kg/m2
and the validation mean abs erroris 3.45 kg/m2
. In the figures below,
the scatter plot between the target retrieved from training features and the real target are quite
good. Most of the prediction error less than |5 kg/m2
|.
Figure 2: Scatter plot of predicted TCWV versus
true TCWV
Figure 3: Error prediction
4.1 Validation against dependent data sets
We compared two data sets of total column water vapor that did not participate in learning phase
to the measure from ERA-5 at the same date. Figure 4 show a comparison of TCWV predicted
and TCWV measured above both land and ocean on January 2004. The global mean bias between
the two data sets is quite small: 0.10 kg/m2
. Then, the TCWV retrieval from the others parameters
by using neural network are obtained with good accuracy. January mean TCWV ranges from 0.5
to 57 kg/m2
. We denoted maximum values between -5°N to 5°N.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
17
4.2 Validation against independent data sets
Comparison between the Total Column Water Vapor (TCWV) retrieved with the GOME-2
instrument on board of the MetOp-A satellite(c), the retrieved TCWV from model with using the
ECMWF ERA-5 parameters reanalysis (b) and the measured TCWV of ECMWF ERA-5 (a) in
May 2007 can be seen in figure 5. The patterns for the three boxes are very similar. We can
observe that the highest values are all located between -5°N and 10°N.
Figure 4: Map of the TCWV ECMWF ERA-5 analysis (a) with the corresponding retrieved from the neural
network model(b) in January 2004.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
18
Figure 5:Map of the TCWV ECMWF ERA-5 analysis (a) with the corresponding retrieved from the neural
network model (b) and GOME2 observations data (c) in May 2007.
The results shows the accuracy of the neural model to retrieved total column water vapor from
few parameters. The figure 5 permit us to see that there is not more difference between the
measured values (a) and the predicted values (b) but these last two have little difference with (c).
We can see that the water vapor patterns over land and ocean are clearly visible with moist
Intertropical Convergence Zone near the equatorial regions.
We are plotting the annual TCWV average retrieved for years 2004 and 2005. The plots concern
the latitudinal transect at 0°N, 15°N and 30°NofTCWV, outputted by the neural network model
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
19
using the annual average of the nine parameters as inputs for years 2004 and 2005, and compared
by the annual TCWV ECMWF ERA-5 analysis average(figure 6). We also calculate the
corresponding performances between the predicted annual TCWV average and the annual TCWV
ECMWF ERA-5 analysis average at three latitudes for years 2004 and 2005 (Tab.1 and Tab.3). In
addition, the performance of predicted annual TCWV average and those of the GOME2
observations TCWV are calculate and compared (Tab.2 and Tab.4). For these correlations, there
are all quite high (> 90%) except at latitude 0°N when they are around 60-70%. From tab.1 to
tab.4, we can observe the lowness of standard deviation for the years 2004 and 2005.
Figure 6: AnnualTCWV average for year 2004 (left) and 2005 (right) at different latitudes
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
20
Tab.1: Predicted TCWV vs.TCWV ECMWF ERA-5 analysis for 2004
Standard deviation (kg/m2) Correlation (%)
Latitude 0° N 6.73 93.32
Latitude 15° N 4.52 76.68
Latitude 30° N 5.18 94.78
Tab.2: Predicted TCWV vs.TCWV GOME2 observations data for 2004
Standard deviation (kg/m2) Correlation (%)
Latitude 0° N 2.61 60.33
Latitude 15° N 7.76 91.59
Latitude 30° N 5.12 90.44
Tab.3: Predicted TCWV vs.TCWV ECMWF ERA-5 analysis for 2005
Standard deviation (kg/m2) Correlation (%)
Latitude 0° N 6.23 76.62
Latitude 15° N 7.75 95.26
Latitude 30° N 6.42 92.21
Tab.4: Predicted TCWV vs. TCWV GOME2 observations data for 2005
Standard deviation (kg/m2) Correlation (%)
Latitude 0° N 2.61 63.4
Latitude 15° N 7.57 93.06
Latitude 30° N 4.6 92.05
5. CONCLUSION
In this paper, a focus was made on the ability of deep learning to predict the TCWV by using
geophysical parameters as Mean sea level pressure, Surface pressure, Sea surface temperature,
100 metre U wind component, 100 metre V wind component, 10 metre U wind component, 10
metre V wind component, 2 metre dewpoint temperature, 2 metre temperature. We analyze the
retrieved TCWV and compare its results with Gome2 observations. There are high precision with
a mean global bias equal to 0.10 km/m2
and the MAEis 3.41 kg/m2
. The annual prediction
average of TCWV for three transects at 0°N, 15°N and 30°N compared to real measurement show
good result about the effective of the deep neural regression model.
Acknowledgements
European Centre for Medium-Range Weather Forecasts. 2017, updated monthly. ERA5
Reanalysis. Research Data Archive at the National Center for Atmospheric Research,
Computational and Information Systems Laboratory. https://doi.org/10.5065/D6X34W69.
Accessed 05 Feb. 2019.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019
21
REFERENCES
[1] IPCC, 2007: Climate Change 2007: The Physical Science Basis Contribution of Working Group I to
the Fourth Assessment Report of the Intergovernmental Panel on Climate Change [ S.Solomon, D.
Qin, M. Manning, Z. Chen, M. Marquis, K.B. Averyt, M.Tignor and H.L. Miller (eds.),], Cambridge
University Press, Cambridge, United Kingdom and New York, NY, USA.
https://www.ipcc.ch/report/ar4/syr/, 2007
[2] S. B. Mockler, 1995: Special report: Water vapor in the climate system. Amer. Geophys. Union.
[Available online at www.eso.org/gen-fac/pubs/astclim/espas/pwv/mockler.html.]
[3] Forsythe, J., S. Kidder, S. Kusselson, A.S. Jones,T.H. Vonder Haar, 2009: Increasing the
landcoverage of blended multisensor total precipitablewater products for weather analysis.
16thConference on Satellite Meteorology andOceanography, Phoenix, Arizona.
http://ams.confex.com/ams/89annual/techprogram/paper_149348.htm
[4] J. Shi, C. Xu, J. Guo, and Y. Gao, “Real-time GPS precise pointpositioning-based precipitable water
vapor estimation forrainfall monitoring and forecasting,”IEEE Transactions onGeoscience and
Remote Sensing, vol. 53, no. 6, pp. 3452–3459,2015.
[5] Hinton, G.; Osindero, S.; Teh, Y.W. A fast learning algorithm for deep belief nets.Neural
Comput.2006,18, 1527–1554.
[6] N.K. Manaswi (2018) Regression to MLP in Keras. In: Deep Learning with Applications Using
Python. Apress, Berkeley, CA
[7] Y.Bengio, Learning deep architectures for AI.Foundat. Trends Mach. Learn.2009,2, 1–127.
[8] D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization” in International Conference
on Learning Representations, 2015
[9] Y. A. Lecun, L. Bottou, Y. Bengio and P. Haffner, Gradient-based learning applied to document
recognition, in Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, Nov 1998.

More Related Content

What's hot

Nonlinear filtering approaches to field mapping by sampling using mobile sensors
Nonlinear filtering approaches to field mapping by sampling using mobile sensorsNonlinear filtering approaches to field mapping by sampling using mobile sensors
Nonlinear filtering approaches to field mapping by sampling using mobile sensorsijassn
 
Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...
Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...
Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...Maosi Chen
 
WE2.TO9.4.ppt
WE2.TO9.4.pptWE2.TO9.4.ppt
WE2.TO9.4.pptgrssieee
 
Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...
Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...
Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...IJERDJOURNAL
 
RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)Carlos Pinto
 
A Testbed for Image Synthesis
A Testbed for Image SynthesisA Testbed for Image Synthesis
A Testbed for Image SynthesisBen Trumbore
 
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...ijaia
 
AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF
AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF
AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF ijcseit
 
2 ShengleiZhang_IGARSS2011_MO3.T04.2.ppt
2 ShengleiZhang_IGARSS2011_MO3.T04.2.ppt2 ShengleiZhang_IGARSS2011_MO3.T04.2.ppt
2 ShengleiZhang_IGARSS2011_MO3.T04.2.pptgrssieee
 

What's hot (14)

Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Nonlinear filtering approaches to field mapping by sampling using mobile sensors
Nonlinear filtering approaches to field mapping by sampling using mobile sensorsNonlinear filtering approaches to field mapping by sampling using mobile sensors
Nonlinear filtering approaches to field mapping by sampling using mobile sensors
 
Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...
Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...
Agu maosi chen h31g-1590 retrieval of surface ozone from uv-mfrsr irradiances...
 
08039246
0803924608039246
08039246
 
WE2.TO9.4.ppt
WE2.TO9.4.pptWE2.TO9.4.ppt
WE2.TO9.4.ppt
 
Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...
Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...
Hourly Groundwater Modelling In Tidal Lowlands Areas Using Extreme Learning M...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)
 
A Testbed for Image Synthesis
A Testbed for Image SynthesisA Testbed for Image Synthesis
A Testbed for Image Synthesis
 
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
 
AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF
AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF
AUTOMATIC IDENTIFICATION OF CLOUD COVER REGIONS USING SURF
 
2 ShengleiZhang_IGARSS2011_MO3.T04.2.ppt
2 ShengleiZhang_IGARSS2011_MO3.T04.2.ppt2 ShengleiZhang_IGARSS2011_MO3.T04.2.ppt
2 ShengleiZhang_IGARSS2011_MO3.T04.2.ppt
 

Similar to DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (TCWV) FROM PHYSICAL PARAMETERS IN WEST AFRICA BY USING KERAS LIBRARY.

MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...gerogepatton
 
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...gerogepatton
 
Neural wavelet based hybrid model for short-term load forecasting
Neural wavelet based hybrid model for short-term load forecastingNeural wavelet based hybrid model for short-term load forecasting
Neural wavelet based hybrid model for short-term load forecastingAlexander Decker
 
The prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP typeThe prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP typeIOSR Journals
 
Short term load forecasting using hybrid neuro wavelet model
Short term load forecasting using hybrid neuro  wavelet modelShort term load forecasting using hybrid neuro  wavelet model
Short term load forecasting using hybrid neuro wavelet modelIAEME Publication
 
Prediction of Extreme Wind Speed Using Artificial Neural Network Approach
Prediction of Extreme Wind Speed Using Artificial Neural  Network ApproachPrediction of Extreme Wind Speed Using Artificial Neural  Network Approach
Prediction of Extreme Wind Speed Using Artificial Neural Network ApproachScientific Review SR
 
Application of extreme learning machine for estimating solar radiation from s...
Application of extreme learning machine for estimating solar radiation from s...Application of extreme learning machine for estimating solar radiation from s...
Application of extreme learning machine for estimating solar radiation from s...mehmet şahin
 
Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...
Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...
Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...journalBEEI
 
Short-term load forecasting with using multiple linear regression
Short-term load forecasting with using multiple  linear regression Short-term load forecasting with using multiple  linear regression
Short-term load forecasting with using multiple linear regression IJECEIAES
 
The quality of data and the accuracy of energy generation forecast by artific...
The quality of data and the accuracy of energy generation forecast by artific...The quality of data and the accuracy of energy generation forecast by artific...
The quality of data and the accuracy of energy generation forecast by artific...IJECEIAES
 
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
IRJET -  	  Intelligent Weather Forecasting using Machine Learning TechniquesIRJET -  	  Intelligent Weather Forecasting using Machine Learning Techniques
IRJET - Intelligent Weather Forecasting using Machine Learning TechniquesIRJET Journal
 
Optimal artificial neural network configurations for hourly solar irradiation...
Optimal artificial neural network configurations for hourly solar irradiation...Optimal artificial neural network configurations for hourly solar irradiation...
Optimal artificial neural network configurations for hourly solar irradiation...IJECEIAES
 
Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...
Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...
Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...cscpconf
 
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...csandit
 
Short Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkShort Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkIJERA Editor
 
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor For Real Data Set ...
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor  For Real Data Set ...Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor  For Real Data Set ...
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor For Real Data Set ...IJMER
 

Similar to DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (TCWV) FROM PHYSICAL PARAMETERS IN WEST AFRICA BY USING KERAS LIBRARY. (20)

MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
 
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
 
E012252736
E012252736E012252736
E012252736
 
Neural wavelet based hybrid model for short-term load forecasting
Neural wavelet based hybrid model for short-term load forecastingNeural wavelet based hybrid model for short-term load forecasting
Neural wavelet based hybrid model for short-term load forecasting
 
The prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP typeThe prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP type
 
Short term load forecasting using hybrid neuro wavelet model
Short term load forecasting using hybrid neuro  wavelet modelShort term load forecasting using hybrid neuro  wavelet model
Short term load forecasting using hybrid neuro wavelet model
 
Prediction of Extreme Wind Speed Using Artificial Neural Network Approach
Prediction of Extreme Wind Speed Using Artificial Neural  Network ApproachPrediction of Extreme Wind Speed Using Artificial Neural  Network Approach
Prediction of Extreme Wind Speed Using Artificial Neural Network Approach
 
Application of extreme learning machine for estimating solar radiation from s...
Application of extreme learning machine for estimating solar radiation from s...Application of extreme learning machine for estimating solar radiation from s...
Application of extreme learning machine for estimating solar radiation from s...
 
Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...
Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...
Comparison of Tropical Thunderstorm Estimation between Multiple Linear Regres...
 
Short-term load forecasting with using multiple linear regression
Short-term load forecasting with using multiple  linear regression Short-term load forecasting with using multiple  linear regression
Short-term load forecasting with using multiple linear regression
 
Paper18
Paper18Paper18
Paper18
 
The quality of data and the accuracy of energy generation forecast by artific...
The quality of data and the accuracy of energy generation forecast by artific...The quality of data and the accuracy of energy generation forecast by artific...
The quality of data and the accuracy of energy generation forecast by artific...
 
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
IRJET -  	  Intelligent Weather Forecasting using Machine Learning TechniquesIRJET -  	  Intelligent Weather Forecasting using Machine Learning Techniques
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
 
Optimal artificial neural network configurations for hourly solar irradiation...
Optimal artificial neural network configurations for hourly solar irradiation...Optimal artificial neural network configurations for hourly solar irradiation...
Optimal artificial neural network configurations for hourly solar irradiation...
 
Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...
Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...
Neural Networks for High Performance Time-Delay Estimation and Acoustic Sourc...
 
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...
 
Path loss prediction
Path loss predictionPath loss prediction
Path loss prediction
 
Short Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural NetworkShort Term Electrical Load Forecasting by Artificial Neural Network
Short Term Electrical Load Forecasting by Artificial Neural Network
 
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor For Real Data Set ...
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor  For Real Data Set ...Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor  For Real Data Set ...
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor For Real Data Set ...
 
Very-Short Term Wind Power Forecasting through Wavelet Based ANFIS
Very-Short Term Wind Power Forecasting through Wavelet Based ANFISVery-Short Term Wind Power Forecasting through Wavelet Based ANFIS
Very-Short Term Wind Power Forecasting through Wavelet Based ANFIS
 

Recently uploaded

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...Amil baba
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Servicemeghakumariji156
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersMairaAshraf6
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Call Girls Mumbai
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...drmkjayanthikannan
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptNANDHAKUMARA10
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxSCMS School of Architecture
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEselvakumar948
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesMayuraD1
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilVinayVitekari
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1
 

Recently uploaded (20)

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to Computers
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech Civil
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 

DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (TCWV) FROM PHYSICAL PARAMETERS IN WEST AFRICA BY USING KERAS LIBRARY.

  • 1. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 DOI: 10.5121/ijdkp.2019.9602 13 DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (TCWV) FROM PHYSICAL PARAMETERS IN WEST AFRICA BY USING KERAS LIBRARY. Daouda DIOUF1 , Awa Niang1 and Sylvie Thiria2 1 Laboratoire de Traitement de l’Information (LTI) – ESP – Université Cheikh Anta Diop de Dakar BP : 5085 Dakar-Fann (Sénégal) 2 Laboratoire d’Océanographie et du Climat : Expérimentations et Approches Numériques (IPSL / LOCEAN) – Université Pierre et Marie Curie, 75252 Paris (France) ABSTRACT Total column water vapor is an important factor for the weather and climate. This study apply deep learning based multiple regression to map the TCWV with elements that can improve spatiotemporal prediction. In this study, we predict the TCWV with the use of ERA5 that is the fifth generation ECMWF atmospheric reanalysis of the global climate. We use an appropriate deep learning based multiple regression algorithm using Keras library to improve nonlinear prediction between Total Column water vapor and predictors as Mean sea level pressure, Surface pressure, Sea surface temperature, 100 metre U wind component, 100 metre V wind component, 10 metre U wind component, 10 metre V wind component, 2 metre dew point temperature, 2 metre temperature. The results obtained permit to build a predictor which modelling TCWV with a mean abs error  (MAE) equal to 3.60 kg/m2 and a coefficient of determination R2  equal to 0.90. 1. INTRODUCTION Water vapor is the most abundant greenhouse gas and is a good factor for the weather and climate [1][2]. The heating rate and circulation of the atmosphere depend greatly to the TCWV through the condensation of the latter into clouds. The atmospheric composition can be affected also by the TCWV via the photochemical reactions. A good prediction and monitoring of weather, climate and a better understanding atmospheric physics and chemistry go through a better knowledge of the TCWV that is highly variable in space and time. At present, TCWV, also known as TPW (Total Precipitable Water), is retrieved from various imager remote sensing as AMSU on board the POES and METOP polar-orbiting satellites, SSM/I on board the DMSP F-13 satellite, from the sounders as GOES and ground-based Global Positioning System (GPS) equipment [3], [4]. The aim of this paper is to predict the Total Column water vapor(TCWV) from climate parameters in West Africa(Figure 1). The following variables were used: Mean sea level pressure, Surface pressure, Sea surface temperature, 100 metre U wind component, 100 metre V wind component, 10 metre U wind component, 10 metre V wind component, 2 metre dewpoint temperature, 2 metre temperature.
  • 2. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 14 Figure1: Area of study and transect (in yellow) on latitude 0°N, 15°N and 30°N Due to its high computing power, machine learning has shown a particular interest in processing and understanding of large and multifunctional data [5]. In the case of environmental data, these are often complex and highly non-linear. From this nonlinearity and complexity of data, we aim to build a deep learning model able to model the TCWV parameter from other parameters. 2. DATASET The dataset we used are from European Centre for Medium-Range Weather Forecasts, ERA5 Reanalysis. These dataset are taken in an area of the West Africa, between -5°N and 34°N and - 34°W and 35°W. These measurements extend the period of January 2004 to October 2018. The learning dataset describes nine (09) and is concerned with modeling the Total column watervapor. These 09 parameters are noted by x and the TCWV by y . To avoid the over fitting, we random the data. Then 36 370 741 of pixel taken from 2004 to 2006 are random. From these random value, we take the 1% for train data and 0,5 % for test data. The model is trained to predict the outputs and generalize to other non-trained data. Test data is used to test the accuracy of the model. It is to build, by learning, a neural model able to find the TCWV from input data. For comparison we also used monthly observations data on GOME-2 instrument on board of the MetOp-A satellite. 3. NEURAL NETWORK MODEL A deep learning based multiple regression network that consist an input layer, a multi-hidden layer with more the one hidden layer and an output layer. The nodes are fully connected. The number of input layer nodes is equal to the number of features of the input data. The more hidden layers, the higher the number of features needed to reduce the influence of under fitting or over fitting. Each hidden layer node is composed of neurons. The neurons contain both rectifier activation and aggregation function, when constructing the deep learning multiple regression model, the activation function in the default neuron is the Rectified Linear activation function,
  • 3. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 15 making the deep learning network neurons have sparse characteristics, which reduces the influence of overfitting while increasing the depth of the network, improving the training speed of the model, and effectively overcoming the problem of gradient disappearance. This function that we must define is responsible for creating the neural network model to be evaluated [6]. 3.1 Deep Learning Based Regression A deep learning estimator is essentially based on the distributed representation, this mean that an output data is due to the interactions of various component sat different levels [7]. In this study, the deep learning estimator is organized in two training procedures, with a pre-learning and tuning with respect to the target TCWV. 3.2 Neural network model We train the neural network by defining a sequential keras model. We are using the 09 inputs variables as Mean sea level pressure, Surface pressure, Sea surface temperature, 100 metre U wind component, 100 metre V wind component, 10 metre U wind component, 10 metre V wind component, 2 metre dewpoint temperature, 2 metre temperature. These 09 input features are fully connected to a first dense hidden layer of 64 (L1), this one fully connected to a second hidden layer of 32 neurons (L2), and finally using the activation function, the Rectified Linear Unit (ReLU), to process the output (Total column water vapor). ReLU are defined as f(x) = max(0,x) and are used with minibatch size of 64. The workflow for training the model is simple. We want to estimate a )(xgy  function )( RyetRx p  but by knowing only certain realizations of this function:     Nnyx nn ...1,  . This set is called learning set. The purpose of the learning is to estimate the weights of the network so that the output function noted F best approaches the realizations of g . It is therefore a question of minimizing the following function so-called cost function:   n nn wxFywJ 2 ),()( where w is the set of weights. Since the cost function is the sum over all the realizations nn yx , , the gradient must be calculated for each of the realizations. Note n J the partial cost function corresponding to the realization n: 2 )( nn n xFyJ  Let the error observed n J for the output neuron j and the training data n. The gradient with respect to the output yj of the neuron is: Indeed, knowing the gradient with respect to the outputs of all the neurons of a layer k makes it possible to calculate the gradients with respect to the outputs of the neurons of the antecedent layer k-1:             j j k j k ij k jk i k j k j n k i vfw y y y J )('1 1 1  But it is easy to know the gradient of the cost function with respect to the output neuron. In our case, the quadratic cost function is: j n j y J   
  • 4. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 16 ).(2 )( 2 Yy Y Yy Y J n n n       And so by backward propagation, first in the output layer, then in the hidden layers, we can calculate the gradient n J with respect to each of the weights of the network. 3.3Tune the neural network We have specified 140 epochs for our model. For this deep learning model, we choose Adam as an optimization algorithm [8]. Adam is an optimization algorithm that can used instead of the classical stochastic gradient [9] descent procedure to update network weights iterative based in training data. Adam is combining the advantages of two other extensions of stochastic gradient descent, specifically the Adaptive Gradient Algorithm (AdaGrad) and Root Mean Square Propagation (RMSProp). 4. RESULTS Test data is used to test the prediction accuracy of the model. This model is used to predict TCWV from dependent or independent variables. The accuracy on the learning set is 90.47% and the validation accuracy is 90.23%. The learning mean abs error is 3.60 kg/m2 and the validation mean abs erroris 3.45 kg/m2 . In the figures below, the scatter plot between the target retrieved from training features and the real target are quite good. Most of the prediction error less than |5 kg/m2 |. Figure 2: Scatter plot of predicted TCWV versus true TCWV Figure 3: Error prediction 4.1 Validation against dependent data sets We compared two data sets of total column water vapor that did not participate in learning phase to the measure from ERA-5 at the same date. Figure 4 show a comparison of TCWV predicted and TCWV measured above both land and ocean on January 2004. The global mean bias between the two data sets is quite small: 0.10 kg/m2 . Then, the TCWV retrieval from the others parameters by using neural network are obtained with good accuracy. January mean TCWV ranges from 0.5 to 57 kg/m2 . We denoted maximum values between -5°N to 5°N.
  • 5. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 17 4.2 Validation against independent data sets Comparison between the Total Column Water Vapor (TCWV) retrieved with the GOME-2 instrument on board of the MetOp-A satellite(c), the retrieved TCWV from model with using the ECMWF ERA-5 parameters reanalysis (b) and the measured TCWV of ECMWF ERA-5 (a) in May 2007 can be seen in figure 5. The patterns for the three boxes are very similar. We can observe that the highest values are all located between -5°N and 10°N. Figure 4: Map of the TCWV ECMWF ERA-5 analysis (a) with the corresponding retrieved from the neural network model(b) in January 2004.
  • 6. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 18 Figure 5:Map of the TCWV ECMWF ERA-5 analysis (a) with the corresponding retrieved from the neural network model (b) and GOME2 observations data (c) in May 2007. The results shows the accuracy of the neural model to retrieved total column water vapor from few parameters. The figure 5 permit us to see that there is not more difference between the measured values (a) and the predicted values (b) but these last two have little difference with (c). We can see that the water vapor patterns over land and ocean are clearly visible with moist Intertropical Convergence Zone near the equatorial regions. We are plotting the annual TCWV average retrieved for years 2004 and 2005. The plots concern the latitudinal transect at 0°N, 15°N and 30°NofTCWV, outputted by the neural network model
  • 7. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 19 using the annual average of the nine parameters as inputs for years 2004 and 2005, and compared by the annual TCWV ECMWF ERA-5 analysis average(figure 6). We also calculate the corresponding performances between the predicted annual TCWV average and the annual TCWV ECMWF ERA-5 analysis average at three latitudes for years 2004 and 2005 (Tab.1 and Tab.3). In addition, the performance of predicted annual TCWV average and those of the GOME2 observations TCWV are calculate and compared (Tab.2 and Tab.4). For these correlations, there are all quite high (> 90%) except at latitude 0°N when they are around 60-70%. From tab.1 to tab.4, we can observe the lowness of standard deviation for the years 2004 and 2005. Figure 6: AnnualTCWV average for year 2004 (left) and 2005 (right) at different latitudes
  • 8. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 20 Tab.1: Predicted TCWV vs.TCWV ECMWF ERA-5 analysis for 2004 Standard deviation (kg/m2) Correlation (%) Latitude 0° N 6.73 93.32 Latitude 15° N 4.52 76.68 Latitude 30° N 5.18 94.78 Tab.2: Predicted TCWV vs.TCWV GOME2 observations data for 2004 Standard deviation (kg/m2) Correlation (%) Latitude 0° N 2.61 60.33 Latitude 15° N 7.76 91.59 Latitude 30° N 5.12 90.44 Tab.3: Predicted TCWV vs.TCWV ECMWF ERA-5 analysis for 2005 Standard deviation (kg/m2) Correlation (%) Latitude 0° N 6.23 76.62 Latitude 15° N 7.75 95.26 Latitude 30° N 6.42 92.21 Tab.4: Predicted TCWV vs. TCWV GOME2 observations data for 2005 Standard deviation (kg/m2) Correlation (%) Latitude 0° N 2.61 63.4 Latitude 15° N 7.57 93.06 Latitude 30° N 4.6 92.05 5. CONCLUSION In this paper, a focus was made on the ability of deep learning to predict the TCWV by using geophysical parameters as Mean sea level pressure, Surface pressure, Sea surface temperature, 100 metre U wind component, 100 metre V wind component, 10 metre U wind component, 10 metre V wind component, 2 metre dewpoint temperature, 2 metre temperature. We analyze the retrieved TCWV and compare its results with Gome2 observations. There are high precision with a mean global bias equal to 0.10 km/m2 and the MAEis 3.41 kg/m2 . The annual prediction average of TCWV for three transects at 0°N, 15°N and 30°N compared to real measurement show good result about the effective of the deep neural regression model. Acknowledgements European Centre for Medium-Range Weather Forecasts. 2017, updated monthly. ERA5 Reanalysis. Research Data Archive at the National Center for Atmospheric Research, Computational and Information Systems Laboratory. https://doi.org/10.5065/D6X34W69. Accessed 05 Feb. 2019.
  • 9. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.9, No.6, November 2019 21 REFERENCES [1] IPCC, 2007: Climate Change 2007: The Physical Science Basis Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change [ S.Solomon, D. Qin, M. Manning, Z. Chen, M. Marquis, K.B. Averyt, M.Tignor and H.L. Miller (eds.),], Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA. https://www.ipcc.ch/report/ar4/syr/, 2007 [2] S. B. Mockler, 1995: Special report: Water vapor in the climate system. Amer. Geophys. Union. [Available online at www.eso.org/gen-fac/pubs/astclim/espas/pwv/mockler.html.] [3] Forsythe, J., S. Kidder, S. Kusselson, A.S. Jones,T.H. Vonder Haar, 2009: Increasing the landcoverage of blended multisensor total precipitablewater products for weather analysis. 16thConference on Satellite Meteorology andOceanography, Phoenix, Arizona. http://ams.confex.com/ams/89annual/techprogram/paper_149348.htm [4] J. Shi, C. Xu, J. Guo, and Y. Gao, “Real-time GPS precise pointpositioning-based precipitable water vapor estimation forrainfall monitoring and forecasting,”IEEE Transactions onGeoscience and Remote Sensing, vol. 53, no. 6, pp. 3452–3459,2015. [5] Hinton, G.; Osindero, S.; Teh, Y.W. A fast learning algorithm for deep belief nets.Neural Comput.2006,18, 1527–1554. [6] N.K. Manaswi (2018) Regression to MLP in Keras. In: Deep Learning with Applications Using Python. Apress, Berkeley, CA [7] Y.Bengio, Learning deep architectures for AI.Foundat. Trends Mach. Learn.2009,2, 1–127. [8] D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization” in International Conference on Learning Representations, 2015 [9] Y. A. Lecun, L. Bottou, Y. Bengio and P. Haffner, Gradient-based learning applied to document recognition, in Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, Nov 1998.