SlideShare a Scribd company logo
1 of 6
Download to read offline
International Journal of Trend in Scientific Research and Development (IJTSRD)
Volume 5 Issue 3, March-April 2021 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470
@ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 409
A Review on the Comparison of Box-Jenkins
ARIMA and LSTM of Deep Learning
Stavelin Abhinandithe K*, Madhu B1, Balasubramanian S2, Sahana C3
*Assistant Professor, Division of Medical Statistics, Faculty of Life Sciences,
1Associate Professor, Department of Community Medicine JSS Medical College,
2Dean & Director of Research, Department of Water & Health,
3Division of Medical Statistics, Faculty of Life Sciences,
*, 1, 2, 3 JSS AHER, Mysore, Karnataka, India
ABSTRACT
A time-series is a set of events, sequentially calculated over time. Predicting
the Time Series is mostly about predicting the future. The ability of a time
series forecasting model is determined by its success in predicting the future.
This is often at the expense of being able to explain why a specific prediction
was made. The Box-Jenkins model implies that the time series is stationary,
and thus suggests differentiating non-stationary series once or several times
to obtain stationary effects. This generates a model for ARIMA, the "I" being
the word for "Integrated". The LSTM networks, comparable to computer
memory, enforce a gated cell for storing information. Such as the previously
mentioned networks, the LSTM cells also recognize when to make preceding
time-steps reads and writes information. Even though the work is new, it is
obvious that LSTM architecturesprovidetremendousprospectsascontenders
for modeling and forecasting time series. The outcomes of the overall
discrepancy in error indicate that in regards of both RMSE and MAE, the
LSTM-model tended to have greater predictive accuracy than the ARIMA-
model.
KEYWORDS: ARIMA, LSTM, Time series model, RMSE, MAE
How to cite this paper: Stavelin
Abhinandithe K | Madhu B |
Balasubramanian S | Sahana C "A Review
on the Comparison of Box-Jenkins ARIMA
and LSTM of Deep
Learning" Published
in International
Journal of Trend in
Scientific Research
and Development
(ijtsrd), ISSN: 2456-
6470, Volume-5 |
Issue-3, April 2021, pp.409-414, URL:
www.ijtsrd.com/papers/ijtsrd39831.pdf
Copyright © 2021 by author(s) and
International Journal ofTrendinScientific
Research and Development Journal. This
is an Open Access article distributed
under the terms of
the Creative
CommonsAttribution
License (CC BY 4.0)
(http://creativecommons.org/licenses/by/4.0)
I. INTRODUCTION
A time-series is a set of events, sequentially calculated over
time. Such findings either are continuous over time, or
recorded at discrete intervals of equal time.[1 ]Exploration of
the time series is a rather useful field of data analysis,
retrieving information from past findings to establish the
progression of a current phenomenon and to promote its
prediction into the long term. The study also illustrates
definite associations that might seem obvious to the naked
eye and describes certain characteristics required to
recognize this phenomenon’s current state of play.
Continuous time series, such as brain activity assessment,
are generally evaluated by measuring the series at intervals
of equal time to offer a discrete time series. Observations in
the series frequently correspond in variouswaysatdifferent
time-stages. The potential consequences can be estimated
using observed data while succeeding observations are
dependent. Analysis of the time series is the field of study
where these associations and correlations are analyzed.The
correlation might be the order of the data gathered, the
linearity of the model, reiterating trends etc.[2] The research
community had also raised a handful of attempts to enhance
time series exploration and analysis[.3]
A. Prediction in time series:
Predicting the Time Series is mostly about predicting the
future. Although many data from the time series relies on its
previous values. Near history attributesarestrongmeasures
of conduct in a factor. Lagged values of a variable as in a rate
of exchange are regressed over one or more lagged valuesof
themselves to determine the present and potential values of
the variable. Forecasting in the time series is reliant on
numerous models and methods, and is commonly included
to predict different aspects of human behavior [4-6]. While
forecasting time series, one primarily uses sequence
preceding values to estimate an upcoming value. It's
convenient to map the association of the y variable with the
preceding y vector values because of the usage of past
values. The intention of time series analysis is to research
the time series path patterns and to construct a model to
explain the data structure and then forecasting potential
values of time series.[7]
B. Forecasting in time series:
Time series estimation includes utilizing time series
observations, in accordance with the time series analysis, to
build a predictive model illustrating the dependencies. This
IJTSRD39831
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 410
model is then used in series estimation of future values. The
forecast uses only past knowledge to estimate future
values.[4] The ability of a time series forecasting model is
determined by its success in predicting the future. This is
often at the expense of being able to explain why a specific
prediction was made.
II. Forecasting methods in time series analysis.
Methods of forecasting appear to be divided into two main
categories. We are: intuitive and formalized.
Fig.1. Classification of forecasting methods
IntuitiveforecastingMethodscompriseforecastsandexpert
inferences. Theyare also used nowadays in market research,
economics, politics and other fields, the action of which is
very complicated or harder to predict with mathematical
models[8]
Formalized methods are the methods, which use
mathematical models to predict future values. They are
divided into domain models and time-series models.
Domainmodels–Models,focusedondomainprocesses,laws,
and mechanisms. Climate prediction model for instance
includes an equation of fluid dynamics and thermodynamics.
The most common solutions to software failures include the
software reliability models. Their key disadvantage: they
don't suit all different technology classes, since they focus on
unique attributes of it[9]. A thorough comprehension of the
processes, methodologies and technologies of software
design and testing is needed to construct an appropriate
model of software reliability and to be qualified to draw
conclusions dependent on such a model.
Time-series models are mathematical forecasting models
that aim to determine, throughout the framework itself, the
reliance of the future value from the past value as well as
compute the estimation relying on this dependence. Such
models are identical for various contexts, i.e. their overall
appearance does not alter according to the design of thetime
series[10]. Time series models[11] can be further divided into:
Regression models;
Smoothing models;
Models based on neura.
III. Model description:
Basics of Box and Jenkins Time Series Models
The Box-Jenkins Time Series models are titled after the
statisticians George Box and Gwilym Jenkins (Box and
Jenkins, 1970), these models generate projected estimates
depending on the statistical criteria of the time series data
obtained and these models have achieved considerable
interest in the fields of organizational analysis,management
science and statistics. The Box-Jenkins ARMA
(Autoregressive Moving Average) model is a combination of
the AR (Auto Regressive) and MA (Moving Average) models.
The Box-Jenkins model implies that the time series is
stationary, and thus suggests differentiating non-stationary
series once or several times to obtain stationary effects.This
generates a model for ARIMA, the "I" being the word for
"Integrated". They are also known as the Autoregressive
Integrated Moving Average (ARIMA) models (Box and
Jenkins, 1976).
There are many ways to forecast using the time-series
models, and the most commonly used of which is as follows
Autoregressive Models (AR).
Moving Average Models (MA).
Mixed Models (ARMA).
Integrated Mixed Models (ARIMA).
Autoregressive Integrated Moving Averages Models:
In 1976, Box and Jenkins first published ARIMA in a book
which gained considerable coverage from the science
establishment, based on statistical analysis at that period.
Thus, this approach is implementedina vastrangeofsectors
and stays one of the best reliable data processing and
operational prediction models[13]. ARIMA stands for
Autoregressive ( AR) Integrated (I) Moving Average (MA),
otherwise regarded as the Box- Jenkins approach.13 As
indicated by the terminology, ARIMA (p , d, q) incorporates
main components of the model:
AR: Auto regression. A regression model using the
relationships between an observation and a number of
observed lags (p).
I: Integrated. To render the time series stationary by
calculating the observed variations at various times (d).
MA: Moving Average. A methodology that bears into
consideration the dependence among the observations and
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 411
the terms of residual error while using a moving average
model for lagging observations (q).
The Box-Jenkins method[14] summarizes the ARIMAprocess
in three main steps:
Identification: the initial move is to split through the time
series by the three processes: AR (autoregressive), I
(integrated) and MA (moving average); this phase
apparently allows the parameters p, d and q to be defined,
whilst testing the series' stationarity first. Configuration of
parameters p, q is accomplished owing to the functions of
autocorrelation and partial autocorrelation that we would
address in depth in the section of realization. The parameter
d is differentiation order.
Estimation: The following phase in the Box-Jenkins method
is to determine the parameters of the respective models by
supplying the p d and q orders. Estimates are made using
non-linear methodologies.
Diagnosis: the final stage in the Box-Jenkins process relates
to checking the model’s validity. That is, validate that the
predicted model is suited to the accessible data. We refer to
statistical testing for that.
LSTM
Neural Networks
ANN: ANNs are partially influenced by our brain's neural
interactions, attempting to imitate the neuronal circuits and
its behavioral patterns. ANNs, being resilient and self-
adaptive, typically offer satisfactory remedies to implicitly
inferred non - linear challenges, and solvechallengessuchas
speech recognition, natural language processing, and
forecasting20. The framework includes so-called layers
across input and output, in which raising the amount of
layers often raises network complexities. By loading
identified data into the network, the network is educated by
weight allocation in reference to the preferred input output.
The weights are network-adjusted scalarstowhichtheerror
within the output, which is inferred by the network, and the
true expected output utilizing the weight gradient of the
error. Nevertheless, those kinds ofnetworksare notideal for
sequential results. Since the network seems to have no
memory of earlier stages, it cannot model dependencies,
leaving it difficult to interpret sequential data. Therefore a
memory of some sort is desired.
RNN: RNNs, recurrent neural networks, string numerous
layers of networks together, whereby, in connection to the
output, information from previous time stages is passed to
future time stages. As the inputs and parameters are
analyzed from each layer, the output from previous layers is
taken into account, providing the network a type of memory
[15]. The gradient can become fairlycomplicatedwhena time-
dimension needs to be drawn into account for RNNs,
culminating in the knowledge from previous phases in time
either begin to disappear or become greatly enhanced. Such
phenomenon are also known as a vanishing or bursting
gradient.
LSTM: The LSTM networks, comparable to computer
memory, enforce a gated cell for storinginformation.Such as
the previously mentioned networks, the LSTM cells also
recognize when to make preceding time-steps reads and
writes information[15, 17]. Therefore the LSTM model
addresses the issue of a disappearing or exploding gradient
and helps the network to accurately recognize data far back
in the sequence [18, 16].
LSTM model architecture is quite complicated to present.
The Long Short-Term Memory (LSTM) network is basically
the most commonly implemented framework to resolve the
gradient disappearance crisis. SeppHochreiter and
JrgenSchmidhuber suggestedthissamenetwork structurein
1997[19]. With the emergence of deep architectures, LSTM
has been extensively seen in sequence learning currently,
and has shown considerable experimental strength[20]. Each
LSTM is a collection of cells, or device modules,whichcollect
and preserve the data streams. The cells are like a transport
line (the upper line in each cell) that links data from the past
and gathers them for the current from one module to
another. Owing to the utilization of certain gates in eachcell,
data may be discarded of, filtrated, or incorporated for the
following cells in each cell. Thus, the gates that are focused
on a sigmoidal neural network layer allow the cells to allow
data to transit through or be disposed of optionally. Each
sigmoid layer generates numbers in the vicinity of zero and
one, reflecting - cell's quantity of each data segment should
be allowed through. Most specifically, an estimate of zero
value means that "let nothing pass through;" and whilst; an
estimate of one suggests that "let anything pass through."
Each LSTM includes three styles of gates mostly with
purpose of regulating the condition of each cell.[21]:
Forget Gate outputs a number between 0 and 1,where1
shows “completely keep this”; whereas, 0 implies
“completely ignore this.”
Memory Gate determines whichnewdata tobe retained
in the cell. Initially, a sigmoid layer, called the "door
input layer," decides which values are to be altered. A
tanh layer then allows a vector of fresh agent values
which can be attached to the system.
Output Gate determines what yields every cell canyield.
The rendered value, together with the processed and
freshly inserted data, will be based on cell state.
This architecture is prone to numerous modifications,
however, it has nearly as many changes as the articleswhich
use it. A condensed version of LSTM termed GRU (Gated
Recurrent Unit) was implemented in 2014[22]. This has the
benefit of having fewercomputationallycomplex,sinceit has
smaller parameters and equations. It remains however as
powerful as its predecessor. Here are the equations which
govern the model's dynamism:
Zt = σ(WZxt + UZht−1 + bZ) (update gate)
Rt = σ(WRxt + URht−1 + bR) (reset gate)
ht = Zt ◦ ht−1 + (1 − Zt) ◦ tanh(Whxt + Uh(Rt ◦ ht−1) +
bh).
Last year's curiosity in utilizing LSTM has unexpectedly
increased and its predictive accuracy can illustrate its
strength in nearly every area of science. It took about two
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 412
decades for the strategyto becomemainstream. Toconstruct
an LSTM model, initially we have to convert the Dataset
training and test into a three-dimensional collection of
"samples, features, and time measures". We utilize a layer of
inputs, a hidden layer of LSTM blocks aswell asa singlelayer
of output. The accompanying principles are employed to
evaluate the amount of layers and the number of neurons in
each layer:
Input layer: Logically, we possessone entry;the number
of neurons in that layer is calculated by the number of
columns that can be quantified.
Output layer: each neural network has a single output.
Hidden layer: the size of this layer, that is to say the
number of neurons, is to determine.
GAN: Generative adversarial networks, or GANs for shorter,
are a strategy to generative modeling incorporating
techniques of deep learning, like neural convolution
networks. GANs are an innovative means to teach a
generative model by presenting the issue using two sub-
models as a supervised learning problem: the generator
model, and the discriminator model. There is teaching ofthe
two models together.
Naive Bayes is an illustration of a more commonly used
generative model as a discriminative model. Naive Bayes
functions by summing up the distribution of probability for
each input variable and the output class. When a prediction
is made, the likelihood for each potential outcome for each
variable is determined, the individual probabilities are
merged, and the highly realistic consequence is estimated.
Other examples of generative models include Latent
Dirichlet Allocation,orLDA, andtheGaussianMixtureModel,
or GMM.
RBM: A Restricted Boltzmann Machine is a generative
stochastic artificial neural network which is able to learn a
distribution of probability over its set of inputs. RBMs were
first discovered in 1986 by Paul Smolensky under the name
Harmonium. RBMs are a variety of Boltzmann devices, as
their name suggests, with the constraint that its neurons
should build a bipartisan graph. This limitation enables for
increasingly effective training algorithms than those
accessible for the general group of Boltzmann machines, in
specific the contrastive divergence algorithm based on
gradients.
DBN: Deep Belief Network is an unsupervised probabilistic
deep learning algorithm, made up of multilayered latent
stochastic variables. DBN is a graphical model of the
generative hybrid. Top two stratification are undirected.
Two layers lower have guided interactions from above
layers. DBN are pre trained using Greedy Algorithm. By
adequately instantiating the weights of all the layers, pre-
training improves optimization. DBNs are used in image
recognition, video sequences, motion capture data and
speech recognition.
FORECAST PERFORMANCE METRICS(CommonAccuracy
Statistics)
RMSE: The RMSE is thesquare rootofresidual variance. This
shows the model’s absolute fit for the data – how identical
the measured data points are to the expected values of the
model. RMSE is an absolute fit-measure. It can be assumed,
focusing on a thumb rule, that the RMSE values between 0.2
and 0.5 indicate that the model was able to predict the
relevant data reasonably. Adjusted R-squared upwards of
0.75 is, however, a very better value fortheaccuracydisplay.
For certain cases it is also appropriate to have Modified R-
squared of 0.4 or more.
MSE: The mean squared error, or MSE, is measured as the
square predicted error values average. The squaring of
forecast error values causes them to be positive; it also puts
greater weight on major errors. A greater MSE implies that
the data values are massively scattered around its central
moment (mean), and a lesser MSE indicatesotherwiseandis
certainly the preferable and/or desired option because it
indicates that the data values are distributed near to its
central moment, which is normally large.
MAPE: The mean absolute percentage error (MAPE) is the
mean or average of forecast errors with an absolute
percentage. Error is characterized as real or detected,minus
the expected value. Percentage errors are summed up
lacking regard to MAPE calculation signing. This calculation
is easy to grasp, because it rendersthemistakeinpercentage
terms. Furthermore, since absolute percentage errors are
considered, the issue of equally dismissing positive and
negative errors is prevented. MAPE therefore has a strategic
reach and is a metric which is widely includedinforecasting.
The smaller the MAPE the better the forecast.
RELATED WORKS:
A comparison analysis was carried out between LSTM and
ARIMA in AjilaElmasdotter's forecasting sale retail[2].
Furthermore, ARIMA time series is a commonly used
methodology for the budgetary time series23 in
econometrics, numerous ARIMA models have been
formulated to evaluate and project stock markets [24-26]. In
addition, ARIMA is used for water expenditure / forecasting
of consumption[27,28] and the requirement for electricity[29].
Most notably, there has been a great deal of concern in the
usage of deep learning models [30,31], particularly recursive
models like LSTM Neural Network for the estimation of
budgetary time series, particularly in the stock market[32-
34].In[32] presented a simulationandsimulationofChina stock
returns employing LSTM architecture via an accepted 27.2
percent accuracy, in[33] studied the usefulness of recurrent
neural networks to the assessment of stock market price
fluctuations. ShanghaiCompositeIndexandDow JonesIndex
were eventually proposed[34].Withregardtobudgetanalysis
and forecasting, hardly any research has been reported
trying to compare or even implementingthetwotechniques,
the majority of which have been adapted to the stock price
as discussed above. It is due largely to the complexity of
accessing specific data sets and the unreliablenatureofsuch
data. None of its authors correlated LSTM and ARIMA
models to efficiency.
MAJOR DIFFERENCES BETWEEN ARIMA AND LSTM:
Even though the work is new, it is obvious that LSTM
architectures provide tremendous prospects as contenders
for modeling and forecasting time series. We review the
major variations between an LSTM and ARIMA in the
following Table. The usage of RNNs, especially the LSTM
architecture, allows us to set many parameters that we need
to change in order to achieve optimum efficiency in the
forecasting tasks. The problem lies in selecting the
appropriate parameters to identify the correct architecture
for the platform. An ARIMA model is easy to customize as it
provides a reasonable efficiency, such model also includes
the determination of the parameters p, d and q such that p is
the order of the autoregressive part (AR), the order of
differentiation and q the order of the average moving
portion (MA).
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 413
ARIMA LSTM
ARIMA is a linear model
Small amount of data is also sufficient for ARIMA (i.e.,
minimum number of observations is 50-100).
ARIMA is a parametric model, which is to say for each
series we have to define the parameters p, d and q.
ARIMA is dedicated only for time series analysis
LSTM is a nonlinear model.
LSTM can store and process large amount of data.
As LSTM is a non-parametric model, requires
adjustment of some hypermeters.
LSTM models can process sequential data as well.
Evaluation measures of real world time series data:
MAE RMSE MAPE SMAPE
1 Formula
1
1
ˆ
n
i i
i
MAE y y
n =
= −
∑ ( )
2
1
n
i i
i
f o
RMSE
n
=
−
=
∑
1
1 n
t t
t t
A F
MAPE
n A
=
−
= ∑
( )
1
100%
/ 2
n
t t
t t t
F A
n A F
=
−
+
∑
2
Acceptable
range
The lower the MSE, the
higher the accuracy of
the prediction.
Lower values of RMSE
indicate better fit.
Since, MAPE is a
measure of error, high
number are bad and
low numbers are good
Lower the value, higher
the accuracy.
Comparison of Software:
ARIMA LSTM
EViews
Julia.
Mathematicia
MATLAB
NCSS
Python
R
Ruby
SAS
IBM SPSS
Stata
Python
Rnn
Tensor Flow
IBM SPSS
MATLAB
SAS
Julia
Ruby
CONCLUSIONS:
These methodologies are acquiring prominence amongst
investigators across diverse disciplines with the latest
enhancement in the development of advanced machine
learning-based approaches and especially deep learning
algorithms. The major concern then is how specifically and
effectively these newly developed techniques compete with
conventional methods. The outcomes of the overall
discrepancy in error indicate that in regards of both RMSE
and MAE, the LSTM-model tended to have greaterpredictive
accuracy than the ARIMA-model. All in all, we have shown
that on lengthy-term predictions, neural networks provide
more precision. Since the error for the ARIMA model
presented by Box Jenkins is not significantly higher, we note
that its use remains an advantage in the prediction of time
series. We also saw the two models enjoy complimentary
differences: ARIMA is far pickier on seasonal peaks while
LSTM suits the balanced portion of the series better. For
some developed countries dysfunctional economic system
makes reliable prediction models complicated. The
probability of having a better estimate (without sacrificing
the complexity of the data through diff and linearization)
emerges with the aid of new approaches in RNN.
References:
[1] Chatfield C. Time-series forecasting. CRC press; 2000
Oct 25.
[2] Elmasdotter A, Nyströmer C. A comparative study
between LSTM and ARIMA for sales forecasting in
retail.
[3] Fathi O. Time series forecastingusinga hybridARIMA
and LSTM model. In Velvet Consulting 2019.
[4] Zhang GP. Time series forecasting using a hybrid
ARIMA and neural network model. Neurocomputing.
2003 Jan 1; 50: 159-75.
[5] Zhang GP. Time series forecasting using a hybrid
ARIMA and neural network model. Neurocomputing.
2003 Jan 1; 50: 159-75.
[6] Chen L, Wang S, Fan W, Sun J, Naoi S. Beyond human
recognition: A CNN-based framework for
handwritten character recognition. In2015 3rd IAPR
Asian Conference on Pattern Recognition (ACPR)
2015 Nov 3 (pp. 695-699). IEEE.
[7] Siami-Namini S, Tavakoli N, Namin AS. A comparison
of ARIMA and LSTM in forecastingtimeseries.In2018
17th IEEE International Conference on Machine
Learning and Applications (ICMLA) 2018 Dec 17 (pp.
1394-1401). IEEE.
[8] A. M. Alonso, C. Garcia-Martos. Time Series Analysis -
Forecasting with ARIMA models. In2012 Universidad
Carlos III de Madrid, Universidad Politecnica de
Madrid.
[9] Earnest A, Chen MI, Ng D, Sin LY.Usingautoregressive
integrated moving average (ARIMA) models to
predict and monitor the number of beds occupied
during a SARS outbreak in a tertiary hospital in
Singapore. BMC Health Services Research. 2005 Dec
1; 5(1): 36.
[10] Tang R, Zeng F, Chen Z, Wang JS, Huang CM, WuZ.The
Comparison of Predicting Storm-time Ionospheric
TEC by Three Methods: ARIMA, LSTM, and Seq2Seq.
Atmosphere. 2020 Apr; 11(4): 316.
[11] Khashei M, Bijari M. A novel hybridization of artificial
neural networks and ARIMA models for time series
forecasting. Applied Soft Computing. 2011 Mar 1;
11(2): 2664-75.
[12] Liu LM, Hudak GB, Box GE, Muller ME, Tiao GC.
Forecasting and time series analysis using the SCA
statistical system. DeKalb, IL: Scientific Computing
Associates; 1992.
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 414
[13] K. Lawrence and M. Geurts. Advances in Businessand
Management Forecasting. In2004 vol. 19. Emerald
Group Publishing.
[14] Rhanoui M, Yousfi S, Mikram M, Merizak H.
Forecasting financial budget time series: ARIMA
random walk vs LSTM neural network. IAES
International Journal of Artificial Intelligence. 2019
Dec 1; 8(4).
[15] Rojas R. Neural networks: a systematic introduction.
Springer Science & Business Media; 2013 Jun 29.
[16] Li X, Peng L, Yao X, Cui S, Hu Y, You C, Chi T. Long
short-term memory neural network for air pollutant
concentration predictions: Method developmentand
evaluation. Environmental pollution.2017Dec1;231:
997-1004.
[17] Cortez B, Carrera B, Kim YJ, Jung JY. An architecture
for emergency event predictionusingLSTM recurrent
neural networks. Expert Systems with Applications.
2018 May 1; 97: 315-24.
[18] Krstanovic S, Paulheim H. Ensembles of recurrent
neural networks for robust time series forecasting.In
International Conference on Innovative Techniques
and Applications of Artificial Intelligence2017Dec12
(pp. 34-46). Springer, Cham.
[19] Jia Y, Wu Z, Xu Y, Ke D, Su K. Long Short-Term
Memory Projection Recurrent Neural Network
Architectures for Piano’s Continuous Note
Recognition. Journal of Robotics. 2017 Jan 1; 2017.
[20] Chung J, Gulcehre C, Cho K, Bengio Y. Empirical
evaluation of gated recurrent neural networks on
sequence modeling. arXiv preprint arXiv:1412.3555.
2014 Dec 11.
[21] Schmidhuber J. Deep learning in neural networks: An
overview. Neural networks. 2015 Jan 1; 61: 85-117.
[22] Hochreiter S, Schmidhuber J. Long short-term
memory. Neural computation. 1997 Nov 15; 9(8):
1735-80.
[23] Mills TC, Markellos RN. The econometric modellingof
financial time series. Cambridge University Press;
2008 Mar 20.
[24] Pai PF, Lin CS. A hybrid ARIMA and support vector
machines model in stock price forecasting. Omega.
2005 Dec 1; 33(6): 497-505.
[25] Jarrett JE, KyperE.ARIMAmodelingwithintervention
to forecast and analyze Chinese stock prices.
International Journal of Engineering Business
Management. 2011 Aug 15; 3(3): 53-8.
[26] Ariyo AA, Adewumi AO, Ayo CK. Stock price
prediction using the ARIMA model. In2014 UKSim-
AMSS 16th International Conference on Computer
Modelling and Simulation2014Mar26(pp.106-112).
IEEE.
[27] Birylo M, Rzepecka Z, Kuczynska-Siehien J, Nastula J.
Analysis of water budget prediction accuracy using
ARIMA models. Water Science and Technology:
Water Supply. 2018 Jun; 18(3): 819-30.
[28] Razali SN, Rusiman MS, Zawawi NI, Arbin N.
Forecasting ofwaterconsumptions expenditureusing
Holt-Winter’s and ARIMA.
[29] Kandananond K. Electricity demand forecasting in
buildings based on ARIMA and ARX models. In
Proceedings of the 8th International Conference on
Informatics, Environment, Energy and Applications
2019 Mar 16 (pp. 268-271).
[30] Chong E, Han C, Park FC. Deep learning networks for
stock market analysis and prediction: Methodology,
data representations, and case studies. Expert
Systems with Applications. 2017 Oct 15;83:187-205.
[31] Singh R, Srivastava S. Stock prediction using deep
learning. Multimedia Tools and Applications. 2017
Sep 1; 76(18): 18569-84.
[32] Chen K, Zhou Y, Dai F. A LSTM-basedmethodforstock
returns prediction: A case study of China stock
market. In2015 IEEE international conference on big
data (big data) 2015 Oct 29 (pp. 2823-2824). IEEE.
[33] Nelson DM, Pereira AC, de Oliveira RA.Stock market's
price movement prediction with LSTM neural
networks. In2017 International joint conference on
neural networks (IJCNN) 2017 May 14 (pp. 1419-
1426). IEEE.
[34] Jiang Q, Tang C, Chen C, Wang X, Huang Q. Stock price
forecast based on LSTM neural network. In
International Conference onManagementScienceand
Engineering Management 2018 Aug 1 (pp. 393-408).
Springer, Cham.

More Related Content

What's hot

Quatitative Data Analysis
Quatitative Data Analysis Quatitative Data Analysis
Quatitative Data Analysis maneesh mani
 
Types of data and graphical representation
Types of data and graphical representationTypes of data and graphical representation
Types of data and graphical representationReena Titoria
 
Business statistics review
Business statistics reviewBusiness statistics review
Business statistics reviewFELIXARCHER
 
Introduction concepts of Statistics
Introduction concepts of StatisticsIntroduction concepts of Statistics
Introduction concepts of StatisticsSaurabh Patni
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of datadrasifk
 
Scales of measurement and presentation of data
Scales of measurement and presentation of dataScales of measurement and presentation of data
Scales of measurement and presentation of dataDr Sithun Kumar Patro
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsSarfraz Ahmad
 
Analysis of Forecasting Sales By Using Quantitative And Qualitative Methods
Analysis of Forecasting Sales By Using Quantitative And Qualitative MethodsAnalysis of Forecasting Sales By Using Quantitative And Qualitative Methods
Analysis of Forecasting Sales By Using Quantitative And Qualitative MethodsIJERA Editor
 
#2 Classification and tabulation of data
#2 Classification and tabulation of data#2 Classification and tabulation of data
#2 Classification and tabulation of dataKawita Bhatt
 
135. Graphic Presentation
135. Graphic Presentation135. Graphic Presentation
135. Graphic PresentationLAKSHMANAN S
 
A NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATION
A NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATIONA NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATION
A NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATIONmathsjournal
 

What's hot (15)

Quatitative Data Analysis
Quatitative Data Analysis Quatitative Data Analysis
Quatitative Data Analysis
 
Edited economic statistics note
Edited economic statistics noteEdited economic statistics note
Edited economic statistics note
 
Types of data and graphical representation
Types of data and graphical representationTypes of data and graphical representation
Types of data and graphical representation
 
Business statistics review
Business statistics reviewBusiness statistics review
Business statistics review
 
Hoc
HocHoc
Hoc
 
Introduction concepts of Statistics
Introduction concepts of StatisticsIntroduction concepts of Statistics
Introduction concepts of Statistics
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
 
Scales of measurement and presentation of data
Scales of measurement and presentation of dataScales of measurement and presentation of data
Scales of measurement and presentation of data
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Analysis of Forecasting Sales By Using Quantitative And Qualitative Methods
Analysis of Forecasting Sales By Using Quantitative And Qualitative MethodsAnalysis of Forecasting Sales By Using Quantitative And Qualitative Methods
Analysis of Forecasting Sales By Using Quantitative And Qualitative Methods
 
Data presentation 2
Data presentation 2Data presentation 2
Data presentation 2
 
#2 Classification and tabulation of data
#2 Classification and tabulation of data#2 Classification and tabulation of data
#2 Classification and tabulation of data
 
135. Graphic Presentation
135. Graphic Presentation135. Graphic Presentation
135. Graphic Presentation
 
Food quality management
Food quality managementFood quality management
Food quality management
 
A NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATION
A NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATIONA NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATION
A NEW METHOD OF CENTRAL DIFFERENCE INTERPOLATION
 

Similar to A Review on the Comparison of Box Jenkins ARIMA and LSTM of Deep Learning

Parallel multivariate deep learning models for time-series prediction: A comp...
Parallel multivariate deep learning models for time-series prediction: A comp...Parallel multivariate deep learning models for time-series prediction: A comp...
Parallel multivariate deep learning models for time-series prediction: A comp...IAESIJAI
 
TIME SERIES & CROSS ‎SECTIONAL ANALYSIS
TIME SERIES & CROSS ‎SECTIONAL ANALYSISTIME SERIES & CROSS ‎SECTIONAL ANALYSIS
TIME SERIES & CROSS ‎SECTIONAL ANALYSISLibcorpio
 
Delivery Feet Data using K Mean Clustering with Applied SPSS
Delivery Feet Data using K Mean Clustering with Applied SPSSDelivery Feet Data using K Mean Clustering with Applied SPSS
Delivery Feet Data using K Mean Clustering with Applied SPSSijtsrd
 
Time Series Weather Forecasting Techniques: Literature Survey
Time Series Weather Forecasting Techniques: Literature SurveyTime Series Weather Forecasting Techniques: Literature Survey
Time Series Weather Forecasting Techniques: Literature SurveyIRJET Journal
 
International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI) International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI) inventionjournals
 
Demand Forecasting of a Perishable Dairy Drink: An ARIMA Approach
Demand Forecasting of a Perishable Dairy Drink: An ARIMA ApproachDemand Forecasting of a Perishable Dairy Drink: An ARIMA Approach
Demand Forecasting of a Perishable Dairy Drink: An ARIMA ApproachIJDKP
 
Time Series Analysis: Techniques for Analyzing Time-Dependent Data
Time Series Analysis: Techniques for Analyzing Time-Dependent DataTime Series Analysis: Techniques for Analyzing Time-Dependent Data
Time Series Analysis: Techniques for Analyzing Time-Dependent DataUncodemy
 
Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...
Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...
Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...Editor IJCATR
 
Case Study: Prediction on Iris Dataset Using KNN Algorithm
Case Study: Prediction on Iris Dataset Using KNN AlgorithmCase Study: Prediction on Iris Dataset Using KNN Algorithm
Case Study: Prediction on Iris Dataset Using KNN AlgorithmIRJET Journal
 
Principle Component Analysis Based on Optimal Centroid Selection Model for Su...
Principle Component Analysis Based on Optimal Centroid Selection Model for Su...Principle Component Analysis Based on Optimal Centroid Selection Model for Su...
Principle Component Analysis Based on Optimal Centroid Selection Model for Su...ijtsrd
 
Pca based support vector machine technique for volatility forecasting
Pca based support vector machine technique for volatility forecastingPca based support vector machine technique for volatility forecasting
Pca based support vector machine technique for volatility forecastingeSAT Publishing House
 
Efficiency of recurrent neural networks for seasonal trended time series mode...
Efficiency of recurrent neural networks for seasonal trended time series mode...Efficiency of recurrent neural networks for seasonal trended time series mode...
Efficiency of recurrent neural networks for seasonal trended time series mode...IJECEIAES
 
SSA-based hybrid forecasting models and applications
SSA-based hybrid forecasting models and applicationsSSA-based hybrid forecasting models and applications
SSA-based hybrid forecasting models and applicationsjournalBEEI
 
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONSMUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONSIRJET Journal
 
esmaeili-2016-ijca-911399
esmaeili-2016-ijca-911399esmaeili-2016-ijca-911399
esmaeili-2016-ijca-911399Nafas Esmaeili
 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfDr. Radhey Shyam
 
Study of effectiveness of time series modeling (arima) in forecasting stock p...
Study of effectiveness of time series modeling (arima) in forecasting stock p...Study of effectiveness of time series modeling (arima) in forecasting stock p...
Study of effectiveness of time series modeling (arima) in forecasting stock p...IJCSEA Journal
 
RAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEY
RAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEYRAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEY
RAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEYcsandit
 
A novel hybrid deep learning model for price prediction
A novel hybrid deep learning model for price prediction A novel hybrid deep learning model for price prediction
A novel hybrid deep learning model for price prediction IJECEIAES
 
Novel Ensemble Tree for Fast Prediction on Data Streams
Novel Ensemble Tree for Fast Prediction on Data StreamsNovel Ensemble Tree for Fast Prediction on Data Streams
Novel Ensemble Tree for Fast Prediction on Data StreamsIJERA Editor
 

Similar to A Review on the Comparison of Box Jenkins ARIMA and LSTM of Deep Learning (20)

Parallel multivariate deep learning models for time-series prediction: A comp...
Parallel multivariate deep learning models for time-series prediction: A comp...Parallel multivariate deep learning models for time-series prediction: A comp...
Parallel multivariate deep learning models for time-series prediction: A comp...
 
TIME SERIES & CROSS ‎SECTIONAL ANALYSIS
TIME SERIES & CROSS ‎SECTIONAL ANALYSISTIME SERIES & CROSS ‎SECTIONAL ANALYSIS
TIME SERIES & CROSS ‎SECTIONAL ANALYSIS
 
Delivery Feet Data using K Mean Clustering with Applied SPSS
Delivery Feet Data using K Mean Clustering with Applied SPSSDelivery Feet Data using K Mean Clustering with Applied SPSS
Delivery Feet Data using K Mean Clustering with Applied SPSS
 
Time Series Weather Forecasting Techniques: Literature Survey
Time Series Weather Forecasting Techniques: Literature SurveyTime Series Weather Forecasting Techniques: Literature Survey
Time Series Weather Forecasting Techniques: Literature Survey
 
International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI) International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI)
 
Demand Forecasting of a Perishable Dairy Drink: An ARIMA Approach
Demand Forecasting of a Perishable Dairy Drink: An ARIMA ApproachDemand Forecasting of a Perishable Dairy Drink: An ARIMA Approach
Demand Forecasting of a Perishable Dairy Drink: An ARIMA Approach
 
Time Series Analysis: Techniques for Analyzing Time-Dependent Data
Time Series Analysis: Techniques for Analyzing Time-Dependent DataTime Series Analysis: Techniques for Analyzing Time-Dependent Data
Time Series Analysis: Techniques for Analyzing Time-Dependent Data
 
Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...
Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...
Time Series Forecasting Using Novel Feature Extraction Algorithm and Multilay...
 
Case Study: Prediction on Iris Dataset Using KNN Algorithm
Case Study: Prediction on Iris Dataset Using KNN AlgorithmCase Study: Prediction on Iris Dataset Using KNN Algorithm
Case Study: Prediction on Iris Dataset Using KNN Algorithm
 
Principle Component Analysis Based on Optimal Centroid Selection Model for Su...
Principle Component Analysis Based on Optimal Centroid Selection Model for Su...Principle Component Analysis Based on Optimal Centroid Selection Model for Su...
Principle Component Analysis Based on Optimal Centroid Selection Model for Su...
 
Pca based support vector machine technique for volatility forecasting
Pca based support vector machine technique for volatility forecastingPca based support vector machine technique for volatility forecasting
Pca based support vector machine technique for volatility forecasting
 
Efficiency of recurrent neural networks for seasonal trended time series mode...
Efficiency of recurrent neural networks for seasonal trended time series mode...Efficiency of recurrent neural networks for seasonal trended time series mode...
Efficiency of recurrent neural networks for seasonal trended time series mode...
 
SSA-based hybrid forecasting models and applications
SSA-based hybrid forecasting models and applicationsSSA-based hybrid forecasting models and applications
SSA-based hybrid forecasting models and applications
 
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONSMUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
 
esmaeili-2016-ijca-911399
esmaeili-2016-ijca-911399esmaeili-2016-ijca-911399
esmaeili-2016-ijca-911399
 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdf
 
Study of effectiveness of time series modeling (arima) in forecasting stock p...
Study of effectiveness of time series modeling (arima) in forecasting stock p...Study of effectiveness of time series modeling (arima) in forecasting stock p...
Study of effectiveness of time series modeling (arima) in forecasting stock p...
 
RAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEY
RAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEYRAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEY
RAINFALL PREDICTION USING DATA MINING TECHNIQUES - A SURVEY
 
A novel hybrid deep learning model for price prediction
A novel hybrid deep learning model for price prediction A novel hybrid deep learning model for price prediction
A novel hybrid deep learning model for price prediction
 
Novel Ensemble Tree for Fast Prediction on Data Streams
Novel Ensemble Tree for Fast Prediction on Data StreamsNovel Ensemble Tree for Fast Prediction on Data Streams
Novel Ensemble Tree for Fast Prediction on Data Streams
 

More from YogeshIJTSRD

Cosmetic Science An Overview
Cosmetic Science An OverviewCosmetic Science An Overview
Cosmetic Science An OverviewYogeshIJTSRD
 
Standardization and Formulations of Calotropis Procera
Standardization and Formulations of Calotropis ProceraStandardization and Formulations of Calotropis Procera
Standardization and Formulations of Calotropis ProceraYogeshIJTSRD
 
Review of the Diagnosis and Treatment of Paralysis
Review of the Diagnosis and Treatment of ParalysisReview of the Diagnosis and Treatment of Paralysis
Review of the Diagnosis and Treatment of ParalysisYogeshIJTSRD
 
Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...
Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...
Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...YogeshIJTSRD
 
Criminology Educators Triumphs and Struggles
Criminology Educators Triumphs and StrugglesCriminology Educators Triumphs and Struggles
Criminology Educators Triumphs and StrugglesYogeshIJTSRD
 
A Review Herbal Drugs Used in Skin Disorder
A Review Herbal Drugs Used in Skin DisorderA Review Herbal Drugs Used in Skin Disorder
A Review Herbal Drugs Used in Skin DisorderYogeshIJTSRD
 
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...YogeshIJTSRD
 
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...YogeshIJTSRD
 
Data Security by AES Advanced Encryption Standard
Data Security by AES Advanced Encryption StandardData Security by AES Advanced Encryption Standard
Data Security by AES Advanced Encryption StandardYogeshIJTSRD
 
Antimicrobial and Phytochemical Screening of Phyllantus Niruri
Antimicrobial and Phytochemical Screening of Phyllantus NiruriAntimicrobial and Phytochemical Screening of Phyllantus Niruri
Antimicrobial and Phytochemical Screening of Phyllantus NiruriYogeshIJTSRD
 
Heat Sink for Underground Pipe Line
Heat Sink for Underground Pipe LineHeat Sink for Underground Pipe Line
Heat Sink for Underground Pipe LineYogeshIJTSRD
 
Newly Proposed Multi Channel Fiber Optic Cable Core
Newly Proposed Multi Channel Fiber Optic Cable CoreNewly Proposed Multi Channel Fiber Optic Cable Core
Newly Proposed Multi Channel Fiber Optic Cable CoreYogeshIJTSRD
 
Security Sector Reform toward Professionalism of Military and Police
Security Sector Reform toward Professionalism of Military and PoliceSecurity Sector Reform toward Professionalism of Military and Police
Security Sector Reform toward Professionalism of Military and PoliceYogeshIJTSRD
 
Stress An Undetachable Condition of Life
Stress An Undetachable Condition of LifeStress An Undetachable Condition of Life
Stress An Undetachable Condition of LifeYogeshIJTSRD
 
Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...
Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...
Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...YogeshIJTSRD
 
To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...
To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...
To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...YogeshIJTSRD
 
Novel Drug Delivery System An Overview
Novel Drug Delivery System An OverviewNovel Drug Delivery System An Overview
Novel Drug Delivery System An OverviewYogeshIJTSRD
 
Security Issues Related to Biometrics
Security Issues Related to BiometricsSecurity Issues Related to Biometrics
Security Issues Related to BiometricsYogeshIJTSRD
 
Comparative Analysis of Different Numerical Methods for the Solution of Initi...
Comparative Analysis of Different Numerical Methods for the Solution of Initi...Comparative Analysis of Different Numerical Methods for the Solution of Initi...
Comparative Analysis of Different Numerical Methods for the Solution of Initi...YogeshIJTSRD
 
Evaluation of Different Paving Mixes Using Optimum Stabilizing Content
Evaluation of Different Paving Mixes Using Optimum Stabilizing ContentEvaluation of Different Paving Mixes Using Optimum Stabilizing Content
Evaluation of Different Paving Mixes Using Optimum Stabilizing ContentYogeshIJTSRD
 

More from YogeshIJTSRD (20)

Cosmetic Science An Overview
Cosmetic Science An OverviewCosmetic Science An Overview
Cosmetic Science An Overview
 
Standardization and Formulations of Calotropis Procera
Standardization and Formulations of Calotropis ProceraStandardization and Formulations of Calotropis Procera
Standardization and Formulations of Calotropis Procera
 
Review of the Diagnosis and Treatment of Paralysis
Review of the Diagnosis and Treatment of ParalysisReview of the Diagnosis and Treatment of Paralysis
Review of the Diagnosis and Treatment of Paralysis
 
Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...
Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...
Comparative Analysis of Forced Draft Cooling Tower Using Two Design Methods A...
 
Criminology Educators Triumphs and Struggles
Criminology Educators Triumphs and StrugglesCriminology Educators Triumphs and Struggles
Criminology Educators Triumphs and Struggles
 
A Review Herbal Drugs Used in Skin Disorder
A Review Herbal Drugs Used in Skin DisorderA Review Herbal Drugs Used in Skin Disorder
A Review Herbal Drugs Used in Skin Disorder
 
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...
 
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...
 
Data Security by AES Advanced Encryption Standard
Data Security by AES Advanced Encryption StandardData Security by AES Advanced Encryption Standard
Data Security by AES Advanced Encryption Standard
 
Antimicrobial and Phytochemical Screening of Phyllantus Niruri
Antimicrobial and Phytochemical Screening of Phyllantus NiruriAntimicrobial and Phytochemical Screening of Phyllantus Niruri
Antimicrobial and Phytochemical Screening of Phyllantus Niruri
 
Heat Sink for Underground Pipe Line
Heat Sink for Underground Pipe LineHeat Sink for Underground Pipe Line
Heat Sink for Underground Pipe Line
 
Newly Proposed Multi Channel Fiber Optic Cable Core
Newly Proposed Multi Channel Fiber Optic Cable CoreNewly Proposed Multi Channel Fiber Optic Cable Core
Newly Proposed Multi Channel Fiber Optic Cable Core
 
Security Sector Reform toward Professionalism of Military and Police
Security Sector Reform toward Professionalism of Military and PoliceSecurity Sector Reform toward Professionalism of Military and Police
Security Sector Reform toward Professionalism of Military and Police
 
Stress An Undetachable Condition of Life
Stress An Undetachable Condition of LifeStress An Undetachable Condition of Life
Stress An Undetachable Condition of Life
 
Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...
Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...
Comparative Studies of Diabetes in Adult Nigerians Lipid Profile and Antioxid...
 
To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...
To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...
To Assess the Severity and Mortality among Covid 19 Patients after Having Vac...
 
Novel Drug Delivery System An Overview
Novel Drug Delivery System An OverviewNovel Drug Delivery System An Overview
Novel Drug Delivery System An Overview
 
Security Issues Related to Biometrics
Security Issues Related to BiometricsSecurity Issues Related to Biometrics
Security Issues Related to Biometrics
 
Comparative Analysis of Different Numerical Methods for the Solution of Initi...
Comparative Analysis of Different Numerical Methods for the Solution of Initi...Comparative Analysis of Different Numerical Methods for the Solution of Initi...
Comparative Analysis of Different Numerical Methods for the Solution of Initi...
 
Evaluation of Different Paving Mixes Using Optimum Stabilizing Content
Evaluation of Different Paving Mixes Using Optimum Stabilizing ContentEvaluation of Different Paving Mixes Using Optimum Stabilizing Content
Evaluation of Different Paving Mixes Using Optimum Stabilizing Content
 

Recently uploaded

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 

Recently uploaded (20)

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 

A Review on the Comparison of Box Jenkins ARIMA and LSTM of Deep Learning

  • 1. International Journal of Trend in Scientific Research and Development (IJTSRD) Volume 5 Issue 3, March-April 2021 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470 @ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 409 A Review on the Comparison of Box-Jenkins ARIMA and LSTM of Deep Learning Stavelin Abhinandithe K*, Madhu B1, Balasubramanian S2, Sahana C3 *Assistant Professor, Division of Medical Statistics, Faculty of Life Sciences, 1Associate Professor, Department of Community Medicine JSS Medical College, 2Dean & Director of Research, Department of Water & Health, 3Division of Medical Statistics, Faculty of Life Sciences, *, 1, 2, 3 JSS AHER, Mysore, Karnataka, India ABSTRACT A time-series is a set of events, sequentially calculated over time. Predicting the Time Series is mostly about predicting the future. The ability of a time series forecasting model is determined by its success in predicting the future. This is often at the expense of being able to explain why a specific prediction was made. The Box-Jenkins model implies that the time series is stationary, and thus suggests differentiating non-stationary series once or several times to obtain stationary effects. This generates a model for ARIMA, the "I" being the word for "Integrated". The LSTM networks, comparable to computer memory, enforce a gated cell for storing information. Such as the previously mentioned networks, the LSTM cells also recognize when to make preceding time-steps reads and writes information. Even though the work is new, it is obvious that LSTM architecturesprovidetremendousprospectsascontenders for modeling and forecasting time series. The outcomes of the overall discrepancy in error indicate that in regards of both RMSE and MAE, the LSTM-model tended to have greater predictive accuracy than the ARIMA- model. KEYWORDS: ARIMA, LSTM, Time series model, RMSE, MAE How to cite this paper: Stavelin Abhinandithe K | Madhu B | Balasubramanian S | Sahana C "A Review on the Comparison of Box-Jenkins ARIMA and LSTM of Deep Learning" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456- 6470, Volume-5 | Issue-3, April 2021, pp.409-414, URL: www.ijtsrd.com/papers/ijtsrd39831.pdf Copyright © 2021 by author(s) and International Journal ofTrendinScientific Research and Development Journal. This is an Open Access article distributed under the terms of the Creative CommonsAttribution License (CC BY 4.0) (http://creativecommons.org/licenses/by/4.0) I. INTRODUCTION A time-series is a set of events, sequentially calculated over time. Such findings either are continuous over time, or recorded at discrete intervals of equal time.[1 ]Exploration of the time series is a rather useful field of data analysis, retrieving information from past findings to establish the progression of a current phenomenon and to promote its prediction into the long term. The study also illustrates definite associations that might seem obvious to the naked eye and describes certain characteristics required to recognize this phenomenon’s current state of play. Continuous time series, such as brain activity assessment, are generally evaluated by measuring the series at intervals of equal time to offer a discrete time series. Observations in the series frequently correspond in variouswaysatdifferent time-stages. The potential consequences can be estimated using observed data while succeeding observations are dependent. Analysis of the time series is the field of study where these associations and correlations are analyzed.The correlation might be the order of the data gathered, the linearity of the model, reiterating trends etc.[2] The research community had also raised a handful of attempts to enhance time series exploration and analysis[.3] A. Prediction in time series: Predicting the Time Series is mostly about predicting the future. Although many data from the time series relies on its previous values. Near history attributesarestrongmeasures of conduct in a factor. Lagged values of a variable as in a rate of exchange are regressed over one or more lagged valuesof themselves to determine the present and potential values of the variable. Forecasting in the time series is reliant on numerous models and methods, and is commonly included to predict different aspects of human behavior [4-6]. While forecasting time series, one primarily uses sequence preceding values to estimate an upcoming value. It's convenient to map the association of the y variable with the preceding y vector values because of the usage of past values. The intention of time series analysis is to research the time series path patterns and to construct a model to explain the data structure and then forecasting potential values of time series.[7] B. Forecasting in time series: Time series estimation includes utilizing time series observations, in accordance with the time series analysis, to build a predictive model illustrating the dependencies. This IJTSRD39831
  • 2. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 410 model is then used in series estimation of future values. The forecast uses only past knowledge to estimate future values.[4] The ability of a time series forecasting model is determined by its success in predicting the future. This is often at the expense of being able to explain why a specific prediction was made. II. Forecasting methods in time series analysis. Methods of forecasting appear to be divided into two main categories. We are: intuitive and formalized. Fig.1. Classification of forecasting methods IntuitiveforecastingMethodscompriseforecastsandexpert inferences. Theyare also used nowadays in market research, economics, politics and other fields, the action of which is very complicated or harder to predict with mathematical models[8] Formalized methods are the methods, which use mathematical models to predict future values. They are divided into domain models and time-series models. Domainmodels–Models,focusedondomainprocesses,laws, and mechanisms. Climate prediction model for instance includes an equation of fluid dynamics and thermodynamics. The most common solutions to software failures include the software reliability models. Their key disadvantage: they don't suit all different technology classes, since they focus on unique attributes of it[9]. A thorough comprehension of the processes, methodologies and technologies of software design and testing is needed to construct an appropriate model of software reliability and to be qualified to draw conclusions dependent on such a model. Time-series models are mathematical forecasting models that aim to determine, throughout the framework itself, the reliance of the future value from the past value as well as compute the estimation relying on this dependence. Such models are identical for various contexts, i.e. their overall appearance does not alter according to the design of thetime series[10]. Time series models[11] can be further divided into: Regression models; Smoothing models; Models based on neura. III. Model description: Basics of Box and Jenkins Time Series Models The Box-Jenkins Time Series models are titled after the statisticians George Box and Gwilym Jenkins (Box and Jenkins, 1970), these models generate projected estimates depending on the statistical criteria of the time series data obtained and these models have achieved considerable interest in the fields of organizational analysis,management science and statistics. The Box-Jenkins ARMA (Autoregressive Moving Average) model is a combination of the AR (Auto Regressive) and MA (Moving Average) models. The Box-Jenkins model implies that the time series is stationary, and thus suggests differentiating non-stationary series once or several times to obtain stationary effects.This generates a model for ARIMA, the "I" being the word for "Integrated". They are also known as the Autoregressive Integrated Moving Average (ARIMA) models (Box and Jenkins, 1976). There are many ways to forecast using the time-series models, and the most commonly used of which is as follows Autoregressive Models (AR). Moving Average Models (MA). Mixed Models (ARMA). Integrated Mixed Models (ARIMA). Autoregressive Integrated Moving Averages Models: In 1976, Box and Jenkins first published ARIMA in a book which gained considerable coverage from the science establishment, based on statistical analysis at that period. Thus, this approach is implementedina vastrangeofsectors and stays one of the best reliable data processing and operational prediction models[13]. ARIMA stands for Autoregressive ( AR) Integrated (I) Moving Average (MA), otherwise regarded as the Box- Jenkins approach.13 As indicated by the terminology, ARIMA (p , d, q) incorporates main components of the model: AR: Auto regression. A regression model using the relationships between an observation and a number of observed lags (p). I: Integrated. To render the time series stationary by calculating the observed variations at various times (d). MA: Moving Average. A methodology that bears into consideration the dependence among the observations and
  • 3. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 411 the terms of residual error while using a moving average model for lagging observations (q). The Box-Jenkins method[14] summarizes the ARIMAprocess in three main steps: Identification: the initial move is to split through the time series by the three processes: AR (autoregressive), I (integrated) and MA (moving average); this phase apparently allows the parameters p, d and q to be defined, whilst testing the series' stationarity first. Configuration of parameters p, q is accomplished owing to the functions of autocorrelation and partial autocorrelation that we would address in depth in the section of realization. The parameter d is differentiation order. Estimation: The following phase in the Box-Jenkins method is to determine the parameters of the respective models by supplying the p d and q orders. Estimates are made using non-linear methodologies. Diagnosis: the final stage in the Box-Jenkins process relates to checking the model’s validity. That is, validate that the predicted model is suited to the accessible data. We refer to statistical testing for that. LSTM Neural Networks ANN: ANNs are partially influenced by our brain's neural interactions, attempting to imitate the neuronal circuits and its behavioral patterns. ANNs, being resilient and self- adaptive, typically offer satisfactory remedies to implicitly inferred non - linear challenges, and solvechallengessuchas speech recognition, natural language processing, and forecasting20. The framework includes so-called layers across input and output, in which raising the amount of layers often raises network complexities. By loading identified data into the network, the network is educated by weight allocation in reference to the preferred input output. The weights are network-adjusted scalarstowhichtheerror within the output, which is inferred by the network, and the true expected output utilizing the weight gradient of the error. Nevertheless, those kinds ofnetworksare notideal for sequential results. Since the network seems to have no memory of earlier stages, it cannot model dependencies, leaving it difficult to interpret sequential data. Therefore a memory of some sort is desired. RNN: RNNs, recurrent neural networks, string numerous layers of networks together, whereby, in connection to the output, information from previous time stages is passed to future time stages. As the inputs and parameters are analyzed from each layer, the output from previous layers is taken into account, providing the network a type of memory [15]. The gradient can become fairlycomplicatedwhena time- dimension needs to be drawn into account for RNNs, culminating in the knowledge from previous phases in time either begin to disappear or become greatly enhanced. Such phenomenon are also known as a vanishing or bursting gradient. LSTM: The LSTM networks, comparable to computer memory, enforce a gated cell for storinginformation.Such as the previously mentioned networks, the LSTM cells also recognize when to make preceding time-steps reads and writes information[15, 17]. Therefore the LSTM model addresses the issue of a disappearing or exploding gradient and helps the network to accurately recognize data far back in the sequence [18, 16]. LSTM model architecture is quite complicated to present. The Long Short-Term Memory (LSTM) network is basically the most commonly implemented framework to resolve the gradient disappearance crisis. SeppHochreiter and JrgenSchmidhuber suggestedthissamenetwork structurein 1997[19]. With the emergence of deep architectures, LSTM has been extensively seen in sequence learning currently, and has shown considerable experimental strength[20]. Each LSTM is a collection of cells, or device modules,whichcollect and preserve the data streams. The cells are like a transport line (the upper line in each cell) that links data from the past and gathers them for the current from one module to another. Owing to the utilization of certain gates in eachcell, data may be discarded of, filtrated, or incorporated for the following cells in each cell. Thus, the gates that are focused on a sigmoidal neural network layer allow the cells to allow data to transit through or be disposed of optionally. Each sigmoid layer generates numbers in the vicinity of zero and one, reflecting - cell's quantity of each data segment should be allowed through. Most specifically, an estimate of zero value means that "let nothing pass through;" and whilst; an estimate of one suggests that "let anything pass through." Each LSTM includes three styles of gates mostly with purpose of regulating the condition of each cell.[21]: Forget Gate outputs a number between 0 and 1,where1 shows “completely keep this”; whereas, 0 implies “completely ignore this.” Memory Gate determines whichnewdata tobe retained in the cell. Initially, a sigmoid layer, called the "door input layer," decides which values are to be altered. A tanh layer then allows a vector of fresh agent values which can be attached to the system. Output Gate determines what yields every cell canyield. The rendered value, together with the processed and freshly inserted data, will be based on cell state. This architecture is prone to numerous modifications, however, it has nearly as many changes as the articleswhich use it. A condensed version of LSTM termed GRU (Gated Recurrent Unit) was implemented in 2014[22]. This has the benefit of having fewercomputationallycomplex,sinceit has smaller parameters and equations. It remains however as powerful as its predecessor. Here are the equations which govern the model's dynamism: Zt = σ(WZxt + UZht−1 + bZ) (update gate) Rt = σ(WRxt + URht−1 + bR) (reset gate) ht = Zt ◦ ht−1 + (1 − Zt) ◦ tanh(Whxt + Uh(Rt ◦ ht−1) + bh). Last year's curiosity in utilizing LSTM has unexpectedly increased and its predictive accuracy can illustrate its strength in nearly every area of science. It took about two
  • 4. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 412 decades for the strategyto becomemainstream. Toconstruct an LSTM model, initially we have to convert the Dataset training and test into a three-dimensional collection of "samples, features, and time measures". We utilize a layer of inputs, a hidden layer of LSTM blocks aswell asa singlelayer of output. The accompanying principles are employed to evaluate the amount of layers and the number of neurons in each layer: Input layer: Logically, we possessone entry;the number of neurons in that layer is calculated by the number of columns that can be quantified. Output layer: each neural network has a single output. Hidden layer: the size of this layer, that is to say the number of neurons, is to determine. GAN: Generative adversarial networks, or GANs for shorter, are a strategy to generative modeling incorporating techniques of deep learning, like neural convolution networks. GANs are an innovative means to teach a generative model by presenting the issue using two sub- models as a supervised learning problem: the generator model, and the discriminator model. There is teaching ofthe two models together. Naive Bayes is an illustration of a more commonly used generative model as a discriminative model. Naive Bayes functions by summing up the distribution of probability for each input variable and the output class. When a prediction is made, the likelihood for each potential outcome for each variable is determined, the individual probabilities are merged, and the highly realistic consequence is estimated. Other examples of generative models include Latent Dirichlet Allocation,orLDA, andtheGaussianMixtureModel, or GMM. RBM: A Restricted Boltzmann Machine is a generative stochastic artificial neural network which is able to learn a distribution of probability over its set of inputs. RBMs were first discovered in 1986 by Paul Smolensky under the name Harmonium. RBMs are a variety of Boltzmann devices, as their name suggests, with the constraint that its neurons should build a bipartisan graph. This limitation enables for increasingly effective training algorithms than those accessible for the general group of Boltzmann machines, in specific the contrastive divergence algorithm based on gradients. DBN: Deep Belief Network is an unsupervised probabilistic deep learning algorithm, made up of multilayered latent stochastic variables. DBN is a graphical model of the generative hybrid. Top two stratification are undirected. Two layers lower have guided interactions from above layers. DBN are pre trained using Greedy Algorithm. By adequately instantiating the weights of all the layers, pre- training improves optimization. DBNs are used in image recognition, video sequences, motion capture data and speech recognition. FORECAST PERFORMANCE METRICS(CommonAccuracy Statistics) RMSE: The RMSE is thesquare rootofresidual variance. This shows the model’s absolute fit for the data – how identical the measured data points are to the expected values of the model. RMSE is an absolute fit-measure. It can be assumed, focusing on a thumb rule, that the RMSE values between 0.2 and 0.5 indicate that the model was able to predict the relevant data reasonably. Adjusted R-squared upwards of 0.75 is, however, a very better value fortheaccuracydisplay. For certain cases it is also appropriate to have Modified R- squared of 0.4 or more. MSE: The mean squared error, or MSE, is measured as the square predicted error values average. The squaring of forecast error values causes them to be positive; it also puts greater weight on major errors. A greater MSE implies that the data values are massively scattered around its central moment (mean), and a lesser MSE indicatesotherwiseandis certainly the preferable and/or desired option because it indicates that the data values are distributed near to its central moment, which is normally large. MAPE: The mean absolute percentage error (MAPE) is the mean or average of forecast errors with an absolute percentage. Error is characterized as real or detected,minus the expected value. Percentage errors are summed up lacking regard to MAPE calculation signing. This calculation is easy to grasp, because it rendersthemistakeinpercentage terms. Furthermore, since absolute percentage errors are considered, the issue of equally dismissing positive and negative errors is prevented. MAPE therefore has a strategic reach and is a metric which is widely includedinforecasting. The smaller the MAPE the better the forecast. RELATED WORKS: A comparison analysis was carried out between LSTM and ARIMA in AjilaElmasdotter's forecasting sale retail[2]. Furthermore, ARIMA time series is a commonly used methodology for the budgetary time series23 in econometrics, numerous ARIMA models have been formulated to evaluate and project stock markets [24-26]. In addition, ARIMA is used for water expenditure / forecasting of consumption[27,28] and the requirement for electricity[29]. Most notably, there has been a great deal of concern in the usage of deep learning models [30,31], particularly recursive models like LSTM Neural Network for the estimation of budgetary time series, particularly in the stock market[32- 34].In[32] presented a simulationandsimulationofChina stock returns employing LSTM architecture via an accepted 27.2 percent accuracy, in[33] studied the usefulness of recurrent neural networks to the assessment of stock market price fluctuations. ShanghaiCompositeIndexandDow JonesIndex were eventually proposed[34].Withregardtobudgetanalysis and forecasting, hardly any research has been reported trying to compare or even implementingthetwotechniques, the majority of which have been adapted to the stock price as discussed above. It is due largely to the complexity of accessing specific data sets and the unreliablenatureofsuch data. None of its authors correlated LSTM and ARIMA models to efficiency. MAJOR DIFFERENCES BETWEEN ARIMA AND LSTM: Even though the work is new, it is obvious that LSTM architectures provide tremendous prospects as contenders for modeling and forecasting time series. We review the major variations between an LSTM and ARIMA in the following Table. The usage of RNNs, especially the LSTM architecture, allows us to set many parameters that we need to change in order to achieve optimum efficiency in the forecasting tasks. The problem lies in selecting the appropriate parameters to identify the correct architecture for the platform. An ARIMA model is easy to customize as it provides a reasonable efficiency, such model also includes the determination of the parameters p, d and q such that p is the order of the autoregressive part (AR), the order of differentiation and q the order of the average moving portion (MA).
  • 5. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 413 ARIMA LSTM ARIMA is a linear model Small amount of data is also sufficient for ARIMA (i.e., minimum number of observations is 50-100). ARIMA is a parametric model, which is to say for each series we have to define the parameters p, d and q. ARIMA is dedicated only for time series analysis LSTM is a nonlinear model. LSTM can store and process large amount of data. As LSTM is a non-parametric model, requires adjustment of some hypermeters. LSTM models can process sequential data as well. Evaluation measures of real world time series data: MAE RMSE MAPE SMAPE 1 Formula 1 1 ˆ n i i i MAE y y n = = − ∑ ( ) 2 1 n i i i f o RMSE n = − = ∑ 1 1 n t t t t A F MAPE n A = − = ∑ ( ) 1 100% / 2 n t t t t t F A n A F = − + ∑ 2 Acceptable range The lower the MSE, the higher the accuracy of the prediction. Lower values of RMSE indicate better fit. Since, MAPE is a measure of error, high number are bad and low numbers are good Lower the value, higher the accuracy. Comparison of Software: ARIMA LSTM EViews Julia. Mathematicia MATLAB NCSS Python R Ruby SAS IBM SPSS Stata Python Rnn Tensor Flow IBM SPSS MATLAB SAS Julia Ruby CONCLUSIONS: These methodologies are acquiring prominence amongst investigators across diverse disciplines with the latest enhancement in the development of advanced machine learning-based approaches and especially deep learning algorithms. The major concern then is how specifically and effectively these newly developed techniques compete with conventional methods. The outcomes of the overall discrepancy in error indicate that in regards of both RMSE and MAE, the LSTM-model tended to have greaterpredictive accuracy than the ARIMA-model. All in all, we have shown that on lengthy-term predictions, neural networks provide more precision. Since the error for the ARIMA model presented by Box Jenkins is not significantly higher, we note that its use remains an advantage in the prediction of time series. We also saw the two models enjoy complimentary differences: ARIMA is far pickier on seasonal peaks while LSTM suits the balanced portion of the series better. For some developed countries dysfunctional economic system makes reliable prediction models complicated. The probability of having a better estimate (without sacrificing the complexity of the data through diff and linearization) emerges with the aid of new approaches in RNN. References: [1] Chatfield C. Time-series forecasting. CRC press; 2000 Oct 25. [2] Elmasdotter A, Nyströmer C. A comparative study between LSTM and ARIMA for sales forecasting in retail. [3] Fathi O. Time series forecastingusinga hybridARIMA and LSTM model. In Velvet Consulting 2019. [4] Zhang GP. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing. 2003 Jan 1; 50: 159-75. [5] Zhang GP. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing. 2003 Jan 1; 50: 159-75. [6] Chen L, Wang S, Fan W, Sun J, Naoi S. Beyond human recognition: A CNN-based framework for handwritten character recognition. In2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR) 2015 Nov 3 (pp. 695-699). IEEE. [7] Siami-Namini S, Tavakoli N, Namin AS. A comparison of ARIMA and LSTM in forecastingtimeseries.In2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA) 2018 Dec 17 (pp. 1394-1401). IEEE. [8] A. M. Alonso, C. Garcia-Martos. Time Series Analysis - Forecasting with ARIMA models. In2012 Universidad Carlos III de Madrid, Universidad Politecnica de Madrid. [9] Earnest A, Chen MI, Ng D, Sin LY.Usingautoregressive integrated moving average (ARIMA) models to predict and monitor the number of beds occupied during a SARS outbreak in a tertiary hospital in Singapore. BMC Health Services Research. 2005 Dec 1; 5(1): 36. [10] Tang R, Zeng F, Chen Z, Wang JS, Huang CM, WuZ.The Comparison of Predicting Storm-time Ionospheric TEC by Three Methods: ARIMA, LSTM, and Seq2Seq. Atmosphere. 2020 Apr; 11(4): 316. [11] Khashei M, Bijari M. A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Applied Soft Computing. 2011 Mar 1; 11(2): 2664-75. [12] Liu LM, Hudak GB, Box GE, Muller ME, Tiao GC. Forecasting and time series analysis using the SCA statistical system. DeKalb, IL: Scientific Computing Associates; 1992.
  • 6. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD39831 | Volume – 5 | Issue – 3 | March-April 2021 Page 414 [13] K. Lawrence and M. Geurts. Advances in Businessand Management Forecasting. In2004 vol. 19. Emerald Group Publishing. [14] Rhanoui M, Yousfi S, Mikram M, Merizak H. Forecasting financial budget time series: ARIMA random walk vs LSTM neural network. IAES International Journal of Artificial Intelligence. 2019 Dec 1; 8(4). [15] Rojas R. Neural networks: a systematic introduction. Springer Science & Business Media; 2013 Jun 29. [16] Li X, Peng L, Yao X, Cui S, Hu Y, You C, Chi T. Long short-term memory neural network for air pollutant concentration predictions: Method developmentand evaluation. Environmental pollution.2017Dec1;231: 997-1004. [17] Cortez B, Carrera B, Kim YJ, Jung JY. An architecture for emergency event predictionusingLSTM recurrent neural networks. Expert Systems with Applications. 2018 May 1; 97: 315-24. [18] Krstanovic S, Paulheim H. Ensembles of recurrent neural networks for robust time series forecasting.In International Conference on Innovative Techniques and Applications of Artificial Intelligence2017Dec12 (pp. 34-46). Springer, Cham. [19] Jia Y, Wu Z, Xu Y, Ke D, Su K. Long Short-Term Memory Projection Recurrent Neural Network Architectures for Piano’s Continuous Note Recognition. Journal of Robotics. 2017 Jan 1; 2017. [20] Chung J, Gulcehre C, Cho K, Bengio Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555. 2014 Dec 11. [21] Schmidhuber J. Deep learning in neural networks: An overview. Neural networks. 2015 Jan 1; 61: 85-117. [22] Hochreiter S, Schmidhuber J. Long short-term memory. Neural computation. 1997 Nov 15; 9(8): 1735-80. [23] Mills TC, Markellos RN. The econometric modellingof financial time series. Cambridge University Press; 2008 Mar 20. [24] Pai PF, Lin CS. A hybrid ARIMA and support vector machines model in stock price forecasting. Omega. 2005 Dec 1; 33(6): 497-505. [25] Jarrett JE, KyperE.ARIMAmodelingwithintervention to forecast and analyze Chinese stock prices. International Journal of Engineering Business Management. 2011 Aug 15; 3(3): 53-8. [26] Ariyo AA, Adewumi AO, Ayo CK. Stock price prediction using the ARIMA model. In2014 UKSim- AMSS 16th International Conference on Computer Modelling and Simulation2014Mar26(pp.106-112). IEEE. [27] Birylo M, Rzepecka Z, Kuczynska-Siehien J, Nastula J. Analysis of water budget prediction accuracy using ARIMA models. Water Science and Technology: Water Supply. 2018 Jun; 18(3): 819-30. [28] Razali SN, Rusiman MS, Zawawi NI, Arbin N. Forecasting ofwaterconsumptions expenditureusing Holt-Winter’s and ARIMA. [29] Kandananond K. Electricity demand forecasting in buildings based on ARIMA and ARX models. In Proceedings of the 8th International Conference on Informatics, Environment, Energy and Applications 2019 Mar 16 (pp. 268-271). [30] Chong E, Han C, Park FC. Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies. Expert Systems with Applications. 2017 Oct 15;83:187-205. [31] Singh R, Srivastava S. Stock prediction using deep learning. Multimedia Tools and Applications. 2017 Sep 1; 76(18): 18569-84. [32] Chen K, Zhou Y, Dai F. A LSTM-basedmethodforstock returns prediction: A case study of China stock market. In2015 IEEE international conference on big data (big data) 2015 Oct 29 (pp. 2823-2824). IEEE. [33] Nelson DM, Pereira AC, de Oliveira RA.Stock market's price movement prediction with LSTM neural networks. In2017 International joint conference on neural networks (IJCNN) 2017 May 14 (pp. 1419- 1426). IEEE. [34] Jiang Q, Tang C, Chen C, Wang X, Huang Q. Stock price forecast based on LSTM neural network. In International Conference onManagementScienceand Engineering Management 2018 Aug 1 (pp. 393-408). Springer, Cham.