SlideShare a Scribd company logo
1 of 4
Download to read offline
ISSN: 2278 – 1323
                       International Journal of Advanced Research in Computer Engineering & Technology
                                                                           Volume 1, Issue 4, June 2012



       Applicability of Box Jenkins ARIMA Model in
            Crime Forecasting: A case study of
              counterfeiting in Gujarat State
                                                 Anand Kumar Shrivastav, Dr. Ekata



                                                                            Institute of Justice (NIJ) awarded five grants to study crime
   Abstract— For any police organization, detection and                      forecasting for police use as an extension of crime mapping
prevention of crime incidents is a big challenge. Normally police            [12]. The purpose of this paper was to probe the applicability
organization maintains data of crime and criminals for various               of univariate time series model for crime forecasting. This
purposes like investigation, coordination and future strategies.             paper, attempts to outline the practical steps which need to be
The forecast made with the help of historical crime data may                 undertaken to use Box Jenkins ARIMA time series models
support law enforcement agencies in their decision making
activities and tactical operations. The time series is one of the
                                                                             for crime forecasting. The paper is organized as following: In
tools for making prediction and autoregressive integrated                    Section 2, brief outline of time series forecasting, ARIMA
moving average (ARIMA) model has been successfully used in                   and ARIMA modeling have been given. In section 3 we have
forecasting econometrics, social science and many other                      applied ARIMA model on historical crime data of Gujarat
problems. This model has the advantage of accurate forecasting               State, pertaining to counterfeiting, to make short term
over short-term. In this study, we have utilized the crime data              forecasting. Section 4 is related to results and conclusions.
of Gujarat State pertaining to counterfeiting of currency. We
have used Box Jenkins ARIMA model for short term crime                                      II. TIME SERIES FORECASTING
forecasting.
                                                                                A time series is a set of data pertaining to the values of a
    Index Terms— ARIMA, Box Jenkins, Crime, Forecasting.                     variable at different times. A time series has an important
                                                                             property which makes it quite distinct from any other kind of
                                                                             statistical data. Formal examples of time series are the
                       I. INTRODUCTION                                       population of India at each successive decennial census; daily
                                                                             business handled by a bank, monthly production statistics of
   Despite the increased emphasis on proactive policing, the
                                                                             a steel mill; monthly traffic accident fatalities etc. The crime
core of police work remains that of responding to calls for
                                                                             criminal data can be arranged in a regular fashion i.e.
service, making effective deployment strategies paramount to
                                                                             monthly, quarterly, half yearly or yearly and can be assumed
a well-functioning police department [12]. The crime
                                                                             to be a time series data. Time series analysis is used to detect
forecasting is an emerging approach in criminological
                                                                             patterns of change in statistical information over regular
research. Crime forecasting is not widely practiced by
                                                                             intervals of time. We project these patterns to arrive at an
police. While there are numerous econometric studies of
                                                                             estimate for the future. The time series analysis helps us cope
crime or incorporating crime in the literature, one is
                                                                             with uncertainty about the future [8]. All statistical
hard-pressed to find police departments or other police
                                                                             forecasting methods are extrapolatory in nature i.e. they
organizations making regular use of forecasting for policing.
                                                                             involve the projection of past patterns or relationships into
From a more proactive standpoint, problem-oriented policing
                                                                             the future [3]. Time series forecast is one of the most
efforts may be enhanced by a more accurate scanning of areas
                                                                             important tools for research in the field of social sciences and
with crime problems, in that one can examine both
                                                                             engineering. Forecasting is an essential tool in any decision
distributions of past crimes and predictions of future
                                                                             making process. The quality of the forecasts management can
concentrations [8]. The ability to predict can serve as a
                                                                             make it strongly related to the information that can be
valuable source of knowledge for law enforcement agencies,
                                                                             extracted and used from past data.
both from tactical as well strategic perspectives. Forecasting
can help a police department's performance by strategic
                                                                             A.) ARIMA (Autoregressive Integrated Moving Average)
deployment efforts and efficient investigation direction. The
origin of crime forecasting is in year 1998, when US National
                                                                                ARIMA model was introduced by Box and Jenkins (hence
                                                                             also known as Box-Jenkins model) in 1960. It is an
   Manuscript received May 15, 2012.                                         extrapolation method for forecasting and, like any other such
    Anand Kumar Shrivastav, Research Scholar, Department of Computer
Science,     Mewar      University,   Chittorgarh,     India,     (e-mail:
                                                                             method, it requires only the historical time series data on the
shrivastav.anand@gmail.com).                                                 variable under forecasting. ARIMA models are the most
   Dr. Ekata, Associate Professor, Department of Applied Science, Krishna    general class of models for forecasting a time series.
Institute of Engineering and Technology, Ghaziabad, India (e-mail:           Normally, the ARIMA model is represented as
ekata4@rediffmail.com).
                                                                             ARIMA(p,d,q) where p is the number of autoregressive
.
                                                                                                                                         494
                                              All Rights Reserved © 2012 IJARCET
ISSN: 2278 – 1323
                    International Journal of Advanced Research in Computer Engineering & Technology
                                                                        Volume 1, Issue 4, June 2012
terms, d is the number of non-seasonal differences, and q is        functions (PACFs). The following table summarizes rough
the number of lagged forecast errors in the prediction              guideline for using these parameters in initial model
equation. The identification of the appropriate ARIMA               identification.
model for a time series begins with the process of finding
integer, usually very small (e.g., 0, 1, or 2), values of p, d,                       AR(p)                MA(q)            ARMA(p,q)
and q that model the patterns in the data. When the value is 0,        ACF            Tails off            Cuts off after   Tails off
the element is not needed in the model. The middle element,                                                lag q
d, also known as trend component is investigated before p              PACF           Cuts off after       Tails off        Tails off
and q. The goal is to determine if the process is stationary                          lag p
and, if not, to make it stationary before determining the
values of p and q. The augmented Dickey–Fuller (ADF) test
                                                                    B.) Steps of ARIMA model
is most widely used test for checking stationarity of a series.
If d = 0, the model becomes ARMA, which is linear
stationary model. ARIMA (i.e. d > 0) is a linear                    (i)     Identification of ARIMA (p,d,q) structure
non-stationary model. If the underlying time series is              (ii)    Estimating the coefficients of the formulation
non-stationary, taking the difference of the series with itself     (iii)   Fitting test on the estimated residuals
„d‟ times makes it stationary, and then ARMA is applied onto        (iv)    Forecasting the future outcomes based on the historical
the differenced series. A stationary process has a constant                  data
mean and variance over the time period. There are various
methods available to make a time series stationary. Normally          The steps of ARIMA model building methodology is
differencing techniques are used to transform a time series         presented in a flow chart in figure-1.
from a non-stationary to stationary by subtracting each datum
in a series from its predecessor. If a series is stationary
without any differencing it is designated as I(0), or integrated
of order 0. On the other hand, a series that has stationary first
differences is designated I(1), or integrated of order 1. The
term „shock‟ is used to indicate an unexpected change in the
value of a variable (or error). For a stationary series a shock
will gradually die away. In other words, the effect of a shock
during time„t‟ will have a smaller effect in time„t+1‟, a still
smaller effect in time„t+2‟, etc. The lags of the differenced
series appearing in the forecasting equations are called
“auto-regressive” terms. The auto-regressive components
represent the memory of the process for preceding
observations. The value of p is the number of auto-regressive
components in an ARIMA (p, d, q) model. The value of p is 0
if there is no relationship between adjacent observations.                           (Fig.1: ARIMA model building steps)
When the value of p is 1, there is a relationship between
observations at lag 1 and the correlation coefficient      is the
magnitude of the relationship. When the value of p is 2, there        III. BUILDING ARIMA MODEL FOR CRIME FORECASTING
is a relationship between observations at lag 2 and the
correlation coefficient          is the magnitude of the
                                                                      The formulation of ARIMA model depends on the
relationship. Thus p is the number of correlations we need to       characteristics of the series. In this paper, we have used the
model the relationship. The lags of the forecast errors are         Crime data of counterfeit currency for past 8 years (96
called “moving average” terms. The moving average                   months) beginning year 2001 of Gujarat State (Table-1). The
components represent the memory of the process for                  crime data has been taken from “Crime in India” , an annual
preceding random shocks. The value q indicates the number           publication of National Crime Records Bureau
of moving average components in an ARIMA (p, d, q). When            (www.ncrb.gov.in). We have utilized 93 months data for
q is zero, there are no moving average components. When q           analysis and 3 months data for validation of our forecasted
is 1, there is a relationship between the current score and the     results. We have used GRETL (Gnu Regression,
random shock at lag 1 and the correlation coefficient θ1            Econometrics and Time-series Library) software for plotting
represents the magnitude of the relationship. When q is 2,          the graphs and analysis of the data set.
there is a relationship between the current score and the
random shock at lag 2, and the correlation coefficient θ2
                                                                            (Table-1: Registered cases of Counterfeit Currency)
represents the magnitude of the relationship. When one of the                 Year     Month      No. of    Year   Month    No. of
terms is zero, it‟s usual to drop AR, I or MA component. For                                      Cases                     Cases
example, I(1) model is ARIMA(0,1,0), and a MA(1) model                        2001     Jan             2    2005   Jan          21
is ARIMA(0,0,1). The autocorrelation function (ACF) and                                Feb             6           Feb          90
partial correlation function (PACF) are very important for the                         Mar             4           Mar          75
definition of the internal structure of the analysed series. The                       Apr             4           Apr          37
models can be identified through patterns in their                                     May             2           May          18
autocorrelation functions (ACFs) and partial autocorrelation
                                                                                                                                        495
                                        All Rights Reserved © 2012 IJARCET
ISSN: 2278 – 1323
                                                     International Journal of Advanced Research in Computer Engineering & Technology
                                                                                                         Volume 1, Issue 4, June 2012
                                               Jun           2             Jun            26                  The non-stationarity was also confirmed with the help of
                                               Jul           5             Jul            19                Augmented Dickey Fuller (ADF) test. The first order
                                               Aug           3             Aug            22                differencing transformation was made to make the series
                                               Sep           3             Sep            29                stationary. The time series plot of original and differenced
                                               Oct           4             Oct            12                series are shown at figure-2a and Figure-2b respectively. The
                                               Nov           3             Nov            19                ACF and PACF plot of differenced series is shown at
                                               Dec           3             Dec            18                figure-3.
                                   2002        Jan           6     2006    Jan            20
                                               Feb           1             Feb            19                   80


                                               Mar           3             Mar            18
                                                                                                               60
                                               Apr           2             Apr             7
                                               May           4             May            15
                                                                                                               40
                                               Jun           8             Jun            10
                                               Jul           6             Jul            14                   20

                                               Aug           1             Aug             8




                                                                                                  d_Cases
                                               Sep           5             Sep             5                        0

                                               Oct           7             Oct            15
                                               Nov           4             Nov            16                  -20


                                               Dec           2             Dec            13
                                                                                                              -40
                                   2003        Jan           4     2007    Jan            17
                                               Feb           3             Feb            21
                                                                                                              -60
                                               Mar           4             Mar            15
                                               Apr           2             Apr            23                  -80
                                                                                                                        2001      2002    2003   2004           2005       2006   2007           2008
                                               May           3             May            25
                                               Jun           2             Jun            35                                   (Fig.2b: Time series plot of differenced series)
                                               Jul           4             Jul            32
                                               Aug           2             Aug            10                                                            ACF for d_Cases

                                               Sep          82             Sep            16                 0.4                                                                         +- 1.96/T^0.5
                                                                                                             0.3
                                               Oct          89             Oct            19                 0.2

                                               Nov          19             Nov             6                 0.1
                                                                                                               0
                                               Dec          31             Dec            19                 -0.1
                                                                                                             -0.2
                                   2004        Jan          19     2008    Jan            14                 -0.3
                                                                                                             -0.4
                                               Feb          17             Feb            18                        0                    5                     10                 15                      20

                                               Mar           3             Mar            11                                                                  lag


                                               Apr           6             Apr            11                                                            PACF for d_Cases

                                               May           2             May            10                 0.4                                                                         +- 1.96/T^0.5
                                                                                                             0.3
                                               Jun           4             Jun             9                 0.2
                                                                                                             0.1
                                               Jul           2             Jul            15                   0
                                                                                                             -0.1
                                               Aug           4             Aug            11                 -0.2
                                                                                                             -0.3
                                               Sep           3             Sep            11                 -0.4

                                               Oct           5             Oct             8                        0                    5                     10                 15                      20
                                                                                                                                                              lag
                                               Nov           6             Nov            10
                                               Dec          42             Dec            16                                    (Fig.3: Correllogram of differenced series)

                            The Box-Jenkins‟s methodology for forecasting requires
                          the series to be stationary. The time series plot and                             A.) Model Parameters Estimation
                          correllogram of data provide a strong evidence of a
                          non-stationary series.                                                              There are several model selection criteria like AIC (Akaike
                              90
                                                                                                            information criteria), HQ (Hannan-Quinn criteria) and
                              80
                                                                                                            SIC(Schwarz information criteria). Among the several
                              70                                                                            methods studied in the literature to judge the fitness of the
                              60
                                                                                                            models, we used Akaike information criterion (AIC) [2].
                                                                                                            According to this the model with least AIC value will be
Cases of Counterfeiting




                              50

                                                                                                            selected. We entertained six tentative ARMA models and
                              40
                                                                                                            chose that model which has minimum AIC (Akaike
                              30
                                                                                                            Information Criterion). We have used GRETL(Gnu
                              20                                                                            Regression, Econometrics and Time-series Library) software
                              10
                                                                                                            for model identification and forecasting. Total 118 function
                                                                                                            evaluations and 35 gradient evaluations were performed by
                              0
                                   2001       2002   2003   2004    2005    2006   2007    2008
                                                                                                            GRETL to find the model parameters. The models and
                                          (Fig.2a: Time series plot of original series)                     corresponding AIC, HQ and BIC values are given in table-2.

                                                                                                                                                                                                         496
                                                                           All Rights Reserved © 2012 IJARCET
ISSN: 2278 – 1323
                          International Journal of Advanced Research in Computer Engineering & Technology
                                                                              Volume 1, Issue 4, June 2012
                                                                                                 100
It is obvious from table-2, that the ARIMA(1,1,1) is the best                                                        Cases
                                                                                                                  forecast
                                                                                                        95 percent interval
model.
                                                                                                 80



               (Table-2: Alternative ARIMA model)
                                                                                                 60
    ARIM          Akaike       Hannan-Quinn Schwarz
    A             criterion    criterion     criterion
    Model                                                                                        40



    101            762.8011         766.8915                       772.9315
    010            770.0137         771.0315                       772.5354                      20



    110            770.9250         773.9785                       778.4904
                                                                                                  0
    111            757.0627         761.1339                       767.1498
    211            758.9649         764.0539                       771.5738
                                                                                                 -20
    210            768.3624         772.4336                       778.4495                            2005       2005.5      2006   2006.5   2007   2007.5   2008   2008.5    2009



                                                                                                               (Fig.5: Plot of actual and forecast data)

B.) Model verification
                                                                                                                                IV. CONCLUSION
  The model verification is concerned with checking the
residuals of the model to see if they contain any systematic
                                                                                             Crime forecasting is an interesting application area of
pattern which still can be removed to improve on the chosen
                                                                                           research and ARIMA model offers a good technique for
ARIMA. This is done through examining the
                                                                                           predicting the magnitude of any variable. Its strength lies in
autocorrelations and partial autocorrelations of the residuals
                                                                                           the fact that the method is suitable for any time series with
of various orders. For this purpose, the various correlations
                                                                                           any pattern of change and it does not require the forecaster to
upto 16 lags were computed and the same along with their
                                                                                           choose a priori the value of any parameter. Its limitations
significance were tested by Q-statistics. It was observed that,
                                                                                           include its requirement of a long time series. In our study the
none of these correlations is significantly different from zero
                                                                                           developed model for crime forecasting was found to be
at a reasonable level. The ACF and PACF of the residuals
                                                                                           ARIMA (1,1,1). The validity of the forecasted values was
(Figure-4) also indicate good fit of the model. This proves
                                                                                           checked with the actual data for the lead periods and it is
that the selected ARIMA model is an appropriate model.
                                                                                           established that the ARIMA model can be used by researcher
                                      Residual ACF

   0.3
                                                                                           for short time forecasting of crime in India.
                                                                      +- 1.96/T^0.5
   0.2

   0.1
                                                                                                                                     REFERENCES
     0

   -0.1

   -0.2
                                                                                           [1]  Ajoy K. Palit and Dobrivoje Popovic; “Computational Intelligence in
   -0.3                                                                                         Time Series Forecasting Theory and Engineering Applications”, 2005,
          0    2      4       6          8            10      12        14            16
                                                                                                Springer
                                             lag
                                                                                           [2] Akaike, H., 1974, “A new look at the statistical model identification,”
                                      Residual PACF                                             IEEE Transactions on Automatic control, 19(6), 716-723.
   0.3
                                                                      +- 1.96/T^0.5        [3] Alan Pankratz, “Forecasting With Univariate Box- Jenkins Models,
   0.2
                                                                                                Concepts and cases”, John Wily & Sons 1983.
   0.1

     0
                                                                                           [4] Bowerman, B. L., Connell, R. T., and Koehler, A B; “Forecasting,
   -0.1
                                                                                                Time Series, and Regression: An Applied Approach”, Thomson,
   -0.2                                                                                         Belmont, CA 2005.
   -0.3                                                                                    [5] Box G E, Jenkinks G M; “Time series analysis: Forecasting and
          0    2      4       6          8            10      12        14            16
                                             lag
                                                                                                control”; Holden Day, San Francisco
                                                                                           [6] C.B. Gupta, Vijay Gupta; “An Introduction to Statistical Methods”,
                   (Fig.4: Correllogram of residuals)                                           Vikas Publishing House; 295-336.
                                                                                           [7] Dickey D A, Fuller W A; “Distribution of the estimators for
C.) Forecast                                                                                    autoregressive time series with a unit root”; Journal of the American
                                                                                                Statistical Association 74, 427-431.
                                                                                           [8] Groff, Elizabeth R., and La Vigne, Nancy G., : “Forecasting The
   The results show in table-3 and Figure-5 indicate that the                                   Future Of Predictive Crime Mapping, Crime Prevention Studies”, vol.
forecasting model selected is appropriate, therefore the                                        13, pp. 29-57 (2002)
model can be used for forecasting crime in India.                                          [9] Loganathan, Nanthakumar and Yahaya Ibrahim; “Forecasting
                                                                                                International Tourism Demand in Malaysia using Box Jenkins Sarima
                                                                                                Application”; South Asian Journal of Tourism and Heritage (2010),
                       (Table-3: Forecast data)                                                 Vol. 3, Number 2
              For 95% confidence intervals, z(0.025) = 1.96                                [10] Richard I Levin, David S Rubin; “Statistics for Management”; Pearson
  Obs           Cases     prediction std. err. 95% interval                                     Prentice Hall, 861-919.
                                                                                           [11] S. Fan, L. Chen and W. J. Lee, "Short-term load forecasting using
2008:10       undefined           15. 50           13.93           (-11.81, 42.81)              comprehensive combination based on multimeteorological
2008:11       undefined           18.17            16.10           (-13.39, 49.73)              information," IEEE Trans. Industry Applications, Vol. 45, No. 4,
2008:12       undefined           19.78            16.77           (-13.09, 52.64)              July/Aug. 2009.
                                                                                           [12] Wipen Gorr, Richard Harries; “Introduction to Crime forecasting”;
                                                                                                International Journal of Forecasting 19, 2003, 551-555.




                                                                                                                                                                              497
                                                           All Rights Reserved © 2012 IJARCET

More Related Content

Similar to 494 497

The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 
Propose Data Mining AR-GA Model to Advance Crime analysis
Propose Data Mining AR-GA Model to Advance Crime analysisPropose Data Mining AR-GA Model to Advance Crime analysis
Propose Data Mining AR-GA Model to Advance Crime analysisIOSR Journals
 
Crime forecasting system for soic final
Crime forecasting system for soic finalCrime forecasting system for soic final
Crime forecasting system for soic finalTwaha Minai
 
On Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in NigeriaOn Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in NigeriaScientific Review SR
 
Crime Prediction and Analysis
Crime Prediction and AnalysisCrime Prediction and Analysis
Crime Prediction and AnalysisIRJET Journal
 
Life and science journal.pdf
Life and science journal.pdfLife and science journal.pdf
Life and science journal.pdfSarita30844
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...IJITCA Journal
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...IJITCA Journal
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...IJITCA Journal
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...IJITCA Journal
 
A Survey on Data Mining Techniques for Crime Hotspots Prediction
A Survey on Data Mining Techniques for Crime Hotspots PredictionA Survey on Data Mining Techniques for Crime Hotspots Prediction
A Survey on Data Mining Techniques for Crime Hotspots PredictionIJSRD
 
Inspection of Certain RNN-ELM Algorithms for Societal Applications
Inspection of Certain RNN-ELM Algorithms for Societal ApplicationsInspection of Certain RNN-ELM Algorithms for Societal Applications
Inspection of Certain RNN-ELM Algorithms for Societal ApplicationsIRJET Journal
 
Crime Data Analysis, Visualization and Prediction using Data Mining
Crime Data Analysis, Visualization and Prediction using Data MiningCrime Data Analysis, Visualization and Prediction using Data Mining
Crime Data Analysis, Visualization and Prediction using Data MiningAnavadya Shibu
 
IRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data AnalyticsIRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data AnalyticsIRJET Journal
 
Using machine learning algorithms to
Using machine learning algorithms toUsing machine learning algorithms to
Using machine learning algorithms tomlaij
 
Survey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine LearningSurvey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine LearningIRJET Journal
 
SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...
SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...
SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...ijaia
 
Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...
Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...
Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...gerogepatton
 
Predictive Modeling for Topographical Analysis of Crime Rate
Predictive Modeling for Topographical Analysis of Crime RatePredictive Modeling for Topographical Analysis of Crime Rate
Predictive Modeling for Topographical Analysis of Crime RateIRJET Journal
 

Similar to 494 497 (20)

The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
Propose Data Mining AR-GA Model to Advance Crime analysis
Propose Data Mining AR-GA Model to Advance Crime analysisPropose Data Mining AR-GA Model to Advance Crime analysis
Propose Data Mining AR-GA Model to Advance Crime analysis
 
Crime forecasting system for soic final
Crime forecasting system for soic finalCrime forecasting system for soic final
Crime forecasting system for soic final
 
ACCESS.2020.3028420.pdf
ACCESS.2020.3028420.pdfACCESS.2020.3028420.pdf
ACCESS.2020.3028420.pdf
 
On Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in NigeriaOn Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in Nigeria
 
Crime Prediction and Analysis
Crime Prediction and AnalysisCrime Prediction and Analysis
Crime Prediction and Analysis
 
Life and science journal.pdf
Life and science journal.pdfLife and science journal.pdf
Life and science journal.pdf
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
 
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
APPLYING DATA ENVELOPMENT ANALYSIS AND CLUSTERING ANALYSIS IN ENHANCING THE P...
 
A Survey on Data Mining Techniques for Crime Hotspots Prediction
A Survey on Data Mining Techniques for Crime Hotspots PredictionA Survey on Data Mining Techniques for Crime Hotspots Prediction
A Survey on Data Mining Techniques for Crime Hotspots Prediction
 
Inspection of Certain RNN-ELM Algorithms for Societal Applications
Inspection of Certain RNN-ELM Algorithms for Societal ApplicationsInspection of Certain RNN-ELM Algorithms for Societal Applications
Inspection of Certain RNN-ELM Algorithms for Societal Applications
 
Crime Data Analysis, Visualization and Prediction using Data Mining
Crime Data Analysis, Visualization and Prediction using Data MiningCrime Data Analysis, Visualization and Prediction using Data Mining
Crime Data Analysis, Visualization and Prediction using Data Mining
 
IRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data AnalyticsIRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data Analytics
 
Using machine learning algorithms to
Using machine learning algorithms toUsing machine learning algorithms to
Using machine learning algorithms to
 
Survey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine LearningSurvey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine Learning
 
SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...
SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...
SUPERVISED AND UNSUPERVISED MACHINE LEARNING METHODOLOGIES FOR CRIME PATTERN ...
 
Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...
Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...
Supervised and Unsupervised Machine Learning Methodologies for Crime Pattern ...
 
Predictive Modeling for Topographical Analysis of Crime Rate
Predictive Modeling for Topographical Analysis of Crime RatePredictive Modeling for Topographical Analysis of Crime Rate
Predictive Modeling for Topographical Analysis of Crime Rate
 

More from Editor IJARCET

Electrically small antennas: The art of miniaturization
Electrically small antennas: The art of miniaturizationElectrically small antennas: The art of miniaturization
Electrically small antennas: The art of miniaturizationEditor IJARCET
 
Volume 2-issue-6-2205-2207
Volume 2-issue-6-2205-2207Volume 2-issue-6-2205-2207
Volume 2-issue-6-2205-2207Editor IJARCET
 
Volume 2-issue-6-2195-2199
Volume 2-issue-6-2195-2199Volume 2-issue-6-2195-2199
Volume 2-issue-6-2195-2199Editor IJARCET
 
Volume 2-issue-6-2200-2204
Volume 2-issue-6-2200-2204Volume 2-issue-6-2200-2204
Volume 2-issue-6-2200-2204Editor IJARCET
 
Volume 2-issue-6-2190-2194
Volume 2-issue-6-2190-2194Volume 2-issue-6-2190-2194
Volume 2-issue-6-2190-2194Editor IJARCET
 
Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189Editor IJARCET
 
Volume 2-issue-6-2177-2185
Volume 2-issue-6-2177-2185Volume 2-issue-6-2177-2185
Volume 2-issue-6-2177-2185Editor IJARCET
 
Volume 2-issue-6-2173-2176
Volume 2-issue-6-2173-2176Volume 2-issue-6-2173-2176
Volume 2-issue-6-2173-2176Editor IJARCET
 
Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172Editor IJARCET
 
Volume 2-issue-6-2159-2164
Volume 2-issue-6-2159-2164Volume 2-issue-6-2159-2164
Volume 2-issue-6-2159-2164Editor IJARCET
 
Volume 2-issue-6-2155-2158
Volume 2-issue-6-2155-2158Volume 2-issue-6-2155-2158
Volume 2-issue-6-2155-2158Editor IJARCET
 
Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154Editor IJARCET
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Editor IJARCET
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Editor IJARCET
 
Volume 2-issue-6-2139-2142
Volume 2-issue-6-2139-2142Volume 2-issue-6-2139-2142
Volume 2-issue-6-2139-2142Editor IJARCET
 
Volume 2-issue-6-2130-2138
Volume 2-issue-6-2130-2138Volume 2-issue-6-2130-2138
Volume 2-issue-6-2130-2138Editor IJARCET
 
Volume 2-issue-6-2125-2129
Volume 2-issue-6-2125-2129Volume 2-issue-6-2125-2129
Volume 2-issue-6-2125-2129Editor IJARCET
 
Volume 2-issue-6-2114-2118
Volume 2-issue-6-2114-2118Volume 2-issue-6-2114-2118
Volume 2-issue-6-2114-2118Editor IJARCET
 
Volume 2-issue-6-2108-2113
Volume 2-issue-6-2108-2113Volume 2-issue-6-2108-2113
Volume 2-issue-6-2108-2113Editor IJARCET
 
Volume 2-issue-6-2102-2107
Volume 2-issue-6-2102-2107Volume 2-issue-6-2102-2107
Volume 2-issue-6-2102-2107Editor IJARCET
 

More from Editor IJARCET (20)

Electrically small antennas: The art of miniaturization
Electrically small antennas: The art of miniaturizationElectrically small antennas: The art of miniaturization
Electrically small antennas: The art of miniaturization
 
Volume 2-issue-6-2205-2207
Volume 2-issue-6-2205-2207Volume 2-issue-6-2205-2207
Volume 2-issue-6-2205-2207
 
Volume 2-issue-6-2195-2199
Volume 2-issue-6-2195-2199Volume 2-issue-6-2195-2199
Volume 2-issue-6-2195-2199
 
Volume 2-issue-6-2200-2204
Volume 2-issue-6-2200-2204Volume 2-issue-6-2200-2204
Volume 2-issue-6-2200-2204
 
Volume 2-issue-6-2190-2194
Volume 2-issue-6-2190-2194Volume 2-issue-6-2190-2194
Volume 2-issue-6-2190-2194
 
Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189
 
Volume 2-issue-6-2177-2185
Volume 2-issue-6-2177-2185Volume 2-issue-6-2177-2185
Volume 2-issue-6-2177-2185
 
Volume 2-issue-6-2173-2176
Volume 2-issue-6-2173-2176Volume 2-issue-6-2173-2176
Volume 2-issue-6-2173-2176
 
Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172
 
Volume 2-issue-6-2159-2164
Volume 2-issue-6-2159-2164Volume 2-issue-6-2159-2164
Volume 2-issue-6-2159-2164
 
Volume 2-issue-6-2155-2158
Volume 2-issue-6-2155-2158Volume 2-issue-6-2155-2158
Volume 2-issue-6-2155-2158
 
Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
 
Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124Volume 2-issue-6-2119-2124
Volume 2-issue-6-2119-2124
 
Volume 2-issue-6-2139-2142
Volume 2-issue-6-2139-2142Volume 2-issue-6-2139-2142
Volume 2-issue-6-2139-2142
 
Volume 2-issue-6-2130-2138
Volume 2-issue-6-2130-2138Volume 2-issue-6-2130-2138
Volume 2-issue-6-2130-2138
 
Volume 2-issue-6-2125-2129
Volume 2-issue-6-2125-2129Volume 2-issue-6-2125-2129
Volume 2-issue-6-2125-2129
 
Volume 2-issue-6-2114-2118
Volume 2-issue-6-2114-2118Volume 2-issue-6-2114-2118
Volume 2-issue-6-2114-2118
 
Volume 2-issue-6-2108-2113
Volume 2-issue-6-2108-2113Volume 2-issue-6-2108-2113
Volume 2-issue-6-2108-2113
 
Volume 2-issue-6-2102-2107
Volume 2-issue-6-2102-2107Volume 2-issue-6-2102-2107
Volume 2-issue-6-2102-2107
 

Recently uploaded

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Recently uploaded (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

494 497

  • 1. ISSN: 2278 – 1323 International Journal of Advanced Research in Computer Engineering & Technology Volume 1, Issue 4, June 2012 Applicability of Box Jenkins ARIMA Model in Crime Forecasting: A case study of counterfeiting in Gujarat State Anand Kumar Shrivastav, Dr. Ekata  Institute of Justice (NIJ) awarded five grants to study crime Abstract— For any police organization, detection and forecasting for police use as an extension of crime mapping prevention of crime incidents is a big challenge. Normally police [12]. The purpose of this paper was to probe the applicability organization maintains data of crime and criminals for various of univariate time series model for crime forecasting. This purposes like investigation, coordination and future strategies. paper, attempts to outline the practical steps which need to be The forecast made with the help of historical crime data may undertaken to use Box Jenkins ARIMA time series models support law enforcement agencies in their decision making activities and tactical operations. The time series is one of the for crime forecasting. The paper is organized as following: In tools for making prediction and autoregressive integrated Section 2, brief outline of time series forecasting, ARIMA moving average (ARIMA) model has been successfully used in and ARIMA modeling have been given. In section 3 we have forecasting econometrics, social science and many other applied ARIMA model on historical crime data of Gujarat problems. This model has the advantage of accurate forecasting State, pertaining to counterfeiting, to make short term over short-term. In this study, we have utilized the crime data forecasting. Section 4 is related to results and conclusions. of Gujarat State pertaining to counterfeiting of currency. We have used Box Jenkins ARIMA model for short term crime II. TIME SERIES FORECASTING forecasting. A time series is a set of data pertaining to the values of a Index Terms— ARIMA, Box Jenkins, Crime, Forecasting. variable at different times. A time series has an important property which makes it quite distinct from any other kind of statistical data. Formal examples of time series are the I. INTRODUCTION population of India at each successive decennial census; daily business handled by a bank, monthly production statistics of Despite the increased emphasis on proactive policing, the a steel mill; monthly traffic accident fatalities etc. The crime core of police work remains that of responding to calls for criminal data can be arranged in a regular fashion i.e. service, making effective deployment strategies paramount to monthly, quarterly, half yearly or yearly and can be assumed a well-functioning police department [12]. The crime to be a time series data. Time series analysis is used to detect forecasting is an emerging approach in criminological patterns of change in statistical information over regular research. Crime forecasting is not widely practiced by intervals of time. We project these patterns to arrive at an police. While there are numerous econometric studies of estimate for the future. The time series analysis helps us cope crime or incorporating crime in the literature, one is with uncertainty about the future [8]. All statistical hard-pressed to find police departments or other police forecasting methods are extrapolatory in nature i.e. they organizations making regular use of forecasting for policing. involve the projection of past patterns or relationships into From a more proactive standpoint, problem-oriented policing the future [3]. Time series forecast is one of the most efforts may be enhanced by a more accurate scanning of areas important tools for research in the field of social sciences and with crime problems, in that one can examine both engineering. Forecasting is an essential tool in any decision distributions of past crimes and predictions of future making process. The quality of the forecasts management can concentrations [8]. The ability to predict can serve as a make it strongly related to the information that can be valuable source of knowledge for law enforcement agencies, extracted and used from past data. both from tactical as well strategic perspectives. Forecasting can help a police department's performance by strategic A.) ARIMA (Autoregressive Integrated Moving Average) deployment efforts and efficient investigation direction. The origin of crime forecasting is in year 1998, when US National ARIMA model was introduced by Box and Jenkins (hence also known as Box-Jenkins model) in 1960. It is an Manuscript received May 15, 2012. extrapolation method for forecasting and, like any other such Anand Kumar Shrivastav, Research Scholar, Department of Computer Science, Mewar University, Chittorgarh, India, (e-mail: method, it requires only the historical time series data on the shrivastav.anand@gmail.com). variable under forecasting. ARIMA models are the most Dr. Ekata, Associate Professor, Department of Applied Science, Krishna general class of models for forecasting a time series. Institute of Engineering and Technology, Ghaziabad, India (e-mail: Normally, the ARIMA model is represented as ekata4@rediffmail.com). ARIMA(p,d,q) where p is the number of autoregressive . 494 All Rights Reserved © 2012 IJARCET
  • 2. ISSN: 2278 – 1323 International Journal of Advanced Research in Computer Engineering & Technology Volume 1, Issue 4, June 2012 terms, d is the number of non-seasonal differences, and q is functions (PACFs). The following table summarizes rough the number of lagged forecast errors in the prediction guideline for using these parameters in initial model equation. The identification of the appropriate ARIMA identification. model for a time series begins with the process of finding integer, usually very small (e.g., 0, 1, or 2), values of p, d, AR(p) MA(q) ARMA(p,q) and q that model the patterns in the data. When the value is 0, ACF Tails off Cuts off after Tails off the element is not needed in the model. The middle element, lag q d, also known as trend component is investigated before p PACF Cuts off after Tails off Tails off and q. The goal is to determine if the process is stationary lag p and, if not, to make it stationary before determining the values of p and q. The augmented Dickey–Fuller (ADF) test B.) Steps of ARIMA model is most widely used test for checking stationarity of a series. If d = 0, the model becomes ARMA, which is linear stationary model. ARIMA (i.e. d > 0) is a linear (i) Identification of ARIMA (p,d,q) structure non-stationary model. If the underlying time series is (ii) Estimating the coefficients of the formulation non-stationary, taking the difference of the series with itself (iii) Fitting test on the estimated residuals „d‟ times makes it stationary, and then ARMA is applied onto (iv) Forecasting the future outcomes based on the historical the differenced series. A stationary process has a constant data mean and variance over the time period. There are various methods available to make a time series stationary. Normally The steps of ARIMA model building methodology is differencing techniques are used to transform a time series presented in a flow chart in figure-1. from a non-stationary to stationary by subtracting each datum in a series from its predecessor. If a series is stationary without any differencing it is designated as I(0), or integrated of order 0. On the other hand, a series that has stationary first differences is designated I(1), or integrated of order 1. The term „shock‟ is used to indicate an unexpected change in the value of a variable (or error). For a stationary series a shock will gradually die away. In other words, the effect of a shock during time„t‟ will have a smaller effect in time„t+1‟, a still smaller effect in time„t+2‟, etc. The lags of the differenced series appearing in the forecasting equations are called “auto-regressive” terms. The auto-regressive components represent the memory of the process for preceding observations. The value of p is the number of auto-regressive components in an ARIMA (p, d, q) model. The value of p is 0 if there is no relationship between adjacent observations. (Fig.1: ARIMA model building steps) When the value of p is 1, there is a relationship between observations at lag 1 and the correlation coefficient is the magnitude of the relationship. When the value of p is 2, there III. BUILDING ARIMA MODEL FOR CRIME FORECASTING is a relationship between observations at lag 2 and the correlation coefficient is the magnitude of the The formulation of ARIMA model depends on the relationship. Thus p is the number of correlations we need to characteristics of the series. In this paper, we have used the model the relationship. The lags of the forecast errors are Crime data of counterfeit currency for past 8 years (96 called “moving average” terms. The moving average months) beginning year 2001 of Gujarat State (Table-1). The components represent the memory of the process for crime data has been taken from “Crime in India” , an annual preceding random shocks. The value q indicates the number publication of National Crime Records Bureau of moving average components in an ARIMA (p, d, q). When (www.ncrb.gov.in). We have utilized 93 months data for q is zero, there are no moving average components. When q analysis and 3 months data for validation of our forecasted is 1, there is a relationship between the current score and the results. We have used GRETL (Gnu Regression, random shock at lag 1 and the correlation coefficient θ1 Econometrics and Time-series Library) software for plotting represents the magnitude of the relationship. When q is 2, the graphs and analysis of the data set. there is a relationship between the current score and the random shock at lag 2, and the correlation coefficient θ2 (Table-1: Registered cases of Counterfeit Currency) represents the magnitude of the relationship. When one of the Year Month No. of Year Month No. of terms is zero, it‟s usual to drop AR, I or MA component. For Cases Cases example, I(1) model is ARIMA(0,1,0), and a MA(1) model 2001 Jan 2 2005 Jan 21 is ARIMA(0,0,1). The autocorrelation function (ACF) and Feb 6 Feb 90 partial correlation function (PACF) are very important for the Mar 4 Mar 75 definition of the internal structure of the analysed series. The Apr 4 Apr 37 models can be identified through patterns in their May 2 May 18 autocorrelation functions (ACFs) and partial autocorrelation 495 All Rights Reserved © 2012 IJARCET
  • 3. ISSN: 2278 – 1323 International Journal of Advanced Research in Computer Engineering & Technology Volume 1, Issue 4, June 2012 Jun 2 Jun 26 The non-stationarity was also confirmed with the help of Jul 5 Jul 19 Augmented Dickey Fuller (ADF) test. The first order Aug 3 Aug 22 differencing transformation was made to make the series Sep 3 Sep 29 stationary. The time series plot of original and differenced Oct 4 Oct 12 series are shown at figure-2a and Figure-2b respectively. The Nov 3 Nov 19 ACF and PACF plot of differenced series is shown at Dec 3 Dec 18 figure-3. 2002 Jan 6 2006 Jan 20 Feb 1 Feb 19 80 Mar 3 Mar 18 60 Apr 2 Apr 7 May 4 May 15 40 Jun 8 Jun 10 Jul 6 Jul 14 20 Aug 1 Aug 8 d_Cases Sep 5 Sep 5 0 Oct 7 Oct 15 Nov 4 Nov 16 -20 Dec 2 Dec 13 -40 2003 Jan 4 2007 Jan 17 Feb 3 Feb 21 -60 Mar 4 Mar 15 Apr 2 Apr 23 -80 2001 2002 2003 2004 2005 2006 2007 2008 May 3 May 25 Jun 2 Jun 35 (Fig.2b: Time series plot of differenced series) Jul 4 Jul 32 Aug 2 Aug 10 ACF for d_Cases Sep 82 Sep 16 0.4 +- 1.96/T^0.5 0.3 Oct 89 Oct 19 0.2 Nov 19 Nov 6 0.1 0 Dec 31 Dec 19 -0.1 -0.2 2004 Jan 19 2008 Jan 14 -0.3 -0.4 Feb 17 Feb 18 0 5 10 15 20 Mar 3 Mar 11 lag Apr 6 Apr 11 PACF for d_Cases May 2 May 10 0.4 +- 1.96/T^0.5 0.3 Jun 4 Jun 9 0.2 0.1 Jul 2 Jul 15 0 -0.1 Aug 4 Aug 11 -0.2 -0.3 Sep 3 Sep 11 -0.4 Oct 5 Oct 8 0 5 10 15 20 lag Nov 6 Nov 10 Dec 42 Dec 16 (Fig.3: Correllogram of differenced series) The Box-Jenkins‟s methodology for forecasting requires the series to be stationary. The time series plot and A.) Model Parameters Estimation correllogram of data provide a strong evidence of a non-stationary series. There are several model selection criteria like AIC (Akaike 90 information criteria), HQ (Hannan-Quinn criteria) and 80 SIC(Schwarz information criteria). Among the several 70 methods studied in the literature to judge the fitness of the 60 models, we used Akaike information criterion (AIC) [2]. According to this the model with least AIC value will be Cases of Counterfeiting 50 selected. We entertained six tentative ARMA models and 40 chose that model which has minimum AIC (Akaike 30 Information Criterion). We have used GRETL(Gnu 20 Regression, Econometrics and Time-series Library) software 10 for model identification and forecasting. Total 118 function evaluations and 35 gradient evaluations were performed by 0 2001 2002 2003 2004 2005 2006 2007 2008 GRETL to find the model parameters. The models and (Fig.2a: Time series plot of original series) corresponding AIC, HQ and BIC values are given in table-2. 496 All Rights Reserved © 2012 IJARCET
  • 4. ISSN: 2278 – 1323 International Journal of Advanced Research in Computer Engineering & Technology Volume 1, Issue 4, June 2012 100 It is obvious from table-2, that the ARIMA(1,1,1) is the best Cases forecast 95 percent interval model. 80 (Table-2: Alternative ARIMA model) 60 ARIM Akaike Hannan-Quinn Schwarz A criterion criterion criterion Model 40 101 762.8011 766.8915 772.9315 010 770.0137 771.0315 772.5354 20 110 770.9250 773.9785 778.4904 0 111 757.0627 761.1339 767.1498 211 758.9649 764.0539 771.5738 -20 210 768.3624 772.4336 778.4495 2005 2005.5 2006 2006.5 2007 2007.5 2008 2008.5 2009 (Fig.5: Plot of actual and forecast data) B.) Model verification IV. CONCLUSION The model verification is concerned with checking the residuals of the model to see if they contain any systematic Crime forecasting is an interesting application area of pattern which still can be removed to improve on the chosen research and ARIMA model offers a good technique for ARIMA. This is done through examining the predicting the magnitude of any variable. Its strength lies in autocorrelations and partial autocorrelations of the residuals the fact that the method is suitable for any time series with of various orders. For this purpose, the various correlations any pattern of change and it does not require the forecaster to upto 16 lags were computed and the same along with their choose a priori the value of any parameter. Its limitations significance were tested by Q-statistics. It was observed that, include its requirement of a long time series. In our study the none of these correlations is significantly different from zero developed model for crime forecasting was found to be at a reasonable level. The ACF and PACF of the residuals ARIMA (1,1,1). The validity of the forecasted values was (Figure-4) also indicate good fit of the model. This proves checked with the actual data for the lead periods and it is that the selected ARIMA model is an appropriate model. established that the ARIMA model can be used by researcher Residual ACF 0.3 for short time forecasting of crime in India. +- 1.96/T^0.5 0.2 0.1 REFERENCES 0 -0.1 -0.2 [1] Ajoy K. Palit and Dobrivoje Popovic; “Computational Intelligence in -0.3 Time Series Forecasting Theory and Engineering Applications”, 2005, 0 2 4 6 8 10 12 14 16 Springer lag [2] Akaike, H., 1974, “A new look at the statistical model identification,” Residual PACF IEEE Transactions on Automatic control, 19(6), 716-723. 0.3 +- 1.96/T^0.5 [3] Alan Pankratz, “Forecasting With Univariate Box- Jenkins Models, 0.2 Concepts and cases”, John Wily & Sons 1983. 0.1 0 [4] Bowerman, B. L., Connell, R. T., and Koehler, A B; “Forecasting, -0.1 Time Series, and Regression: An Applied Approach”, Thomson, -0.2 Belmont, CA 2005. -0.3 [5] Box G E, Jenkinks G M; “Time series analysis: Forecasting and 0 2 4 6 8 10 12 14 16 lag control”; Holden Day, San Francisco [6] C.B. Gupta, Vijay Gupta; “An Introduction to Statistical Methods”, (Fig.4: Correllogram of residuals) Vikas Publishing House; 295-336. [7] Dickey D A, Fuller W A; “Distribution of the estimators for C.) Forecast autoregressive time series with a unit root”; Journal of the American Statistical Association 74, 427-431. [8] Groff, Elizabeth R., and La Vigne, Nancy G., : “Forecasting The The results show in table-3 and Figure-5 indicate that the Future Of Predictive Crime Mapping, Crime Prevention Studies”, vol. forecasting model selected is appropriate, therefore the 13, pp. 29-57 (2002) model can be used for forecasting crime in India. [9] Loganathan, Nanthakumar and Yahaya Ibrahim; “Forecasting International Tourism Demand in Malaysia using Box Jenkins Sarima Application”; South Asian Journal of Tourism and Heritage (2010), (Table-3: Forecast data) Vol. 3, Number 2 For 95% confidence intervals, z(0.025) = 1.96 [10] Richard I Levin, David S Rubin; “Statistics for Management”; Pearson Obs Cases prediction std. err. 95% interval Prentice Hall, 861-919. [11] S. Fan, L. Chen and W. J. Lee, "Short-term load forecasting using 2008:10 undefined 15. 50 13.93 (-11.81, 42.81) comprehensive combination based on multimeteorological 2008:11 undefined 18.17 16.10 (-13.39, 49.73) information," IEEE Trans. Industry Applications, Vol. 45, No. 4, 2008:12 undefined 19.78 16.77 (-13.09, 52.64) July/Aug. 2009. [12] Wipen Gorr, Richard Harries; “Introduction to Crime forecasting”; International Journal of Forecasting 19, 2003, 551-555. 497 All Rights Reserved © 2012 IJARCET