SlideShare a Scribd company logo
1 of 12
Download to read offline
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –225
PREDICTION OF STORM DISASTER USING CLOUD
MAP-REDUCE METHOD
ARJU JINDAL
M.Tech Scholar Computer Science and Engineering Department,
BGIET Sangrur, Punjab, India
arzoojindal@gmail.com
Er. YOGESH KUMAR
Assistant Professor, Computer science and Engineering Department,
BGIET Sangrur, Punjab, India
yksingla37@gmail.com
Manuscript History
Number: IJIRAE/RS/Vol.05/Issue06/JNAE10084
Received: 03, June 2018
Final Correction: 10, June 2018
Final Accepted: 19, June 2018
Published: June 2018
Citation: JINDAL & KUMAR (2018). PREDICTION OF STORM DISASTER USING CLOUD MAP-REDUCE METHOD,
IJIRAE:: International Journal of Innovative Research in Advanced Engineering, Volume V, 225-236.
doi://10.26562/IJIRAE.2018.JNAE10084
Editor: Dr.A.Arul L.S, Chief Editor, IJIRAE, AM Publications, India
Copyright: ©2018 This is an open access article distributed under the terms of the Creative Commons Attribution
License, Which Permits unrestricted use, distribution, and reproduction in any medium, provided the original author
and source are credited
Abstract: Data mining is the process of analyzing data from different perspectives and summarizing it into useful
information the patterns, associations, or relationships among all this data can provide information. Spatial Data
Mining (SDM) is the part of data mining. It is mainly used for finding the figure in data that is related to space. In
spatial data mining, to get the result analyst use geographical or spatial information which require some special
technique to get geographical data in the appropriate formats. SDM is mainly used in earth science data, crime
mapping, census data, traffic data, and cancer cluster (i.e. to investigate environment health hazards). For real time
processing, Stream data mining is used for the prediction of storm using spatial dataset with the help of stream data
mining strategy i.e. CMR (Cloud Map-Reduce). In this, Stream data mining is presented to detect the storm disaster
of coastal area which is located in Central America country and then by taking the dataset of coastal area which are
having various regions. There are various regions i.e. Panama, Greater Antilles, Mexico golf which is used to detect
the storm. It also detects that, is the region is affected or not, if affected then which area of that region is affected and
from this it is helpful to predict the storm before the disaster occur. In this, two parameters are used to test the
technique i.e. processing time and computational load and using this parameter compare the previous and proposed
technique. The main aim of this is to predict storm disaster before it happen with the help of directionality and
velocity of storm.
Keywords: Spatial data mining, spatial data, stream data mining, cloud map-reduce, location prediction
I. INTRODUCTION
Spatial Data Miningbasically used for detects the characteristics and relationship which are stored in spatial
database. There is a massive amount of spatial data which capture from satellite images, medical equipment, video
cameras, etc. Spatial data mining is the part of data mining. Spatial data mining is mainly used for finding the pattern
in data which is related to geography [1]. In spatial data mining, to get the result analyst use geographical or spatial
information which require some special technique to get geographical data in the appropriate formats. There are
two types of data that kept in spatial database i.e. raster data (satellite, aerial/digital images) and vector data (lines,
points, polygons).
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –226
A. Spatial Data Mining Task:
Spatial data mining has the following six tasks which are briefly explained below [2]:
1. Classification: In spatial data mining, classification is a machine learning technique used to predict the set of
rules which helps to find the class of the classified object according to its attribute.
2. Association Rules: It helps to provide the rules correlate the presence of a set of items with another set of item.
3. Characteristics Rules: It is used to explain certain portion of the database.
4. Discriminate Rules: It is used to explain the contrast between any two portions of the database.
5. Clustering: Clustering is an automatic process of grouping related records together.
6. Trend Detection: It is used to discovering most significant changes in the database.
B. Spatial Data Mining Techniques
In spatial data mining there are various techniques which are discuss below [2]:
1. Clustering and Outlier Detection: In this cluster analysis divides data into meaningful or useful groups. It is very
useful in spatial database. The collection of clusters is known as clustering.
Spatial trend detection is a technique for discovering most significant changes in the database. In this clustering
algorithm is used to deal with geographical datasets which are of the following forms:
 Partitioning Method: In this partitioning algorithm is used to arrange the object into cluster so that deviation
of cluster from its Centre should be minimized.
 Hierarchical Method: In this, datasets are breakdown until stopping standard is meeting. Breakdown is by
merging and splitting all clusters.
 Density-Based Method: Density based method is helpful to screen out noise and trend detection which is differ
from partitioning method that is helpful to find out cluster of arbitrary shapes.
 Grid-Based Method: In this, initially subdivide the cluster into number of cells after that do some operations
upon the grid structure. The benefit of this method is having the fast processing time.
2. Association and Co-Location: According to the non-spatial attributes is helpful to detect the characteristics rules
and spatial objects while doing clustering method on data.
3. Classification: Classification is a machine learning technique which is used to create the model, from that model
whole dataset is analyzed.
4. Trend Detection: It is used to discovering the most significant change that take place in database.
C. Issues of Spatial Data Mining
There are the various issues of spatial data mining that are described as:
1. Spatial data mining algorithm can relocate from the data mining algorithm which are not supposed to be for
storage of spatial data, its processing and characteristics of spatial data itself. Spatial data mining algorithm can
provide knowledge which is not much refined due to the efficiency of spatial data mining which is not so high.
Spatial data mining has various method and task but it is applicable only for specific purpose. Therefore,
extraction of knowledge from this is finite [3]. In spatial data mining there is no query language. So to improve
the spatial data mining, develop the SDM query language to demonstrate the effective spatial data mining.
2. If there is a linkage between spatial and temporal having the relevant information like distance, direction, before
and after, topology which is to be taken in spatiotemporal data analysis and mining [4]. Those attributes should
be considered that contain nearby pattern which having the important effect on the pattern. Interpretation of
spatial patterns, data structure to define and effective spatial datasets are also challenging issues.
3. Spatial object mainly arranged in the ascending and descending Oder using hierarchy so that it is easy to present
the spatial object at different level of source [5]. Spatial patterns contain the spatial object which can be mounted
but they are not much secure. Predictive tasks are developed by inductive learning algorithm that needs large set
of labeled data. So, very large data can be classified and only few data can be mined and it is quite expensive.
4. The dimensionality is the most important issue in time series domain when number of dimensions is low [6].
INTRODUCTION TO STREAM DATA MINING
In recent year, advances in hardware technology have facilitated new ways of collecting data continuously.
Tremendous and potentially infinite volumes of data streams are often generated by real –time surveillance
systems, communication networks, Internet traffic, on-line transactions in the financial market or retail industry,
electric power grids, industry production processes, scientific and engineering experiments, remote sensors, and
other dynamic environments. Data Stream mining is the process of extracting knowledge structures from such
continuous, rapid data records. Mining data streams raises new problems for the data mining community about how
to mine continuous high-speed data items that you can only have one look at. Due to this reason, traditional data
mining approach needs to be changed and to discover knowledge or patterns from data streams , it is necessary to
develop single-scan, on-line , multilevel , multi-dimensional stream processing and analysis methods.
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –227
Stream data mining is concerned with extracting knowledge from non-stopping and continuous stream of
information [7]. A data stream is a massive, infinite, temporally ordered, continuous and rapid sequence of data
elements such as customer click streams, supermarket, telephone records, stock market, meteorological research,
multimedia data, scientific and engineering experiments and sensor data. Some of issues of data stream like
dynamic nature, Infinite size and high speed, unbounded memory requirements, Lack of global view, handling the
continuous flow of data impose a great challenge for the researchers dealing with streaming data.
In data stream model, data items can be relational tuples like network measurements and call records. In
comparison with traditional data sets, data stream flows continuously in systems with varying update rate. Data
streams are continuous, temporally ordered, fast changing, massive and potentially infinite. Due to huge amount and
high storage cost, it is impossible to store an entire data streams or to scan through it multiple times. So it makes so
many challenges in storage, computational and communication capabilities of computational systems [8]. Because of
high volume and speed of input data, it is needed to use semi-automatic interactional techniques to extract
embedded knowledge from data. Data stream mining is the extraction of structures of knowledge that are
represented in the case of models and patterns of infinite streams of information. The general process of data
stream mining is depicted in Fig 1 below:
Fig 1: General process of data stream mining [8]
A. Data Stream Method
There are various kinds of method which are used for processing large amount of big data using stream processing
which are explained below [9]:
1. Apache Storm:Apache Storm is for managing unbounded data streams over a distributed real-time streaming,
developed by twitter. Mainly it varies from map-reduce is due to maintaining storm topologies. For example, a
stream of tweets from twitter can be transformed into stream of trending topics. The two main components of
Storm are tuples and nodes as shown in Fig 2
Fig 2: Storm Architecture [9]
Problem: In Storm even though it can process a million tuples per second per node. But, at node it hold the data to
send upon the acknowledge had to be received it is a Delay for data processing for stream data & it cannot handle
Stragglers.
2. Apache Kafka: Apache Kafka is a pub/sub system developed by LinkedIn. Which describes the partitioned for
data and it provides replication & distribution based of log-based service. Kafka maintains the topics for
consumer to feed messages and subscribing the topics will publish message feeds as producer due to the
requests & responses of Kafka cluster as shown in Fig 3.
Problem: In Kafka model, based on the pub/sub system it maintains cluster & handle. Faults but in high-level
latencies it cannot handle faults & stragglers issues.
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –228
Fig 3: Apache Kafka Architecture [9]
3. Apache Spark: Apache Spark is a stream processing engine for fast & large-scale processing of stream data. Spark
introduces D-Streams (Discretized streams) for processing the stream data using RDD based on in-memory
processing concept. Here RDD means Resilient Distributed Datasets which process the stateless data work sets
for handling fault-tolerance. In spark these RDD manages the streaming computations based on batches of
results due to some speculation & handles Straggler using RDD as shown in Fig 4.
Fig 4: Apache Spark Architecture [9]
Spark Streaming providing the features for unified & batch processing as follows: First, for the spark job we can
combine D-streams with RDDs computations. Second, using batch processing mode we run D-streams on past
historical data. Means streaming report on past data. Third, using Scala console D-streams run ad-hoc queries.
Problem: Spark is better than all stream processing engine and it can handle faults & stragglers, but using RDD in-
memory concept it processing low streaming compare to FLINK streaming.
4. Apache Flink:Apache Flink is component of alternative approach for Hadoop map-reduce style for data
processing system. It comes with its own runtime, rather than building on top of Map-Reduce. It is independent
for the ecosystem of Hadoop. Therefore, in-built Hadoop cluster & YARN support is there for flink to processing
the data in map-reduce style and also in iterative & intensive stream processing as shown in Fig 5.
Fig 5: Apache Flink Architecture [9]
5. Map-Reduce: Map-Reduce is a Framework, which consists of basically two phases Map & Reduce with process the
data using batch processing style with applying sort & shuffle functionalities of data. Map-Reduce consists of two
separate and distinct phase:
 Mapper: A function from an input pair of key/value to a list of intermediate pairs of key/value
Map: (key-input, value-input) List (key-map, value-map)
 Reduce: A function from an intermediate pairs of key/ value to a list of output pairs of key/value13.
Reduce: (key-map, list (value-map)) list (key-reduce, value-reduce) as shown in Fig 6.
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –229
Fig 6: Flow of Map-Reduce Architecture [9]
Problem: In the Map-reduce framework the problem is when there is a data processing using batch processing, it
cannot handle the multiple & short-live queries for requests and responses & also it cannot handle the Stragglers.
6. Hadoop: Hadoop is used for storing and processing big data. Apache Hadoop is a distributed computing
framework modelled after Google Map-Reduce to process large amounts of data in parallel. Once in a while, the
first thing that comes to my mind when speaking about distributed computing is EJB. EJB is de facto a component
model with remote capability but short of the critical features being a distributed computing framework that
include computational parallelization, work distribution, and tolerance to unreliable hardware and software. In
general, data flows from components to components in an enterprise application. This is the case for application
frameworks (EJB and spring framework), integration engines (Camel and Spring Integration), as well as ESB
(Enterprise Service Bus) products. Nevertheless, for the data-intensive processes Hadoop deals with, it makes
better sense to load a big data set once and perform various analysis jobs locally to minimize IO and network
cost, the so-called "Move-Code-To-Data" philosophy. When you load a big data file to HDFS, the file is split into
chunks (or file blocks) through a centralized Name Node (master node) and resides on individual Data Nodes
(slave nodes) in the Hadoop cluster for parallel processing. The structure of Hadoop is shown in Fig 7 below:
Fig 7: Structure of Hadoop [9]
II. LITERATURE REVIEW
Refonna et al. in this paper, author discuss the analysis and prediction of natural disaster by using spatial data
mining techniques [1]. In this basically map-reduce framework is used to predict the disaster like earthquake, storm
identification, rainfall etc. It also used the big data concept which contain two services first is Hadoop software and
related hardware second is No SQL database software and related hardware.
Wang et al. in this, spatial pattern analyse with time series domain from 3D images. Space filling curve is used when
traversing an image. Analyse the spatial sequence in the transformed domain to produce different patterns [6]. In
this various techniques was used to analyse the time series i.e. Euclidean distance, reduction technique, singular
value, piecewise aggregate approximation etc. fMRI dataset can be analysed.
Mohamed MedhatGaber, et al.in this paper author explains the stream data mining. In stream data mining there are
two kinds of solution based on problems and challenges i.e. data based solution which is used to summarize the
whole dataset or choosing a subset of the incoming stream to be analysed [10]. Second is task based solution which
is used to modify the existing techniques or invent new ones in order to address the computational challenges of
data stream processing. It also presents the mining techniques and research issues.
Pereyra Zamora et al.in this paper author discuss the spatial data mining technique is applicable to spatial load
forecasting methods. It is used in distribution system [11]. Fundamental stage of KDD process is data preparation.
Neural network is efficient in spatial data mining tasks.
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –230
Elena Ikonomovksa, et al.in this paper author presents the introduction of stream data mining also the theoretical
foundations of data stream analysis and identifies potential directions of future research [12]. Mining data stream
techniques are being critically reviewed. Mining data streams is an immature, growing field of study. There are
many open issues that need to be addressed. It improve the real-time decision making process in almost every area
of our life.
Sumathi et al.in this, spatial data mining algorithm is divided into four categories, first is clustering and outlier
detection second is association and co-location third is classification and fourth is trend detection [2].Clustering is
an unsupervised method. There are a number of clusters presented in data set. The cluster results are needed for
validation.
Aakunuri et al.author describes the various kind clustering algorithms used for spatial data mining. First is
Partitioning around Medoids (PAM) and second is clustering large application (CLARA). Both algorithms are
inefficient in terms of complexity [13]. After that new algorithm is used to overcome this problem i.e. clustering
large application based on Randomized search (CLARANS) is designed for cluster analysis. By using this, two
algorithms are designed for spatial data mining i.e. spatial dominant approach (SD CLARANS) and other is non-
spatial dominant approach (NSD CLARANS).
SvitlanaVolkovaauthor proposed the algorithm, method and frameworks for processing and analysing the real time
data stream [14]. It also discusses the difference between batch and stream mode learning and analyse the learning
framework which include MOA and Jerboa. In this application for learning from social media streams is also
explained.
Otavio M. de Carvalho, et al.this paper reviews the state-of-the-art in event processing systems. A set of the most
significant weaknesses and limitations is discussed at a high level, and also outline requirements that a system
should meet to excel at a variety of event processing applications. It also discussed the evolution in the event
processing field [15]. It shows the path of development of these tools has changed since by integrating the Map-
Reduce model into the event processing domain.
Jayasinghe et al.author describe the spatial data mining techniques is mainly used to find the forests extent changes.
In spatial data mining satellite image were used to get knowledge about any change in forests. In this two
approaches are used to derive the thematic maps i.e. supervised and unsupervised [16]. Back propagation algorithm
is used to detect the forests changes and update existing forests maps
Georg Krempl, et al.in this paper presents a discussion on eight open challenges for data stream mining which is used
to identify gaps between current research and meaningful applications, highlight open problems, and define new
application-relevant research directions for data stream mining [17]. The resulting analysis is illustrated by
practical applications and provides general suggestions concerning lines of future research in data stream mining.
III. EXISTING METHOD
Map-Reduce: Map-Reduce is a Framework, which consists of basically two phases Map & Reduce with process the
data using batch processing style with applying sort & shuffle functionalities of data. Map-Reduce consist of two
separate and distinct phase:
1. Mapper:A function from an input pair of key/value to a list of intermediate pairs of key/value
Map: (key-input, value-input) List (key-map, value-map)
2. Reduce: A function from an intermediate pairs of key/ value to a list of output pairs of key/value13.
Reduce: (key-map, list (value-map)) list (key-reduce, value-reduce) as shown in Fig 8.
Fig 8: Flow of Map-Reduce Architecture
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –231
Problem: In the Map-reduce framework the problem is when there is a data processing using batch processing, it
cannot handle the multiple & short-live queries for requests and responses & also it cannot handle the Stragglers.
This algorithm is not much appropriate due to lack of scalability and does not support stream data mining.
IV. PROBLEM FORMULATION
Problem Statement
After doing a literature review, it has been found that data mining of spatial data can be very helpful in solving the
problem of predicting natural disasters based on spatial data mining techniques. Spatial Data is geo location based
data which provides us information about natural disasters. In recent work Map-Reduce technique is used for
spatial data mining. It consists of basically two phases Map & Reduce that process the data using batch processing
style with applying sort & shuffle functionalities of data. The problem faced in recent work is, the processing of Map
and Reduce Phases is sequential, Limited scalability since it is a cluster based, Do not support flexible estimation,
Stream data processing is not supported. Therefore, in proposed work new technique is presented i.e. Cloud Map-
Reduce which is based on the concept of stream data mining. This technique is helpful in mining real-time data. This
technique will help to reduce computing load and processing time as processing of big data will be performed on
cloud.
Methodology
1. Studying the existing stream mining techniques.
2. Developing a stream mining technique specific to spatial data mining.
3. Implement proposed technique on MATLAB.
4. Compare the previous technique with the proposed technique.
V. PROPOSED METHOD
Cloud Map-Reduce
Cloud Map-Reduce (CMR) is a framework for processing large dataset in cloud. In CMR, it support streaming data as
input using pipelining between the Map and Reduce phases ensuring that the output of the map phase is made
available to the reduce phase as soon as it is produced. This CMR approach leads to increase the parallelism
between the Map and Reduce phases as follows:
1. Supporting streaming data as input
2. Reduce delays
3. Improve processing time and computational load
4. Real time processing
5. Provide flexibility and scalability
The architecture of CMR consist of one input queue, multiple reduce queue which act as a staging areas for holding
the intermediate (key, value) pairs produced by the mapper, a master reduce queue that holds the pointers to the
reduce queues, and the output queue that holds the final result. The flow of cloud map-reduce architecture is shown
in Fig 9:
Fig 9: Flow of Cloud Map-Reduce Architecture
Inputs And Outputs Of Cloud Map-Reduce
The Cloud Map Reduce framework operates on key-value pairs, that is, the framework views the input to the job as a
set of key-value pairs and produces a set of key-value pair as the output of the job, conceivably of different types.
The key and value classes have to be serializable by the framework and hence, it is required to implement the
Writable interface. Additionally, the key classes have to implement the Writable Comparable interface to facilitate
sorting by the framework. Both the input and output format of a Cloud Map Reduce jobs are in the form of key-value
pairs as shown in the table1 below:
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –232
Table 1: I/O for Computing Portioning Function
VI. PROPOSED ALGORITHM
Data mining is the process of analysing data from different perspectives and summarizing it into useful information
the patterns, associations, or relationships among all this data can provide information. There is a huge amount of
data available in the Information Industry. This data is of no use until it is converted into useful information. Spatial
data mining is mainly used for finding the pattern in data which is related to geography. Spatial dataset is used to
predict the storm disaster using the stream data mining strategy i.e. cloud map reduce. For the detection of storm
general cloud map-reduce algorithm is as follows:
Step 1: Initialize system
Step 2: ObjMapv= Initialize Map Values;
Step 3: ObjRedv= Initialize Reduce Values;
Step 4: for i=1 to all dimensions (1)
Step 5: if (Map (i) ==ID // ID is interest dimension
Step 6: FinalMap=ID;
Step 7: end if
Step 8: end for
Step 9: Reducevaluecount=0
Step 10: for 1=1 to all dimension (FinalMap)
Step 11: if (Red (i) ==IV) // IV is the interest dimension
Step 12: FinalRed(ReduceValueCount)=IV
Step 13: end if
Step 14: end for
Step 15: if (FinalRed==FinalOutput)
Step 16: end SystemCycle
Step 17: else
Step 18: Go to Step 9
Step 19: end if
VII. FLOW CHART OF PROPOSED TECHNIQUE
Fuction Input (key,value) Output (key, value)
cloud (location+dname) (dname+(dcol,drow))
map (dname+(dcol,drow)) (dcol(item.id),drow(item.id))
reduce (dcol(item.id),drow(item.id)) (item.id),drow(item.id.sq.id))
result drow(item.id.sq.id).res Res(Final Result)
Start
Take spatial data
set
Read image
Image processing
Apply CMR
Technique
Store Down sample
image in map-store
Storm Detection
Mapping
Reduction
Storm prediction
Generate result
Compare result
End
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –233
VIII. PARAMETER USED
In this to compare the performance of both techniques we use two parameters which are briefly explained below:
1. Processing Time: The time taken to process the spatial images in this is known as processing. Here the processing
time is parameter which is used to compare the performance of previous and proposed techniques.
2. Computational Load: In the computer system, the system load is a measure of computational work that a
computer system performs. It simply checks the load in the operating system and also by using technique it
compare the load on each technique.
IX. RESULT ANALYSIS
Spatial Data Mining basically used to detect the characteristics and relationship which are stored in spatial database.
There is a massive amount of spatial data which capture from satellite images, medical equipment, video cameras,
etc. Data Stream mining is the process of extracting knowledge structures from such continuous, rapid data records.
A technique is presented for prediction of storm disaster area using spatial images with the help of stream data
mining, there are images of different regions such as Panama, Greater Antilles, Mexico golf etc. are taken and
according to proposed technique storm disaster of each area is predicted. The results of proposed technique are
shown in the different images below.
Result
In these all figures there are different regions to find the storm disaster area. In Fig 10(a) shows the region to find
storm disaster area, in Fig 10(b) image with red dots called detected storm disaster area and next one Fig 10(c)
shows the predicted storm disaster area. These steps are performed in images of different regions. Fig 10(a) it show
region to find storm prone area (Greater Antilles), Fig 10(b) shows detected storm prone area (Greater Antilles), Fig
10(c) shows predicted storm prone area from region (Greater Antilles)
Fig 10(a, b, c)
Fig 11(a) it show region to find storm prone area (Mexico Golf), Fig 11(b) shows detected storm prone area (Mexico
Golf), Fig 11(c) shows predicted storm prone area from region (Mexico Golf)
Fig 11 (a, b, c)
Fig 12(a) it show region to find storm prone area (Panama), Fig 12(b) shows detected storm prone area (Panama),
Fig 12(c) shows predicted storm prone area from region (Panama)
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –234
Fig 12(a, b, c)
In Fig 13 graph shows the comparison of centroid mean location of x-axis to the y-axis with parameters location and
mean location.
Table 2: Values of mean location vs. location
Fig 13: Graph representation of Mean Location vs. Location
In Fig 14 graph shows the comparison of centroid variance over location of x-axis to the y-axis with parameters
location and deviation.
Fig 14 Graph representation of Deviation vs. Location
Centroid
Mean
Location
x-axis
(Location)
y-axis
(Mean
Location)
1 2 203
2 2.9825 117.5263
3 1.5000 38.5000
4 27.8019 31.9245
5 26.6486 34.3019
6 17.2736 45.5094
7 16.7547 37.5849
8 1 1
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –235
Centroid Variance over Location x-axis (Location) y-axis ( Deviation)
1 0 0
2 1.00873379027825 87.7598397542076
3 0 0
4 38.3654588457129 9.59136471142823
5 37.8936569849650 20.5203262131073
6 28.9484860391625 24.8675144617927
7 28.9172086126574 19.7039067622951
8 0 0
Table 3: value table deviation vs. location
Comparison between Both Techniques Using The Parameter
During the preliminary study, to compare the performance of both techniques parameters are used i.e. processing
time and computational load. In Fig 15 show the Comparison between previous and proposed technique w.r.t
processing time. The value of Processing Time for proposed work is 11.7627 seconds and for previous work is
13.7966 seconds
Fig 15 Graph representation of processing time previous and proposed work
In Fig 16 show the Comparison between previous and proposed technique w.r.t Computational Load. The values of
Computational load for proposed work is 1070.4021 and for previous work are 1255.492
Fig 16 Graph representation of Computational Load previous and proposed work
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 06, Volume 5 (June 2018) www.ijirae.com
___________________________________________________________________________________________________
IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 |
ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 18, All Rights Reserved Page –236
X. CONCLUSION
Spatial data mining is the process of discovering interesting and previously unknown, but potentially useful patterns
from large spatial datasets. Extracting interesting and useful patterns from spatial datasets is more difficult than
extracting the corresponding patterns from traditional numeric and categorical data due to the complexity of spatial
data types, spatial relationships, and spatial auto correlation. We have presented a technique for prediction of storm
prone area using spatial images with the help of spatial data mining, there are images of different regions are taken
such as Panama, Greater Antilles, Mexico golf and according to our proposed method we have to find the storm
prone area of each region in image. The results of proposed technique are shown in the different images in result
section. We have taken spatial images of different gulf areas after the processing of images we get to know the
detected storm prone area and on the basis of detected storm calculations are made and with the help of
directionality and speed of storm we are able to predict storm prone area. In the future we can develop the
technique to predict the natural disasters like Storms, Tornados and Tsunami etc. and the developed technique is
working with gulf spatial images but it can be tested with spatial images of plain areas.
REFERENCE
1. J.Refonaa, Dr. M. Lakshmi, V.Vivek, “Analysis And Prediction Of Natural Disaster Using Spatial Data Mining
Technique,” [ICCPCT] International Conference on Circuit, Power and Computing Technologies, 2017.
2. N.Sumathi, R.Geetha, Dr.S.SathiyaBama, “Spatial Data Mining - Techniques Trends and Its Applications”.
3. HailiangJin,Baoliang Miao, “The Research Progress of Spatial Data Mining Technique”.
4. D.Malerba, “Mining Spatial Data: Opportunities and Challenges of a Relational Approach,”Dipartimento di
Informatica, Universit`adegliStudi di Bari, Via Orabona 4, I-70126 Bari, Italy.
5. K. KotaiahSwamy, E. Ravi Kumar, “Investigation and Prediction of Natural Disaster With Spatial Data Mining
Technique,” International Journal of Current Trends in Engineering & Research (IJCTER) e-ISSN 2455–1392
Vol.2 Issue 5, May 2016.
6. Qiang Wang, DespinaKontos, Guo Li, VasileiosMegalooikonomou, “Application of time series techniques to data
mining and analysis of spatial patterns in 3D images”
7. Twinkle B Ankleshwaria and J. S Dhoni “Mining Data Stream: A Survey” International Journal of Advance
Research in Computer Science and Management Studies. Vol.2, Issue 2, February 2014.
8. MahnooshKholghi,MohammadrezaKeyvanpour, “An Analytical Framework for Data Stream Mining Techniques
Based On Challenges and Requirements,”International Journal of Engineering Science and Technology (IJEST).
9. B.V.S Srikanth,V. Krishna Reddy, “Efficiency of Stream Processing Engines for Processing BIGDATA Streams,”
Indian Journal of Science and technology, Vol.9(14) April 2016.
10. Mohamed MedhatGaber,ShonaliKrishnaswamy and ArkadyZaslavsky, “Cost-Efficient Mining Techniques for
Data Streams,”2004.
11. F.H.PereyraZamora,C.M.V.Tahan, “Data Mining Techniques Applied To Spatial Load Forecasting,” 18th
International Conference on Electricity Distribution, Turin, 6-9 June 2005.
12. Elena Lkonomovska, suzanaLoskovska, DejanGjorgjevik, “A Survey of Stream Data Mining,” 2007.
13. ManjulaAakunuri, Dr.G.Narasimha, SudhakarKatherapaka, “Spatial Data Mining: A Recent Survey and New
Discussions,” (IJCSIT) International Journal of Computer Science and Information Technologies, ISSN: 0975-
9646, Vol.2 (4), 2011, 1501-1504.
14. SvitlanaVolkova, “Data Stream Mining: A Review of Learning Methods and Frameworks,” October 12, 2012.
15. Otavio M. De Carvalho, Eduardo Roloff and Philippe O.A. Navaux, “A Survey of the State-of-the-art in Event
Processing,” 11th workshop on parallel and Distributed Processing , 2013.
16. P.K.S.C.Jayasinghe, Masao Yoshida, “Spatial data mining technique to evaluate forest extent changes using GIS
and Remote Sensing,” International Conference on Advances in ICT for Emerging Regions (ICTer): 222 – 227,
2013.
17. Georg krempl, IndreZliobaite, Dariusz Brzezinski, “Open Challenges for Data Stream Mining Research,” SIGKDD
Vol.16, Issue 1, 2013.

More Related Content

What's hot

Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...IOSRjournaljce
 
Integrating Web Services With Geospatial Data Mining Disaster Management for ...
Integrating Web Services With Geospatial Data Mining Disaster Management for ...Integrating Web Services With Geospatial Data Mining Disaster Management for ...
Integrating Web Services With Geospatial Data Mining Disaster Management for ...Waqas Tariq
 
A Novel Data mining Technique to Discover Patterns from Huge Text Corpus
A Novel Data mining Technique to Discover Patterns from Huge  Text CorpusA Novel Data mining Technique to Discover Patterns from Huge  Text Corpus
A Novel Data mining Technique to Discover Patterns from Huge Text CorpusIJMER
 
Data mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cubeData mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cubeIJECEIAES
 
Incomplete Data and Technological Progress in Energy Storage Technologies
Incomplete Data and Technological Progress in Energy Storage TechnologiesIncomplete Data and Technological Progress in Energy Storage Technologies
Incomplete Data and Technological Progress in Energy Storage TechnologiesBert van Dorp
 
Data Imputation by Soft Computing
Data Imputation by Soft ComputingData Imputation by Soft Computing
Data Imputation by Soft Computingijtsrd
 
A quick overview of geospatial analysis
A quick overview of geospatial analysisA quick overview of geospatial analysis
A quick overview of geospatial analysisMd.Farhad Hossen
 
Active learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classificationActive learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classificationPioneer Natural Resources
 
New Research Articles 2019 April Issue International Journal on Computational...
New Research Articles 2019 April Issue International Journal on Computational...New Research Articles 2019 April Issue International Journal on Computational...
New Research Articles 2019 April Issue International Journal on Computational...ijcsa
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...theijes
 
Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932Editor IJARCET
 
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...ertekg
 
Data repository for sensor network a data mining approach
Data repository for sensor network  a data mining approachData repository for sensor network  a data mining approach
Data repository for sensor network a data mining approachijdms
 
Geographic query and analysis
Geographic query and analysisGeographic query and analysis
Geographic query and analysisMohsin Siddique
 
5A_ 2_Developing a statistical methodology to improve classification and mapp...
5A_ 2_Developing a statistical methodology to improve classification and mapp...5A_ 2_Developing a statistical methodology to improve classification and mapp...
5A_ 2_Developing a statistical methodology to improve classification and mapp...GISRUK conference
 
Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2University of Salerno
 

What's hot (19)

Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...
 
Integrating Web Services With Geospatial Data Mining Disaster Management for ...
Integrating Web Services With Geospatial Data Mining Disaster Management for ...Integrating Web Services With Geospatial Data Mining Disaster Management for ...
Integrating Web Services With Geospatial Data Mining Disaster Management for ...
 
Aa31163168
Aa31163168Aa31163168
Aa31163168
 
A Novel Data mining Technique to Discover Patterns from Huge Text Corpus
A Novel Data mining Technique to Discover Patterns from Huge  Text CorpusA Novel Data mining Technique to Discover Patterns from Huge  Text Corpus
A Novel Data mining Technique to Discover Patterns from Huge Text Corpus
 
4 2-13-397
4 2-13-3974 2-13-397
4 2-13-397
 
Data mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cubeData mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cube
 
Incomplete Data and Technological Progress in Energy Storage Technologies
Incomplete Data and Technological Progress in Energy Storage TechnologiesIncomplete Data and Technological Progress in Energy Storage Technologies
Incomplete Data and Technological Progress in Energy Storage Technologies
 
Data Imputation by Soft Computing
Data Imputation by Soft ComputingData Imputation by Soft Computing
Data Imputation by Soft Computing
 
A quick overview of geospatial analysis
A quick overview of geospatial analysisA quick overview of geospatial analysis
A quick overview of geospatial analysis
 
Active learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classificationActive learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classification
 
New Research Articles 2019 April Issue International Journal on Computational...
New Research Articles 2019 April Issue International Journal on Computational...New Research Articles 2019 April Issue International Journal on Computational...
New Research Articles 2019 April Issue International Journal on Computational...
 
CV
CVCV
CV
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
 
Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932
 
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
Re-Mining Association Mining Results Through Visualization, Data Envelopment ...
 
Data repository for sensor network a data mining approach
Data repository for sensor network  a data mining approachData repository for sensor network  a data mining approach
Data repository for sensor network a data mining approach
 
Geographic query and analysis
Geographic query and analysisGeographic query and analysis
Geographic query and analysis
 
5A_ 2_Developing a statistical methodology to improve classification and mapp...
5A_ 2_Developing a statistical methodology to improve classification and mapp...5A_ 2_Developing a statistical methodology to improve classification and mapp...
5A_ 2_Developing a statistical methodology to improve classification and mapp...
 
Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2
 

Similar to PREDICTION OF STORM DISASTER USING CLOUD MAP-REDUCE METHOD

Survey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data MiningSurvey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data Miningijcsit
 
Parametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithmParametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithmIAEME Publication
 
Real World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining ToolsReal World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining Toolsijsrd.com
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniquesijcnes
 
A Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The FutureA Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The FutureKaela Johnson
 
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICSA STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICSijistjournal
 
11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data mining11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data miningAlexander Decker
 
WebSite Visit Forecasting Using Data Mining Techniques
WebSite Visit Forecasting Using Data Mining  TechniquesWebSite Visit Forecasting Using Data Mining  Techniques
WebSite Visit Forecasting Using Data Mining TechniquesChandana Napagoda
 
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsIRJET Journal
 
Paper id 25201431
Paper id 25201431Paper id 25201431
Paper id 25201431IJRAT
 
Data Mining and Knowledge Management
Data Mining and Knowledge ManagementData Mining and Knowledge Management
Data Mining and Knowledge ManagementIRJET Journal
 
Coordination issues of multi agent systems in distributed data mining
Coordination issues of multi agent systems in distributed data miningCoordination issues of multi agent systems in distributed data mining
Coordination issues of multi agent systems in distributed data miningIAEME Publication
 
1. Web Mining – Web mining is an application of data mining for di.docx
1. Web Mining – Web mining is an application of data mining for di.docx1. Web Mining – Web mining is an application of data mining for di.docx
1. Web Mining – Web mining is an application of data mining for di.docxbraycarissa250
 
New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...IRJET Journal
 
Data Mining @ Information Age
Data Mining @ Information AgeData Mining @ Information Age
Data Mining @ Information AgeIIRindia
 
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUESGI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUESAM Publications
 
REVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining TechniquesREVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining TechniquesEditor IJMTER
 
A COMPREHENSIVE SURVEY ON DATA MINING
A COMPREHENSIVE SURVEY ON DATA MININGA COMPREHENSIVE SURVEY ON DATA MINING
A COMPREHENSIVE SURVEY ON DATA MININGArlene Smith
 
A Study on Data Visualization Techniques of Spatio Temporal Data
A Study on Data Visualization Techniques of Spatio Temporal DataA Study on Data Visualization Techniques of Spatio Temporal Data
A Study on Data Visualization Techniques of Spatio Temporal DataIJMTST Journal
 

Similar to PREDICTION OF STORM DISASTER USING CLOUD MAP-REDUCE METHOD (20)

Survey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data MiningSurvey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data Mining
 
Parametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithmParametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithm
 
Real World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining ToolsReal World Application of Big Data In Data Mining Tools
Real World Application of Big Data In Data Mining Tools
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniques
 
A Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The FutureA Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The Future
 
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICSA STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
 
11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data mining11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data mining
 
WebSite Visit Forecasting Using Data Mining Techniques
WebSite Visit Forecasting Using Data Mining  TechniquesWebSite Visit Forecasting Using Data Mining  Techniques
WebSite Visit Forecasting Using Data Mining Techniques
 
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
 
Paper id 25201431
Paper id 25201431Paper id 25201431
Paper id 25201431
 
Data Mining and Knowledge Management
Data Mining and Knowledge ManagementData Mining and Knowledge Management
Data Mining and Knowledge Management
 
Coordination issues of multi agent systems in distributed data mining
Coordination issues of multi agent systems in distributed data miningCoordination issues of multi agent systems in distributed data mining
Coordination issues of multi agent systems in distributed data mining
 
1. Web Mining – Web mining is an application of data mining for di.docx
1. Web Mining – Web mining is an application of data mining for di.docx1. Web Mining – Web mining is an application of data mining for di.docx
1. Web Mining – Web mining is an application of data mining for di.docx
 
New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...
 
Data Mining @ Information Age
Data Mining @ Information AgeData Mining @ Information Age
Data Mining @ Information Age
 
Fp3111131118
Fp3111131118Fp3111131118
Fp3111131118
 
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUESGI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
 
REVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining TechniquesREVIEW: Frequent Pattern Mining Techniques
REVIEW: Frequent Pattern Mining Techniques
 
A COMPREHENSIVE SURVEY ON DATA MINING
A COMPREHENSIVE SURVEY ON DATA MININGA COMPREHENSIVE SURVEY ON DATA MINING
A COMPREHENSIVE SURVEY ON DATA MINING
 
A Study on Data Visualization Techniques of Spatio Temporal Data
A Study on Data Visualization Techniques of Spatio Temporal DataA Study on Data Visualization Techniques of Spatio Temporal Data
A Study on Data Visualization Techniques of Spatio Temporal Data
 

More from AM Publications

DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...AM Publications
 
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...AM Publications
 
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGNTHE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGNAM Publications
 
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...AM Publications
 
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...AM Publications
 
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISESANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISESAM Publications
 
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS AM Publications
 
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...AM Publications
 
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITIONHMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITIONAM Publications
 
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...AM Publications
 
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...AM Publications
 
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...AM Publications
 
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...AM Publications
 
OPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNNOPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNNAM Publications
 
DETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECTDETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECTAM Publications
 
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENTSIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENTAM Publications
 
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...AM Publications
 
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...AM Publications
 
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY AM Publications
 

More from AM Publications (20)

DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
 
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
 
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGNTHE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
 
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
 
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
 
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISESANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
 
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
 
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
 
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITIONHMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
 
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
 
INTELLIGENT BLIND STICK
INTELLIGENT BLIND STICKINTELLIGENT BLIND STICK
INTELLIGENT BLIND STICK
 
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
 
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
 
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
 
OPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNNOPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNN
 
DETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECTDETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECT
 
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENTSIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
 
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
 
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
 
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
 

Recently uploaded

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 

Recently uploaded (20)

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 

PREDICTION OF STORM DISASTER USING CLOUD MAP-REDUCE METHOD

  • 1. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –225 PREDICTION OF STORM DISASTER USING CLOUD MAP-REDUCE METHOD ARJU JINDAL M.Tech Scholar Computer Science and Engineering Department, BGIET Sangrur, Punjab, India arzoojindal@gmail.com Er. YOGESH KUMAR Assistant Professor, Computer science and Engineering Department, BGIET Sangrur, Punjab, India yksingla37@gmail.com Manuscript History Number: IJIRAE/RS/Vol.05/Issue06/JNAE10084 Received: 03, June 2018 Final Correction: 10, June 2018 Final Accepted: 19, June 2018 Published: June 2018 Citation: JINDAL & KUMAR (2018). PREDICTION OF STORM DISASTER USING CLOUD MAP-REDUCE METHOD, IJIRAE:: International Journal of Innovative Research in Advanced Engineering, Volume V, 225-236. doi://10.26562/IJIRAE.2018.JNAE10084 Editor: Dr.A.Arul L.S, Chief Editor, IJIRAE, AM Publications, India Copyright: ©2018 This is an open access article distributed under the terms of the Creative Commons Attribution License, Which Permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited Abstract: Data mining is the process of analyzing data from different perspectives and summarizing it into useful information the patterns, associations, or relationships among all this data can provide information. Spatial Data Mining (SDM) is the part of data mining. It is mainly used for finding the figure in data that is related to space. In spatial data mining, to get the result analyst use geographical or spatial information which require some special technique to get geographical data in the appropriate formats. SDM is mainly used in earth science data, crime mapping, census data, traffic data, and cancer cluster (i.e. to investigate environment health hazards). For real time processing, Stream data mining is used for the prediction of storm using spatial dataset with the help of stream data mining strategy i.e. CMR (Cloud Map-Reduce). In this, Stream data mining is presented to detect the storm disaster of coastal area which is located in Central America country and then by taking the dataset of coastal area which are having various regions. There are various regions i.e. Panama, Greater Antilles, Mexico golf which is used to detect the storm. It also detects that, is the region is affected or not, if affected then which area of that region is affected and from this it is helpful to predict the storm before the disaster occur. In this, two parameters are used to test the technique i.e. processing time and computational load and using this parameter compare the previous and proposed technique. The main aim of this is to predict storm disaster before it happen with the help of directionality and velocity of storm. Keywords: Spatial data mining, spatial data, stream data mining, cloud map-reduce, location prediction I. INTRODUCTION Spatial Data Miningbasically used for detects the characteristics and relationship which are stored in spatial database. There is a massive amount of spatial data which capture from satellite images, medical equipment, video cameras, etc. Spatial data mining is the part of data mining. Spatial data mining is mainly used for finding the pattern in data which is related to geography [1]. In spatial data mining, to get the result analyst use geographical or spatial information which require some special technique to get geographical data in the appropriate formats. There are two types of data that kept in spatial database i.e. raster data (satellite, aerial/digital images) and vector data (lines, points, polygons).
  • 2. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –226 A. Spatial Data Mining Task: Spatial data mining has the following six tasks which are briefly explained below [2]: 1. Classification: In spatial data mining, classification is a machine learning technique used to predict the set of rules which helps to find the class of the classified object according to its attribute. 2. Association Rules: It helps to provide the rules correlate the presence of a set of items with another set of item. 3. Characteristics Rules: It is used to explain certain portion of the database. 4. Discriminate Rules: It is used to explain the contrast between any two portions of the database. 5. Clustering: Clustering is an automatic process of grouping related records together. 6. Trend Detection: It is used to discovering most significant changes in the database. B. Spatial Data Mining Techniques In spatial data mining there are various techniques which are discuss below [2]: 1. Clustering and Outlier Detection: In this cluster analysis divides data into meaningful or useful groups. It is very useful in spatial database. The collection of clusters is known as clustering. Spatial trend detection is a technique for discovering most significant changes in the database. In this clustering algorithm is used to deal with geographical datasets which are of the following forms:  Partitioning Method: In this partitioning algorithm is used to arrange the object into cluster so that deviation of cluster from its Centre should be minimized.  Hierarchical Method: In this, datasets are breakdown until stopping standard is meeting. Breakdown is by merging and splitting all clusters.  Density-Based Method: Density based method is helpful to screen out noise and trend detection which is differ from partitioning method that is helpful to find out cluster of arbitrary shapes.  Grid-Based Method: In this, initially subdivide the cluster into number of cells after that do some operations upon the grid structure. The benefit of this method is having the fast processing time. 2. Association and Co-Location: According to the non-spatial attributes is helpful to detect the characteristics rules and spatial objects while doing clustering method on data. 3. Classification: Classification is a machine learning technique which is used to create the model, from that model whole dataset is analyzed. 4. Trend Detection: It is used to discovering the most significant change that take place in database. C. Issues of Spatial Data Mining There are the various issues of spatial data mining that are described as: 1. Spatial data mining algorithm can relocate from the data mining algorithm which are not supposed to be for storage of spatial data, its processing and characteristics of spatial data itself. Spatial data mining algorithm can provide knowledge which is not much refined due to the efficiency of spatial data mining which is not so high. Spatial data mining has various method and task but it is applicable only for specific purpose. Therefore, extraction of knowledge from this is finite [3]. In spatial data mining there is no query language. So to improve the spatial data mining, develop the SDM query language to demonstrate the effective spatial data mining. 2. If there is a linkage between spatial and temporal having the relevant information like distance, direction, before and after, topology which is to be taken in spatiotemporal data analysis and mining [4]. Those attributes should be considered that contain nearby pattern which having the important effect on the pattern. Interpretation of spatial patterns, data structure to define and effective spatial datasets are also challenging issues. 3. Spatial object mainly arranged in the ascending and descending Oder using hierarchy so that it is easy to present the spatial object at different level of source [5]. Spatial patterns contain the spatial object which can be mounted but they are not much secure. Predictive tasks are developed by inductive learning algorithm that needs large set of labeled data. So, very large data can be classified and only few data can be mined and it is quite expensive. 4. The dimensionality is the most important issue in time series domain when number of dimensions is low [6]. INTRODUCTION TO STREAM DATA MINING In recent year, advances in hardware technology have facilitated new ways of collecting data continuously. Tremendous and potentially infinite volumes of data streams are often generated by real –time surveillance systems, communication networks, Internet traffic, on-line transactions in the financial market or retail industry, electric power grids, industry production processes, scientific and engineering experiments, remote sensors, and other dynamic environments. Data Stream mining is the process of extracting knowledge structures from such continuous, rapid data records. Mining data streams raises new problems for the data mining community about how to mine continuous high-speed data items that you can only have one look at. Due to this reason, traditional data mining approach needs to be changed and to discover knowledge or patterns from data streams , it is necessary to develop single-scan, on-line , multilevel , multi-dimensional stream processing and analysis methods.
  • 3. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –227 Stream data mining is concerned with extracting knowledge from non-stopping and continuous stream of information [7]. A data stream is a massive, infinite, temporally ordered, continuous and rapid sequence of data elements such as customer click streams, supermarket, telephone records, stock market, meteorological research, multimedia data, scientific and engineering experiments and sensor data. Some of issues of data stream like dynamic nature, Infinite size and high speed, unbounded memory requirements, Lack of global view, handling the continuous flow of data impose a great challenge for the researchers dealing with streaming data. In data stream model, data items can be relational tuples like network measurements and call records. In comparison with traditional data sets, data stream flows continuously in systems with varying update rate. Data streams are continuous, temporally ordered, fast changing, massive and potentially infinite. Due to huge amount and high storage cost, it is impossible to store an entire data streams or to scan through it multiple times. So it makes so many challenges in storage, computational and communication capabilities of computational systems [8]. Because of high volume and speed of input data, it is needed to use semi-automatic interactional techniques to extract embedded knowledge from data. Data stream mining is the extraction of structures of knowledge that are represented in the case of models and patterns of infinite streams of information. The general process of data stream mining is depicted in Fig 1 below: Fig 1: General process of data stream mining [8] A. Data Stream Method There are various kinds of method which are used for processing large amount of big data using stream processing which are explained below [9]: 1. Apache Storm:Apache Storm is for managing unbounded data streams over a distributed real-time streaming, developed by twitter. Mainly it varies from map-reduce is due to maintaining storm topologies. For example, a stream of tweets from twitter can be transformed into stream of trending topics. The two main components of Storm are tuples and nodes as shown in Fig 2 Fig 2: Storm Architecture [9] Problem: In Storm even though it can process a million tuples per second per node. But, at node it hold the data to send upon the acknowledge had to be received it is a Delay for data processing for stream data & it cannot handle Stragglers. 2. Apache Kafka: Apache Kafka is a pub/sub system developed by LinkedIn. Which describes the partitioned for data and it provides replication & distribution based of log-based service. Kafka maintains the topics for consumer to feed messages and subscribing the topics will publish message feeds as producer due to the requests & responses of Kafka cluster as shown in Fig 3. Problem: In Kafka model, based on the pub/sub system it maintains cluster & handle. Faults but in high-level latencies it cannot handle faults & stragglers issues.
  • 4. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –228 Fig 3: Apache Kafka Architecture [9] 3. Apache Spark: Apache Spark is a stream processing engine for fast & large-scale processing of stream data. Spark introduces D-Streams (Discretized streams) for processing the stream data using RDD based on in-memory processing concept. Here RDD means Resilient Distributed Datasets which process the stateless data work sets for handling fault-tolerance. In spark these RDD manages the streaming computations based on batches of results due to some speculation & handles Straggler using RDD as shown in Fig 4. Fig 4: Apache Spark Architecture [9] Spark Streaming providing the features for unified & batch processing as follows: First, for the spark job we can combine D-streams with RDDs computations. Second, using batch processing mode we run D-streams on past historical data. Means streaming report on past data. Third, using Scala console D-streams run ad-hoc queries. Problem: Spark is better than all stream processing engine and it can handle faults & stragglers, but using RDD in- memory concept it processing low streaming compare to FLINK streaming. 4. Apache Flink:Apache Flink is component of alternative approach for Hadoop map-reduce style for data processing system. It comes with its own runtime, rather than building on top of Map-Reduce. It is independent for the ecosystem of Hadoop. Therefore, in-built Hadoop cluster & YARN support is there for flink to processing the data in map-reduce style and also in iterative & intensive stream processing as shown in Fig 5. Fig 5: Apache Flink Architecture [9] 5. Map-Reduce: Map-Reduce is a Framework, which consists of basically two phases Map & Reduce with process the data using batch processing style with applying sort & shuffle functionalities of data. Map-Reduce consists of two separate and distinct phase:  Mapper: A function from an input pair of key/value to a list of intermediate pairs of key/value Map: (key-input, value-input) List (key-map, value-map)  Reduce: A function from an intermediate pairs of key/ value to a list of output pairs of key/value13. Reduce: (key-map, list (value-map)) list (key-reduce, value-reduce) as shown in Fig 6.
  • 5. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –229 Fig 6: Flow of Map-Reduce Architecture [9] Problem: In the Map-reduce framework the problem is when there is a data processing using batch processing, it cannot handle the multiple & short-live queries for requests and responses & also it cannot handle the Stragglers. 6. Hadoop: Hadoop is used for storing and processing big data. Apache Hadoop is a distributed computing framework modelled after Google Map-Reduce to process large amounts of data in parallel. Once in a while, the first thing that comes to my mind when speaking about distributed computing is EJB. EJB is de facto a component model with remote capability but short of the critical features being a distributed computing framework that include computational parallelization, work distribution, and tolerance to unreliable hardware and software. In general, data flows from components to components in an enterprise application. This is the case for application frameworks (EJB and spring framework), integration engines (Camel and Spring Integration), as well as ESB (Enterprise Service Bus) products. Nevertheless, for the data-intensive processes Hadoop deals with, it makes better sense to load a big data set once and perform various analysis jobs locally to minimize IO and network cost, the so-called "Move-Code-To-Data" philosophy. When you load a big data file to HDFS, the file is split into chunks (or file blocks) through a centralized Name Node (master node) and resides on individual Data Nodes (slave nodes) in the Hadoop cluster for parallel processing. The structure of Hadoop is shown in Fig 7 below: Fig 7: Structure of Hadoop [9] II. LITERATURE REVIEW Refonna et al. in this paper, author discuss the analysis and prediction of natural disaster by using spatial data mining techniques [1]. In this basically map-reduce framework is used to predict the disaster like earthquake, storm identification, rainfall etc. It also used the big data concept which contain two services first is Hadoop software and related hardware second is No SQL database software and related hardware. Wang et al. in this, spatial pattern analyse with time series domain from 3D images. Space filling curve is used when traversing an image. Analyse the spatial sequence in the transformed domain to produce different patterns [6]. In this various techniques was used to analyse the time series i.e. Euclidean distance, reduction technique, singular value, piecewise aggregate approximation etc. fMRI dataset can be analysed. Mohamed MedhatGaber, et al.in this paper author explains the stream data mining. In stream data mining there are two kinds of solution based on problems and challenges i.e. data based solution which is used to summarize the whole dataset or choosing a subset of the incoming stream to be analysed [10]. Second is task based solution which is used to modify the existing techniques or invent new ones in order to address the computational challenges of data stream processing. It also presents the mining techniques and research issues. Pereyra Zamora et al.in this paper author discuss the spatial data mining technique is applicable to spatial load forecasting methods. It is used in distribution system [11]. Fundamental stage of KDD process is data preparation. Neural network is efficient in spatial data mining tasks.
  • 6. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –230 Elena Ikonomovksa, et al.in this paper author presents the introduction of stream data mining also the theoretical foundations of data stream analysis and identifies potential directions of future research [12]. Mining data stream techniques are being critically reviewed. Mining data streams is an immature, growing field of study. There are many open issues that need to be addressed. It improve the real-time decision making process in almost every area of our life. Sumathi et al.in this, spatial data mining algorithm is divided into four categories, first is clustering and outlier detection second is association and co-location third is classification and fourth is trend detection [2].Clustering is an unsupervised method. There are a number of clusters presented in data set. The cluster results are needed for validation. Aakunuri et al.author describes the various kind clustering algorithms used for spatial data mining. First is Partitioning around Medoids (PAM) and second is clustering large application (CLARA). Both algorithms are inefficient in terms of complexity [13]. After that new algorithm is used to overcome this problem i.e. clustering large application based on Randomized search (CLARANS) is designed for cluster analysis. By using this, two algorithms are designed for spatial data mining i.e. spatial dominant approach (SD CLARANS) and other is non- spatial dominant approach (NSD CLARANS). SvitlanaVolkovaauthor proposed the algorithm, method and frameworks for processing and analysing the real time data stream [14]. It also discusses the difference between batch and stream mode learning and analyse the learning framework which include MOA and Jerboa. In this application for learning from social media streams is also explained. Otavio M. de Carvalho, et al.this paper reviews the state-of-the-art in event processing systems. A set of the most significant weaknesses and limitations is discussed at a high level, and also outline requirements that a system should meet to excel at a variety of event processing applications. It also discussed the evolution in the event processing field [15]. It shows the path of development of these tools has changed since by integrating the Map- Reduce model into the event processing domain. Jayasinghe et al.author describe the spatial data mining techniques is mainly used to find the forests extent changes. In spatial data mining satellite image were used to get knowledge about any change in forests. In this two approaches are used to derive the thematic maps i.e. supervised and unsupervised [16]. Back propagation algorithm is used to detect the forests changes and update existing forests maps Georg Krempl, et al.in this paper presents a discussion on eight open challenges for data stream mining which is used to identify gaps between current research and meaningful applications, highlight open problems, and define new application-relevant research directions for data stream mining [17]. The resulting analysis is illustrated by practical applications and provides general suggestions concerning lines of future research in data stream mining. III. EXISTING METHOD Map-Reduce: Map-Reduce is a Framework, which consists of basically two phases Map & Reduce with process the data using batch processing style with applying sort & shuffle functionalities of data. Map-Reduce consist of two separate and distinct phase: 1. Mapper:A function from an input pair of key/value to a list of intermediate pairs of key/value Map: (key-input, value-input) List (key-map, value-map) 2. Reduce: A function from an intermediate pairs of key/ value to a list of output pairs of key/value13. Reduce: (key-map, list (value-map)) list (key-reduce, value-reduce) as shown in Fig 8. Fig 8: Flow of Map-Reduce Architecture
  • 7. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –231 Problem: In the Map-reduce framework the problem is when there is a data processing using batch processing, it cannot handle the multiple & short-live queries for requests and responses & also it cannot handle the Stragglers. This algorithm is not much appropriate due to lack of scalability and does not support stream data mining. IV. PROBLEM FORMULATION Problem Statement After doing a literature review, it has been found that data mining of spatial data can be very helpful in solving the problem of predicting natural disasters based on spatial data mining techniques. Spatial Data is geo location based data which provides us information about natural disasters. In recent work Map-Reduce technique is used for spatial data mining. It consists of basically two phases Map & Reduce that process the data using batch processing style with applying sort & shuffle functionalities of data. The problem faced in recent work is, the processing of Map and Reduce Phases is sequential, Limited scalability since it is a cluster based, Do not support flexible estimation, Stream data processing is not supported. Therefore, in proposed work new technique is presented i.e. Cloud Map- Reduce which is based on the concept of stream data mining. This technique is helpful in mining real-time data. This technique will help to reduce computing load and processing time as processing of big data will be performed on cloud. Methodology 1. Studying the existing stream mining techniques. 2. Developing a stream mining technique specific to spatial data mining. 3. Implement proposed technique on MATLAB. 4. Compare the previous technique with the proposed technique. V. PROPOSED METHOD Cloud Map-Reduce Cloud Map-Reduce (CMR) is a framework for processing large dataset in cloud. In CMR, it support streaming data as input using pipelining between the Map and Reduce phases ensuring that the output of the map phase is made available to the reduce phase as soon as it is produced. This CMR approach leads to increase the parallelism between the Map and Reduce phases as follows: 1. Supporting streaming data as input 2. Reduce delays 3. Improve processing time and computational load 4. Real time processing 5. Provide flexibility and scalability The architecture of CMR consist of one input queue, multiple reduce queue which act as a staging areas for holding the intermediate (key, value) pairs produced by the mapper, a master reduce queue that holds the pointers to the reduce queues, and the output queue that holds the final result. The flow of cloud map-reduce architecture is shown in Fig 9: Fig 9: Flow of Cloud Map-Reduce Architecture Inputs And Outputs Of Cloud Map-Reduce The Cloud Map Reduce framework operates on key-value pairs, that is, the framework views the input to the job as a set of key-value pairs and produces a set of key-value pair as the output of the job, conceivably of different types. The key and value classes have to be serializable by the framework and hence, it is required to implement the Writable interface. Additionally, the key classes have to implement the Writable Comparable interface to facilitate sorting by the framework. Both the input and output format of a Cloud Map Reduce jobs are in the form of key-value pairs as shown in the table1 below:
  • 8. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –232 Table 1: I/O for Computing Portioning Function VI. PROPOSED ALGORITHM Data mining is the process of analysing data from different perspectives and summarizing it into useful information the patterns, associations, or relationships among all this data can provide information. There is a huge amount of data available in the Information Industry. This data is of no use until it is converted into useful information. Spatial data mining is mainly used for finding the pattern in data which is related to geography. Spatial dataset is used to predict the storm disaster using the stream data mining strategy i.e. cloud map reduce. For the detection of storm general cloud map-reduce algorithm is as follows: Step 1: Initialize system Step 2: ObjMapv= Initialize Map Values; Step 3: ObjRedv= Initialize Reduce Values; Step 4: for i=1 to all dimensions (1) Step 5: if (Map (i) ==ID // ID is interest dimension Step 6: FinalMap=ID; Step 7: end if Step 8: end for Step 9: Reducevaluecount=0 Step 10: for 1=1 to all dimension (FinalMap) Step 11: if (Red (i) ==IV) // IV is the interest dimension Step 12: FinalRed(ReduceValueCount)=IV Step 13: end if Step 14: end for Step 15: if (FinalRed==FinalOutput) Step 16: end SystemCycle Step 17: else Step 18: Go to Step 9 Step 19: end if VII. FLOW CHART OF PROPOSED TECHNIQUE Fuction Input (key,value) Output (key, value) cloud (location+dname) (dname+(dcol,drow)) map (dname+(dcol,drow)) (dcol(item.id),drow(item.id)) reduce (dcol(item.id),drow(item.id)) (item.id),drow(item.id.sq.id)) result drow(item.id.sq.id).res Res(Final Result) Start Take spatial data set Read image Image processing Apply CMR Technique Store Down sample image in map-store Storm Detection Mapping Reduction Storm prediction Generate result Compare result End
  • 9. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –233 VIII. PARAMETER USED In this to compare the performance of both techniques we use two parameters which are briefly explained below: 1. Processing Time: The time taken to process the spatial images in this is known as processing. Here the processing time is parameter which is used to compare the performance of previous and proposed techniques. 2. Computational Load: In the computer system, the system load is a measure of computational work that a computer system performs. It simply checks the load in the operating system and also by using technique it compare the load on each technique. IX. RESULT ANALYSIS Spatial Data Mining basically used to detect the characteristics and relationship which are stored in spatial database. There is a massive amount of spatial data which capture from satellite images, medical equipment, video cameras, etc. Data Stream mining is the process of extracting knowledge structures from such continuous, rapid data records. A technique is presented for prediction of storm disaster area using spatial images with the help of stream data mining, there are images of different regions such as Panama, Greater Antilles, Mexico golf etc. are taken and according to proposed technique storm disaster of each area is predicted. The results of proposed technique are shown in the different images below. Result In these all figures there are different regions to find the storm disaster area. In Fig 10(a) shows the region to find storm disaster area, in Fig 10(b) image with red dots called detected storm disaster area and next one Fig 10(c) shows the predicted storm disaster area. These steps are performed in images of different regions. Fig 10(a) it show region to find storm prone area (Greater Antilles), Fig 10(b) shows detected storm prone area (Greater Antilles), Fig 10(c) shows predicted storm prone area from region (Greater Antilles) Fig 10(a, b, c) Fig 11(a) it show region to find storm prone area (Mexico Golf), Fig 11(b) shows detected storm prone area (Mexico Golf), Fig 11(c) shows predicted storm prone area from region (Mexico Golf) Fig 11 (a, b, c) Fig 12(a) it show region to find storm prone area (Panama), Fig 12(b) shows detected storm prone area (Panama), Fig 12(c) shows predicted storm prone area from region (Panama)
  • 10. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –234 Fig 12(a, b, c) In Fig 13 graph shows the comparison of centroid mean location of x-axis to the y-axis with parameters location and mean location. Table 2: Values of mean location vs. location Fig 13: Graph representation of Mean Location vs. Location In Fig 14 graph shows the comparison of centroid variance over location of x-axis to the y-axis with parameters location and deviation. Fig 14 Graph representation of Deviation vs. Location Centroid Mean Location x-axis (Location) y-axis (Mean Location) 1 2 203 2 2.9825 117.5263 3 1.5000 38.5000 4 27.8019 31.9245 5 26.6486 34.3019 6 17.2736 45.5094 7 16.7547 37.5849 8 1 1
  • 11. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –235 Centroid Variance over Location x-axis (Location) y-axis ( Deviation) 1 0 0 2 1.00873379027825 87.7598397542076 3 0 0 4 38.3654588457129 9.59136471142823 5 37.8936569849650 20.5203262131073 6 28.9484860391625 24.8675144617927 7 28.9172086126574 19.7039067622951 8 0 0 Table 3: value table deviation vs. location Comparison between Both Techniques Using The Parameter During the preliminary study, to compare the performance of both techniques parameters are used i.e. processing time and computational load. In Fig 15 show the Comparison between previous and proposed technique w.r.t processing time. The value of Processing Time for proposed work is 11.7627 seconds and for previous work is 13.7966 seconds Fig 15 Graph representation of processing time previous and proposed work In Fig 16 show the Comparison between previous and proposed technique w.r.t Computational Load. The values of Computational load for proposed work is 1070.4021 and for previous work are 1255.492 Fig 16 Graph representation of Computational Load previous and proposed work
  • 12. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 06, Volume 5 (June 2018) www.ijirae.com ___________________________________________________________________________________________________ IJIRAE: Impact Factor Value – SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 18, All Rights Reserved Page –236 X. CONCLUSION Spatial data mining is the process of discovering interesting and previously unknown, but potentially useful patterns from large spatial datasets. Extracting interesting and useful patterns from spatial datasets is more difficult than extracting the corresponding patterns from traditional numeric and categorical data due to the complexity of spatial data types, spatial relationships, and spatial auto correlation. We have presented a technique for prediction of storm prone area using spatial images with the help of spatial data mining, there are images of different regions are taken such as Panama, Greater Antilles, Mexico golf and according to our proposed method we have to find the storm prone area of each region in image. The results of proposed technique are shown in the different images in result section. We have taken spatial images of different gulf areas after the processing of images we get to know the detected storm prone area and on the basis of detected storm calculations are made and with the help of directionality and speed of storm we are able to predict storm prone area. In the future we can develop the technique to predict the natural disasters like Storms, Tornados and Tsunami etc. and the developed technique is working with gulf spatial images but it can be tested with spatial images of plain areas. REFERENCE 1. J.Refonaa, Dr. M. Lakshmi, V.Vivek, “Analysis And Prediction Of Natural Disaster Using Spatial Data Mining Technique,” [ICCPCT] International Conference on Circuit, Power and Computing Technologies, 2017. 2. N.Sumathi, R.Geetha, Dr.S.SathiyaBama, “Spatial Data Mining - Techniques Trends and Its Applications”. 3. HailiangJin,Baoliang Miao, “The Research Progress of Spatial Data Mining Technique”. 4. D.Malerba, “Mining Spatial Data: Opportunities and Challenges of a Relational Approach,”Dipartimento di Informatica, Universit`adegliStudi di Bari, Via Orabona 4, I-70126 Bari, Italy. 5. K. KotaiahSwamy, E. Ravi Kumar, “Investigation and Prediction of Natural Disaster With Spatial Data Mining Technique,” International Journal of Current Trends in Engineering & Research (IJCTER) e-ISSN 2455–1392 Vol.2 Issue 5, May 2016. 6. Qiang Wang, DespinaKontos, Guo Li, VasileiosMegalooikonomou, “Application of time series techniques to data mining and analysis of spatial patterns in 3D images” 7. Twinkle B Ankleshwaria and J. S Dhoni “Mining Data Stream: A Survey” International Journal of Advance Research in Computer Science and Management Studies. Vol.2, Issue 2, February 2014. 8. MahnooshKholghi,MohammadrezaKeyvanpour, “An Analytical Framework for Data Stream Mining Techniques Based On Challenges and Requirements,”International Journal of Engineering Science and Technology (IJEST). 9. B.V.S Srikanth,V. Krishna Reddy, “Efficiency of Stream Processing Engines for Processing BIGDATA Streams,” Indian Journal of Science and technology, Vol.9(14) April 2016. 10. Mohamed MedhatGaber,ShonaliKrishnaswamy and ArkadyZaslavsky, “Cost-Efficient Mining Techniques for Data Streams,”2004. 11. F.H.PereyraZamora,C.M.V.Tahan, “Data Mining Techniques Applied To Spatial Load Forecasting,” 18th International Conference on Electricity Distribution, Turin, 6-9 June 2005. 12. Elena Lkonomovska, suzanaLoskovska, DejanGjorgjevik, “A Survey of Stream Data Mining,” 2007. 13. ManjulaAakunuri, Dr.G.Narasimha, SudhakarKatherapaka, “Spatial Data Mining: A Recent Survey and New Discussions,” (IJCSIT) International Journal of Computer Science and Information Technologies, ISSN: 0975- 9646, Vol.2 (4), 2011, 1501-1504. 14. SvitlanaVolkova, “Data Stream Mining: A Review of Learning Methods and Frameworks,” October 12, 2012. 15. Otavio M. De Carvalho, Eduardo Roloff and Philippe O.A. Navaux, “A Survey of the State-of-the-art in Event Processing,” 11th workshop on parallel and Distributed Processing , 2013. 16. P.K.S.C.Jayasinghe, Masao Yoshida, “Spatial data mining technique to evaluate forest extent changes using GIS and Remote Sensing,” International Conference on Advances in ICT for Emerging Regions (ICTer): 222 – 227, 2013. 17. Georg krempl, IndreZliobaite, Dariusz Brzezinski, “Open Challenges for Data Stream Mining Research,” SIGKDD Vol.16, Issue 1, 2013.