development of diagnostic enzyme assay to detect leuser virus
Tehrany2014
1. Flood susceptibility mapping using a novel ensemble
weights-of-evidence and support vector machine models in GIS
Mahyat Shafapour Tehrany, Biswajeet Pradhan ⇑
, Mustafa Neamah Jebur
Department of Civil Engineering, Geospatial Information Science Research Center (GISRC), Faculty of Engineering, University Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
a r t i c l e i n f o
Article history:
Received 24 October 2013
Received in revised form 27 February 2014
Accepted 2 March 2014
Available online 12 March 2014
This manuscript was handled by
Geoff Syme, Editor-in-Chief
Keywords:
Flood susceptibility
Weights-of-evidence (WoE)
Support vector machine (SVM)
GIS
Remote sensing
Malaysia
s u m m a r y
Flood is one of the most devastating natural disasters that occur frequently in Terengganu, Malaysia.
Recently, ensemble based techniques are getting extremely popular in flood modeling. In this paper,
weights-of-evidence (WoE) model was utilized first, to assess the impact of classes of each conditioning
factor on flooding through bivariate statistical analysis (BSA). Then, these factors were reclassified using
the acquired weights and entered into the support vector machine (SVM) model to evaluate the correla-
tion between flood occurrence and each conditioning factor. Through this integration, the weak point of
WoE can be solved and the performance of the SVM will be enhanced. The spatial database included flood
inventory, slope, stream power index (SPI), topographic wetness index (TWI), altitude, curvature, dis-
tance from the river, geology, rainfall, land use/cover (LULC), and soil type. Four kernel types of SVM (lin-
ear kernel (LN), polynomial kernel (PL), radial basis function kernel (RBF), and sigmoid kernel (SIG)) were
used to investigate the performance of each kernel type. The efficiency of the new ensemble WoE and
SVM method was tested using area under curve (AUC) which measured the prediction and success rates.
The validation results proved the strength and efficiency of the ensemble method over the individual
methods. The best results were obtained from RBF kernel when compared with the other kernel types.
Success rate and prediction rate for ensemble WoE and RBF-SVM method were 96.48% and 95.67%
respectively. The proposed ensemble flood susceptibility mapping method could assist researchers and
local governments in flood mitigation strategies.
Ó 2014 Elsevier B.V. All rights reserved.
1. Introduction
Natural disasters are the main cause of irrecoverable damages
worldwide (Vorogushyn et al., 2012). Malaysia embraces flood
events annually mostly during the monsoon seasons (Kia et al.,
2012). These flooding have caused considerable damage to high-
ways, settlement, agriculture and livelihood (Pradhan and Youssef,
2011). These consequences can be decreased through an appropri-
ate and accurate susceptibility analysis. Therefore, generating
accurate flood modeling is one of the prime goals of scientists
and governments. Over the years, many hydrological approaches
have been used in the literature (Fenicia et al., 2013). Traditional
hydrological methods such as physical based rainfall–runoff
modeling techniques and data-driven techniques are not capable
for comprehensive analysis of rivers and inundation areas (Smith
and Ward, 1998). The reason being the hydrological methods
follow one-dimensional procedure while the morphology of the
river is not stable and it has dynamic characteristic due to the high
erosive potential (Chau and Lee, 1991; Refsgaard, 1997). Moreover,
these methods require fieldwork and huge budget for data collec-
tion (Fenicia et al., 2013).
Due to the aforementioned drawbacks, scientists started to use
the empirical and data driven methods such as artificial neural
networks (ANNs) (Kia et al., 2012; Oh and Pradhan, 2011), genetic
programming (GP) (Khu et al., 2001), evolutionary polynomial
regression (EPR) (Elshorbagy et al., 2010a), M5 model trees (M5)
(Elshorbagy et al., 2010b), and K-nearest neighbors (K-NN) (Toth
et al., 2000) techniques (Chau et al., 2005; Elshorbagy et al.,
2010b; Varoonchotikul, 2003). Remote sensing (RS) and geo-
graphic information system (GIS) techniques have made significant
contribution in flood modeling and prediction (Haq et al., 2012;
Pradhan et al., 2014; Pradhan and Shafiee, 2009). Variety of data
sources, high data quality, day and night data collection capabili-
ties, and rapid analysis were offered by RS and GIS techniques for
hydrological studies (Bates, 2004, 2012; Wanders et al., 2013).
2. Previous studies
The most popular methods in natural hazard modeling are ANN
(Pradhan and Buchroithner, 2010; Pradhan et al., 2010b), analytic
http://dx.doi.org/10.1016/j.jhydrol.2014.03.008
0022-1694/Ó 2014 Elsevier B.V. All rights reserved.
⇑ Corresponding author. Tel.: +60 3 89466383; fax: +60 3 89466459.
E-mail addresses: biswajeet24@gmail.com, biswajeet@lycos.com (B. Pradhan).
Journal of Hydrology 512 (2014) 332–343
Contents lists available at ScienceDirect
Journal of Hydrology
journal homepage: www.elsevier.com/locate/jhydrol
2. hierarchy process (AHP) (Yalcin, 2008), frequency ratio (FR) (Lee
et al., 2012; Pradhan et al., 2011), logistic regression (LR) (Pradhan,
2010) and fuzzy logic (Pradhan, 2011). In a recent paper, Zou et al.
(2013) described AHP as a cost-effective, understandable, and easy
to use method. The drawback of AHP is related to its dependency
on the expert’s knowledge which is the main source of uncertainty
(Chowdary et al., 2013). Statistical methods of LR and FR utilize
simple and understandable concepts which made them popular
(Liao and Carin, 2009). Some drawbacks can be seen in their perfor-
mance, although both have valuable advantages (Lee et al., 2012).
LR is capable to implement multivariate statistical analysis (MSA)
but it has some weak points to analyze the classes of each flood
conditioning factor. BSA can be done by FR which evaluates the
influence of classes of each conditioning factor on flooding (Lee
et al., 2012). The weak point of this method is the relationship
between the variables which is mostly neglected.
The application of machine learning methods in flood studies is
proven by many researches (Chau et al., 2005; Kia et al., 2012; Ma-
ier et al., 2010). Machine learning is the main source of techniques
for the data-driven modeling problem (Jebur et al., 2013). The fun-
damental concepts of machine learning and its usages in spatially
distributed data are given in Kanevskij et al. (2009). The efficiency
of the machine learning methods in flooded area extraction has
been examined by Lamovec et al. (2013) and Tehrany et al.
(2013b). Lamovec et al. (2013) stated that, due to the need of prop-
er flood analysis, the tested machine learning methods such as
decision tree (DT) and random forest (RF) were capable to detect
the flooded locations with relatively higher accuracy. ANN which
is one of the machine learning methods has been used in many
hydrological and water resource engineering applications due to
its computational efficiency (Kia et al., 2012). ANN is very popular
for prediction analysis which tries to make correlation between
some input conditioning factors and an output (Pradhan and
Buchroithner, 2010; Wan et al., 2010a). In a recent paper, Tiwari
and Chatterjee (2010) stated that the length of the dataset can
cause error in the process of ANN modeling. Furthermore, poor
predictions can be achieved when the validation dataset contain
values outside of the range of those used for training. Although
the ANN has some drawbacks, many researchers utilized this
method in their hydrological analysis which is more robust that
other statistical and deterministic methods (Dawson et al., 2006;
Kim and Barros, 2001). The efficiency of DT has been assessed re-
cently by Tehrany et al. (2013b) to map the flood susceptible areas
in Kelantan, Malaysia. Their study indicated that while DT could
produce acceptable results, some difficulties were encountered
while defining the rules. Moreover, the accuracy of the DT method
needed to be enhanced. From the above literature review, it is evi-
dent that the precision of the machine learning algorithms can be
increased if they can be combined with BSA. This interest has been
stimulated by the complex nature of hydrological systems and the
capability of machine learning methods to model nonlinear corre-
lations (Maier et al., 2010).
As it has been explained above, some methods have few limita-
tions in flood susceptibility mapping. Therefore, the purpose is to
obtain more accurate results through ensemble methods. This
has been proven by the ensemble methods in the literature which
could enhance the analysis and overcome the weak points of the
standalone individual methods (Wan et al., 2012). For instance,
adaptive-network-based fuzzy inference system (ANFIS) which is
a combination of ANN and fuzzy interface system (FIS) is found
to be more efficient than using the individual ANN and FIS methods
(Chau et al., 2005; Pradhan, 2013; Tien Bui et al., 2012b). Another
example of ensemble modeling is genetic algorithm-based artifi-
cial neural network (ANN-GA) which is a hybrid integration of
ANN and genetic algorithm (GA) that took advantage of both
schemes. Chau et al. (2005) evaluated the robustness and strength
of ANFIS, ANN-GA and LR through comparison of their perfor-
mance. LR showed the highest uncertainty compared to the other
hybrid methods. Both ANFIS, ANN-GA methods due to their
nonlinear natures were capable to perform better than statistical
linear LR.
From the aforementioned literature review, it can be concluded
that nonlinear and complex problems can be handled using ensem-
ble methods. In recent years, Terengganu, Malaysia has been hit by
several floods (Arbain and Wibowo, 2012). The current research
deals with the 2009 flood event in Terengganu. This study aims
to generate an accurate flood susceptibility mapping method by
enhancing the machine learning procedure of support vector ma-
chine (SVM) through its integration with weights-of-evidence
(WoE) based statistical method. WoE is mostly used in landslide
mapping and it is relatively new in flood modeling (Dahal et al.,
2008; Regmi et al., 2013). WoE uses the Bayesian probabilistic
model for solving problems of decision-making under uncertain-
ties. This method is appropriate for hazard mapping because its
uncertainty is connected with hazard events and their associations
with the complex landscape (Pourghasemi et al., 2013c). WoE is
able to implement the BSA and extract the correlation between
classes of each conditioning factor and flooding. However it is
not capable to perform multivariate statistical analyses (MSA)
which leads to neglect the relationship between the conditioning
factors. Hence through its integration with SVM this drawback
can be solved and performance of SVM can be enhanced simulta-
neously. The output of WoE will be used as an input for MSA of
SVM which leads to find the relationship between all conditioning
factors and flood occurrence. This ensemble of WoE and SVM offers
the novelty in this study.
3. Study area
The study is carried out in Terengganu which is situated in the
north-eastern part of Peninsular Malaysia, and is bordered in the
Northwest by Kelantan, the southwest by Pahang, and the east
by the South China Sea (Zaleha et al., 2006). Following heavy rain-
fall in 27th November 2009 the Terengganu state faced destructive
flooding which was chosen as a suitable application site for flood
susceptibility mapping. The study area covers Kuala Terengganu
city and is located between the latitudes 5°000
N and 5°320
N, and
longitudes 102°400
E and 103°100
E. Since the flooding was most in-
tense in the North East part of Terengganu, the analysis focused on
the selected catchment (Fig. 1). The bedrock geology of the Tereng-
ganu is generally made by Phyllite, slate, shale and sandstone.
4. Data used
4.1. Flood inventory
In order to estimate the future flood event in an area, analyzing
the past records of its occurrence is essential (Manandhar, 2010).
Therefore, an inventory map is considered as the most important
factor for prediction of future disaster occurrence and it can repre-
sent single or multiple events in a specific area (Tien Bui et al.,
2012a). In the current research the inventory map is a collection
of single flood occurrence which took place in 27th November
2009. To begin, a flood inventory map was produced by mapping
the flood locations in the Terengganu using documentary sources
and field survey. A total of 180 flood locations and flood inventory
map was prepared using these locations.
The flood inventory map was partitioned into 70% and 30% to be
used for training and testing respectively (Ohlmacher and Davis,
2003; Tunusluoglu et al., 2008) (Fig. 1). The training flood locations
(126 points) were selected randomly and flood layer which is
M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343 333
3. considered as a dependent factor was made. Flood layer was made
up by 0 and 1 values, as the value of 1 showed the existence and
the value of 0 illustrated the absence of flooding over the area.
Similarly, equal number of points (126 points) was chosen as non-
flooded areas to give the value of 0. As the flooding cannot be oc-
curred in the high elevation regions such as high hills, non-flooded
areas were randomly selected form these locations. The rest of the
flood events (54 points) were utilized for the purpose of testing.
4.2. Flood conditioning factors
It is essential to determine the flood conditioning factors in or-
der to perform flood susceptibility mapping (Kia et al., 2012).
Therefore, in the first stage, a flood related spatial database should
be created. Through the knowledge attained from the literature
(Smith and Ward, 1998) and field investigation the conditioning
factors were chosen. Hence, ten flood conditioning factors were se-
lected for the susceptibility analysis and the spatial database of
these factors was compiled. Those factors are: slope, stream power
index (SPI), topographic wetness index (TWI), altitude, curvature,
distance form river, geology, rainfall, landuse/cover (LULC), and soil
type (Table 2). The conditioning factors were nominal, ordinal and
scale. All the scale conditioning factors were classified using popu-
lar method of quantile (Tehrany et al., 2013b) which is a require-
ment for WoE analysis (Fig. 2). Once the dataset was prepared,
each conditioning factor was transformed into a grid spatial data-
base by 15 Â 15 m size and the grid of the Terengganu region was
constructed by 3751 columns and 2138 rows (8,019,638 pixels;
1804 km2
).
The geomorphological information can be extracted through
the analysis of slope, SPI, TWI and curvature. The mentioned
geomorphological layers were derived from DEM using ArcGIS 10
software. Water-related factors of SPI and TWI were calculated
using following equations:
TWI ¼ lnðAs= tan bÞ ð1Þ
SPI ¼ As tan b ð2Þ
where As is the specific catchment area (m2
mÀ1
), and b (radian) is
the slope gradient (in degrees) (Regmi et al., 2010). All the derived
conditioning factors were initially continuous, but quantile method
was used to classify each conditioning factor into different catego-
ries. In the slope (0–62.9), SPI (0.00–23.12), TWI (0.30–34.13) and
altitude (0–1208) maps, ten categories were constructed for each
for the analysis (Fig. 2a–d respectively). Curvature was classified
into three classes of concave, convex and flat (Fig. 2e). The distance
from the river has significant impact on the spread and magnitude
of flooding in the study area (Glenn et al., 2012; Wan et al., 2010b).
This map was produced using the buffer tool in ArcGIS 10 software
and ten buffer categories were made (Fig. 2f). The geology was ex-
tracted from the geological database which differentiated into three
classes of (1) phyllite, slate, shale and sandstone, (2) intrusive rocks;
mainly granite with minor granodiorite and (3) clay, silt, sand, peat
with minor gravel (Fig. 2g). In Malaysia, the monsoon flooding takes
place after heavy rain, so this factor should be considered as one of
the main contributors in flood occurrence (Bajabaa et al., 2013).
Rainfall data was classified into six classes (2375, 2625, 2875,
3125, 3375 and 3625) using the data from the water gauge stations
Fig. 1. Flood location map with hill-shaded map of Kuala Terengganu, Terengganu, Malaysia.
334 M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343
4. (Fig. 2h). LULC is also one of the main conditioning factors that
strongly contribute in flooding. Vegetated areas are less prone to
flooding due to the negative relationship between flooding and
vegetation density (Tehrany et al., 2013b). On the other hand, urban
areas which are mostly made by impervious surfaces and bare lands
increase the storm runoff. LULC map consists of forest, built-up,
cropland, wetlands and paddy (Fig. 2i). The study area is mostly
covered by forest and cropland. Soil layer was produced using 14
Fig. 2. Input thematic layers: (a) slope, (b) SPI, (c) TWI, (d) altitude, (e) curvature, (f) distance form river, (g) geology, (h) rainfall, (i) LULC, and (j) soil type.
M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343 335
5. soil types as can be seen in Fig. 2j. The dominant soil type can be
seen around the river in which most of the flooding occurred in this
region.
5. Methodology
The flood susceptibility analysis is one of the important studies
of river hydrology which was conducted using the ensemble WoE
and SVM methods. The step-by-step procedures for the proposed
method are presented in Fig. 3.
5.1. Weights-of-evidence
The current research demonstrates the application of the
ensemble SVM and WoE (a Bayesian probability model) method
for flood susceptibility mapping using GIS in a tropical area of Ter-
engganu, Malaysia. Using flood layer and spatial database of condi-
tioning factors, the WoE was applied to measure each relevant
factor’s weight. These weights can be extracted using the analysis
of the spatial relationships between the flood location and each of
the conditioning factors. This method has many pros compared to
the other statistical methods as it is a data-driven method that is
basically uses the Bayesian probability model (Pradhan et al.,
2010). Many studies have used WoE for landslide mapping
(Mohammady et al., 2012; Pourghasemi et al., 2013a; Regmi
et al., 2013); however, this method is relatively new in flood sus-
ceptibility mapping.
To perform WoE, positive weight (W+
) and negative weight
(WÀ
) should be calculated as the essential parameters. The method
measures the weight for each conditioning factor (B) based on the
presence or absence of the disaster (A) within the area (Bonham-
Carter, 1994) as follow:
Wþ
i ¼ ln
PfBjAg
PfBjAg
ð3Þ
WÀ
i ¼ ln
PfBjAg
PfBjAg
ð4Þ
where P is the probability and ln is the natural log. B and B are the
presence and absence of the conditioning factor respectively. Also A
and A represent the presence and absence of a disaster (Xu et al.,
2012). A positive weight (W+
) demonstrates the existence of the
conditioning factor at the disaster locations, and its magnitude
shows the positive relationship between the existence of the condi-
tioning factor and disaster occurrence respectively. A negative
weight (WÀ
) shows the absence of the conditioning factor and
indicates the level of negative relationship (Regmi et al., 2013).
The difference between the two weights of W+
and WÀ
is known
as the weight contrast which reveals the spatial association be-
tween the conditioning factor and the disaster occurrence (Dahal
et al., 2008). C is positive for a positive relationship and negative
for a negative spatial relationship. The studentized contrast which
is the final weight is a measure of confidence and is defined as
the ratio of the contrast divided by its standard deviation. The stu-
dentized contrast serves as an informal test that C is significantly
different from zero or if the contrast is likely to be ‘‘real’ (Regmi
et al., 2013). A detail explanation of the mathematical formulation
of this method can be seen in (Bonham-Carter, 1991, 1994, 2002;
Pradhan et al., 2010a; Regmi et al., 2013).
After applying the WoE model, the weights were normalized in
the range of 0 and 1. Conditioning factors were reclassified based
on these normalized values and consequently fed into SVM model.
The normalization should be done because the weights of condi-
tioning factors vary in dimensions and are not appropriate for di-
rect input for the SVM model.
5.2. Support vector machine
One of the popular machine learning algorithms is SVM which
is a supervised learning binary classifier and is based on the struc-
tural risk minimization principle (Wan and Lei, 2009; Yao et al.,
2008). Separating hyper-plane formation from training dataset is
the basis of this method. Separating hyper-plane is generated in
the original space of n coordinates (xi parameters in vector x) be-
tween the points of two distinct classes (Marjanovic´ et al., 2011).
Maximum margin of separation between the classes is discovered
by SVM and therefore, it builds a classification hyper-plane in the
central of the maximum margin (Pradhan, 2013; Tehrany et al.,
2013a). If the point is located over the hyper-plane it will be clas-
sified as +1, and if not, it will be classified as À1. The training points
Fig. 3. Methodological flow chart employed in this study.
336 M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343
6. that are closest to the optimal hyper-plane are called support vec-
tors. Once the decision surface is acquired, the classification of new
data can be done (Tien Bui et al., 2012a). Consider a training data-
set of instance-label pairs (xiyi) with xi 2 Rn
; yi 2 f1; À1g, and
i = 1, . . . , m. In the current situation of flood susceptibility, x is a
vector of input space that contains slope, SPI, TWI, altitude, curva-
ture, distance from the river, geology, rainfall, LULC, and soil type.
The two classes {1, À1} show flooded pixels and non-flooded
pixels. The goal of SVM is to recognize the optimal separating
hyper-plane, which can separate the two classes into flood and
non-flood {1, À1} from the training dataset. For the case of linear
separable data, a separating hyper-plane can be defined as:
yiðw Á xi þ bÞ P 1 À ni; ð5Þ
where w is a coefficient vector that defines the orientation of the
hyper-plane in the feature space, b is the offset of the hyper-plane
from the origin, and ni is the positive slack variables (Cortes and
Vapnik, 1995). The following optimization problem using Lagrang-
ian multipliers will be solved through the determination of an opti-
mal hyper-plane (Samui, 2008).
Minimize
Xn
i¼1
ai À
1
2
Xn
i¼1
Xn
j¼1
aiajyiyjðxixjÞ; ð6Þ
Subject to
Xn
i¼1
aiyj ¼ 0; 0 6 ai 6 C; ð7Þ
where ai are Lagrange multipliers, C is the penalty, and the slack
variables ni allows for penalized constraint violation. The decision
function, which will be used for the classification of new data, can
then be written as:
gðxÞ ¼ sign
Xn
i¼1
yiaixi þ b
!
ð8Þ
In the case where the hyper-plane cannot be separated by the
linear kernel function, the original input data may be shifted into
a high-dimension feature space through some nonlinear kernel
functions. The classification decision function is then written as:
gðxÞ ¼ sign
Xn
i¼1
yiajKðxi; xjÞ þ b
!
ð9Þ
where K(xi, xj) is the kernel function.
Precision of the results and successful classification are strongly
influenced by the choice of kernel type and its’ parameters
(Damaševicˇius, 2010). Linear kernel (LN), polynomial kernel (PL),
radial basis function kernel (RBF), and sigmoid kernel (SIG) are
the most demandable kernel types for SVM analysis (Pradhan,
2013). PL and RBF are called Gaussian kernels and they are the
most commonly used kernels in the literature (Marjanovic´ et al.,
2011). RBF is capable to produce efficient interpolation; however
it may have some weak points to give longer-range extrapolation.
PL compared to RBF has better extrapolation capabilities at lower-
order degrees but requires higher order degrees for good interpo-
lation (Zhu et al., 2011). The LN is considered to be a specific case
of RBF, and the SIG performs similar to the RBF for certain param-
eters (Song et al., 2011). In the case where RBF is used, LN is not
required, and also researchers believe that the precision of the
results acquired from SIG is less than using EBF (Keerthi and Lin,
2003; Lin and Lin, 2003). The LN is appropriate for linear separable
circumstances while the real world problems are not linearly
separable (Ali and Smith, 2003). Based on aforementioned litera-
ture, different kernels have different impacts on the results. Hence
current research aims to evaluate these impacts on the perfor-
mance of the proposed novel ensemble WoE and SVM method.
Each kernel requires different parameters as can be seen in
Table 1. Some researchers consider selecting the kernel parameters
as a difficult task of this method (Ali and Smith, 2003). Regulariza-
tion parameter or penalty (C) controls the tradeoff between
training errors and margin, which assists to prevent over-fitting
of the model (Marjanovic´ et al., 2011). Higher degree of training
errors can be expected in the case where value of C is small,
whereas, the precision of the results can be increased by increasing
the C value and subsequently generating a smaller margin. This
parameter needs to be defined for all the kernel types. Another
parameter which is involved in all the kernels except LN is the
kernel width or gamma (c) that controls the degree of nonlinearity
of the SVM model (Tien Bui et al., 2012a). Parameter d is the poly-
nomial degree in the kernel function for the PL kernel and r is the
bias term in the kernel function for the PL and SIG kernels.
The cross-validation was used to select the optimal kernel
parameters (Pradhan, 2013; Zhuang and Dai, 2006). To begin,
through the step size process, ranges of all parameters were de-
fined. For each set of parameters, the dataset was divided into n
folds: one fold was used for the purpose of validation and the com-
bination of the remaining n – 1-fold was considered as the training
dataset. After the cross validation method was performed using
SPSS Clementine V.14.2, n validation accuracies were attained
and the average of them was used to perform the final flood sus-
ceptibility model using each kernel type (Yao et al., 2008). In the
current study, the dataset was divided into five random groups
where every group had equal number of samples (Table 3). For
the first iteration (model 1), folds 2, 3, 4 and 5 were used as train-
ing and the fold 1, was used for validation. The same procedure
was repeated on the other four models. Finally, the area under
curve (AUC) was used to assess the performance of every combina-
tion in order to discover the best parameterization of SVM (Mu and
Nandi, 2007). The mentioned procedure was repeated for four
kernel types. A detailed cross-validation for RBF-SVM, SIG-SVM,
LN-SVM and PL-SVM is shown in Table 3.
6. Results and discussion
The basis of the WoE is constructed by W+
and WÀ
which the
weight contrast (C) can be measured by these two factors. Positive
correlation between the class of each conditioning factor and
flooding is shown by positive C, while the negative correlation is
shown by negative. The ratio of C to standard deviation is called
studentized value of C or C/S(C), which represents the final weight
for each class of conditioning factor. Standard deviation of the con-
trast is shown by S(C) and furthermore, two parameters of S2
W+
and S2
WÀ
are denoting the variances of W+
and WÀ
. As it has been
explained in the previous section all these parameters were calcu-
lated for each conditioning factor which are listed in Table 2. This
table represents the relationship between the classes of each con-
ditioning factors and flood occurrence.
Based on the value of C/S(C), it can be seen that the slope with the
range of 2.46–3.94 had a maximum weight among the other classes.
Therefore, the area which had the slope range of 2.46–3.94 repre-
sented maximum susceptibility with reference to flooding in the
Table 1
Kernel types and their required parameters. Source: Tien Bui et al. (2012a), Yao et al.
(2008).
Kernel Equation Kernel parameters
RBF Kðxi; xjÞ ¼ expðÀcxi À x2
j Þ c
LN Kðxi; xjÞ ¼ xT
i xj –
PL Kðxi; xjÞ ¼ ðÀcxT
i x þ 1Þ
d c, d
SIG Kðxi; xjÞ ¼ TanhðÀcxT
i x þ 1Þ
d c
M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343 337
8. catchment. The first class of SPI (0.00–0.01) had the highest value of
C/S(C) which was 3.28 while the class 9.98–10.61 acquired lowest
weight of À1.95. In the case of TWI, the range in between 21.80
and 34.13 had the highest weight, demonstrating high flood suscep-
tibility at this range of TWI. In the case of altitude, the highest weight
(7.80) was for the range of 0–9.61 (m), and the lowest weight (À2.70)
was for the range of 38.04–57.00 (m). The flat curvature had the
highest value of C/S(C), indicating high flood susceptibility in flat
areas. This can be proven by the natural characteristic of the flood
that mostly happens in the flat areas. The distance from the river
from 0 to 1745.76 showed positive correlation with flood occur-
rence, while the areas more than 1745.76 m far from the river repre-
sented the negative influence in flooding. These results indicated
that the flooding mostly occurs near to the bank of the river and
occasionally far from the river. Among the different lithology types,
the class of clay, silt, sand, peat with minor gravel had the highest C/
S(C) value of 5.38. This lithology represented highest flood suscepti-
bility in the study area. In the case of rainfall, the highest C/S(C) val-
ues were for classes of 2875 and 3125 (mm). In the case of LULC,
wetlands followed by paddy field had the highest value of C/S(C),
showing maximum flood susceptibility. Forest acquired weight of
À3.33 which showed the negative correlation with flood occurrence,
as the vegetated areas can reduce the runoff and therefore decrease
the flooding. In soil type, the highest weight of 6.49 was achieved for
the Tropofibrists type and the least was acquired for the Haplorth-
ods-Endoaquods. Finally, the derived C/S(C) was normalized and
subsequently reclassified each conditioning factor in order to use
them as an input for SVM.
Subsequently, cross validation was performed for four models
of RBF-SVM, SIG-SVM, LN-SVM and PL-SVM. Table 3 lists the
SVM Kernel types and their parameters acquired through cross
validation process. The efficiency and precision of each model
was assessed using AUC. Success rate and prediction rate were cal-
culated for each try of modeling and the parameters yielding the
highest accuracy for each model were then chosen as the optimal
parameters.
Table 2 (continued)
Factor Class W+
WÀ
C S2
(W+
) S2
(WÀ
) S(C) C/S(C)
Kandiudults-Hapludults À0.1373 0.0316 À0.1689 0.0526 0.0111 0.2525 À0.6688
Tropofibrists 1.1828 À0.2243 1.4071 0.0345 0.0125 0.2168 6.4914
Haplorthods-Endoaquods À0.6840 0.0282 À0.7122 0.3333 0.0094 0.5855 À1.2165
Tropohemists-Troposaprist 1.1226 À0.0730 1.1956 0.0909 0.0102 0.3180 3.7599
Paleudults-Hapludults 0 0.1521 0 0 0.0092 0 0
Paleudults-Paleudalfs 0.7052 À0.2572 0.9624 0.0250 0.0145 0.1987 4.8427
Urban land 0 0.0115 0 0 0.0092 0 0
Tropopsamments-Fluvaquents 0.0126 À0.0009 0.0134 0.1429 0.0098 0.3907 0.0343
Table 3
The optimal RBF-SVM, SIG-SVM, LN-SVM and PL-SVM parameters for each testing model and their accuracies.
WoE & RBF-SVM Model Training fold Testing fold c C Success rate% Prediction rate%
1 2, 3, 4, 5 1 0.1 10 95 94
2 1, 3, 4, 5 2 0.1 15 95.40 95.02
3 1, 2, 4, 5 3 0.3 10 95.95 95.52
4 1, 2, 3, 5 4 0.2 10 95.57 95.68
5 1, 2, 3, 4 5 0.3 20 96.23 95.45
0.2 13
WoE & SIG-SVM Model Training fold Testing fold c C Success rate% Prediction rate%
1 2, 3, 4, 5 1 1 10 80.57 82.67
2 1, 3, 4, 5 2 1 20 80.57 82.67
3 1, 2, 4, 5 3 2.5 20 85.30 84.80
4 1, 2, 3, 5 4 1.5 10 83.91 80.83
5 1, 2, 3, 4 5 3 10 85.36 85.72
1.8 14
WoE & LN-SVM Model Training fold Testing fold C Success rate% Prediction rate%
1 2, 3, 4, 5 1 10 95.22 93.76
2 1, 3, 4, 5 2 11 94.27 93.73
3 1, 2, 4, 5 3 14 94.13 93.70
4 1, 2, 3, 5 4 15 94.09 93.71
5 1, 2, 3, 4 5 15 94.02 93.66
13
WoE & PL-SVM Model Training fold Testing fold c d C Success rate% Prediction rate%
1 2, 3, 4, 5 1 1 3 10 95.42 90.88
2 1, 3, 4, 5 2 1 3 20 95.86 87.20
3 1, 2, 4, 5 3 5 5 20 91.64 86.86
4 1, 2, 3, 5 4 5 1 10 93.66 94.02
5 1, 2, 3, 4 5 10 7 10 92.58 93.25
4.4 3.8 14
The bold texts show the mean values.
Table 4
The results of ROC for each derived model.
Method c C d Success rate Prediction rate
WoE & RBF-SVM 0.2 13 – 96.48 95.67
WoE & SIG-SVM 2 14 – 84.67 84.28
WoE & LN-SVM – 13 – 93.65 94.18
WoE & PL-SVM 4.5 14 4 93.77 94.17
Standalone RBF-SVM 0.2 13 – 86.47 81.27
WoE – – – 71.91 66.09
M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343 339
9. Average of optimal parameters resulted from each kernel type
was used to perform final RBF-SVM, SIG-SVM, LN-SVM and
PL-SVM modeling. Table 4 shows the parameters which were used
to perform final modeling. In order to assess the efficiency of the
novel ensemble WoE and SVM method, each method was applied
individually for comparison purpose. RBF was selected to be used
in the individual SVM modeling as it is the most popular SVM
kernel type (Pourghasemi et al., 2013b). The results were then
converted into a GIS and probability index was measured. Fig. 4
shows the probability maps derived from each WoE and SVM
ensemble method.
Visually all the probability maps appeared similar except
ensemble WoE and SIG-SVM. This can be due to the less efficiency
of the SIG kernel to detect the flood probable areas precisely.
However the comparison and judgment cannot be done through
the visual assessment for which a proper validation method is
needed. For validation, the resultant probability maps were evalu-
ated using the AUC method and both success rate and prediction
rate curves were measured (Table 4). The AUC is considered as
one of the most popular methods to assess the efficiency of the
generated method which produce both success and prediction
rates (Pradhan and Lee, 2010). To calculate the relative ranks for
each prediction pattern, the measured probability index was sorted
in descending order. Consequently, the cell values were partitioned
into 100 classes on the vertical axis (y), with accumulated 1% inter-
vals in the horizontal axis (x). The existence of the flood locations
(training and testing) in each interval was assessed and the resul-
tant success and prediction rates were measured.
For visual interpretation of flood susceptible locations, the
probability map was needed to be classified into different zones.
For classification, various methods exist in the literature such as
equal interval, quantile, standard deviation, and natural break
Fig. 4. Flood probability maps derived from ensemble: (a) WoE and RBF-SVM, (b) WoE and SIG-SVM, (c) WoE and LN-SVM, and (d) WoE and PL-SVM.
Fig. 5. Flood susceptibility maps derived from ensemble: (a) WoE and RBF-SVM, (b) WoE and SIG-SVM, (c) WoE and LN-SVM, and (d) WoE and PL-SVM.
340 M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343
10. (Tehrany et al., 2013b). All these classification methods were ap-
plied and the results of all methods were examined. The best out-
put was achieved using quantile method, while other methods
classified the large part of the area as a highly susceptible zone
with high degree of exaggeration. Finally, flood susceptibility maps
were obtained and the study area was divided into five classes of
flood susceptibility using quantile classification scheme: very
low, low, moderate, high and very high (Pradhan, 2010). The flood
susceptibility maps are shown in Fig. 5.
It can be seen that the high susceptible zones are mostly located
in the East and South-East part of the catchment. A large part of the
catchment was classified as susceptible zone using the ensemble
WoE and SIG-SVM method. Based on the validation results that
can be seen in Table 4, ensemble WoE and RBF-SVM method
achieved 96.48% and 95.67% for success and prediction rate respec-
tively. The efficiency of the proposed ensemble method was proven
by the higher prediction accuracy than the individual WoE and
SVM methods. Results indicated a significant difference of around
29% can be seen between the derived prediction accuracies from
ensemble WoE and RBF-SVM method and individual WoE method.
Also the amount of difference between the measured prediction
accuracies from ensemble WoE and RBF-SVM method and individ-
ual RBF-SVM was 14%. The success rate and prediction rate of WoE
were 71.91% and 66.09% respectively, and success rate and predic-
tion rate of RBF-SVM were 86.47% and 81.27% respectively. The
achievement of this research can assist the hydrological studies
to detect the flood prone areas more accurately. The success rate
and prediction rate results were produced using the training and
testing flood locations respectively which can be seen in Fig. 6.
The highest accuracies achieved by the RBF and PL kernel types,
while the lowest accuracy was acquired using SIG kernel. Results
proved how the choice of kernel type could produce different out-
comes for the similar dataset. Through this analysis the efficiency
of each kernel type was measured and the strength of one model
over another was evaluated.
7. Conclusions
Flood is one of the most damaging catastrophic phenomena in
the tropical environments. Therefore flood susceptibility mapping
is vital for catchment management in order to have proper and
sustainable development. Over the last decades, the flood
susceptibility evaluation has been one of the hot topics in the
literature, because this evaluation is a tough and nonlinear prob-
lem. Various methods were proposed and examined by many
researchers in the literature; however each method has some weak
points which are required to be solved. The need of having accurate
and reliable method to detect the flood prone areas prompted the
authors to examine the integration of individual methods in order
to enhance the capability of the individual methods. The flooding
event which occurred on 27th November 2009 caused serious
damages in the state of Terengganu, Malaysia. Therefore Tereng-
ganu has been selected as study area for flood susceptibility map-
ping using the proposed novel ensemble method. In the current
research WoE was integrated with SVM in order to enhance the
performance of each individual method. Furthermore, the goal of
this study was to increase the accuracy of the flood susceptibility
mapping to assist having proper management over the prone
areas. Various SVM kernel types were utilized to map the flood
susceptible areas from a number of conditioning factors. An
ensemble WoE and SVM model was applied by using four kernel
types of RBF, SIG, LN and PL. The results were compared with the
flood susceptibility maps derived from individual WoE and SVM
methods. Comparison of the proposed methods gave the advanta-
ges of ensemble WoE and RBF-SVM which produced 96.48% and
95.67% for success and prediction rate respectively. The current
research showed the advantage in terms of selection of proper ker-
nel type and introduction of the novel ensemble WoE and SVM
method. RBF showed higher accuracy compared to other kernels
types. The information derived from current research can assist
governments, planners, and researchers to perform proper actions
in order to prevent and mitigate this disaster in future. Further-
more, suitable and safe locations can be detected to carry out
development. As a conclusion, the outputs attained from the pro-
posed ensemble method should be evaluated attentively by
hydrologists. Due to its high prediction power, it is possible to
acquire some misleading results. Furthermore, the precision of
the results could be enhanced if the quality of the data increases.
Acknowledgements
Thanks to two anonymous reviewers and editorial comment by
Professor Geoff Syme for their valuable comments in the earlier
version which helped us to improve the quality of the manuscript.
Fig. 6. Illustration of cumulative frequency diagram showing cumulative flood occurrence (%; y-axis) occurring in flood probability index rank (%; x-axis); (a) success rate and
(b) prediction rate.
M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343 341
11. References
Ali, S., Smith, K.A., 2003. Automatic parameter selection for polynomial kernel. In:
IEEE International Conference on Information Reuse and Integration, Australia,
pp. 243–249.
Arbain, S., Wibowo, A., 2012. Time series methods for water level forecasting of
Dungun river in Terengganu Malaysia. Int. J. Eng. Sci. Technol. 4, 1803–1811.
Bajabaa, S., Masoud, M., Al-Amri, N., 2013. Flash flood hazard mapping based on
quantitative hydrology, geomorphology and GIS techniques (case study of Wadi
Al Lith, Saudi Arabia). Arab. J. Geosci., 1–13. http://dx.doi.org/10.1007/s12517-
013-0941-2.
Bates, P.D., 2004. Remote sensing and flood inundation modelling. Hydrol. Process.
18, 2593–2597.
Bates, P.D., 2012. Integrating remote sensing data with flood inundation models:
how far have we got? Hydrol. Process. 26, 2515–2521.
Bonham-Carter, G.F., 1991. Integration of geoscientific data using GIS. In: Maguire,
D.J., Goodchild, M.F., Rhind, D.W. (Eds.), Geographic Information Systems:
Principles and Applications. Longman, London, pp. 171–184.
Bonham-Carter, G.F., 1994. Geographic Information Systems for Geoscientists:
Modelling with GIS. Pergamon, Ottawa, pp. 398.
Bonham-Carter, G.F., 2002. Geographic Information Systems for Geoscientist:
Modeling with GIS. Pergamon, New York, pp. 302–334.
Chau, K., Lee, J., 1991. Mathematical modelling of Shing Mun river network. Adv
Water Resour. 14, 106–112.
Chau, K., Wu, C., Li, Y., 2005. Comparison of several flood forecasting models in
Yangtze River. J. Hydrol. Eng. 10, 485–491.
Chowdary, V., Chakraborthy, D., Jeyaram, A., Murthy, Y.K., Sharma, J., Dadhwal, V.,
2013. Multi-Criteria decision making approach for watershed prioritization
using analytic hierarchy process technique and GIS. Water Resour. Manage. 27,
1–17.
Cortes, C., Vapnik, V., 1995. Support-vector networks. Mach. Learn. 20, 273–297.
Dahal, R.K., Hasegawa, S., Nonomura, A., Yamanaka, M., Masuda, T., Nishino, K.,
2008. GIS-based weights-of-evidence modelling of rainfall-induced landslides
in small catchments for landslide susceptibility mapping. Environ. Geol. 54,
311–324.
Damaševicˇius, R., 2010. Optimization of SVM parameters for recognition of
regulatory DNA sequences. Top 18, 339–353.
Dawson, C.W., Abrahart, R., Shamseldin, A., Wilby, R., 2006. Flood estimation at
ungauged sites using artificial neural networks. J. Hydrol. 319, 391–409.
Elshorbagy, A., Corzo, G., Srinivasulu, S., Solomatine, D., 2010a. Experimental
investigation of the predictive capabilities of data driven modeling techniques
in hydrology-Part 1: concepts and methodology. Hydrol. Earth Syst. Sci. 14,
1931–1941.
Elshorbagy, A., Corzo, G., Srinivasulu, S., Solomatine, D., 2010b. Experimental
investigation of the predictive capabilities of data driven modeling techniques
in hydrology-Part 2: application. Hydrol. Earth Syst. Sci. 14, 1943–1961.
Fenicia, F., Kavetski, D., Savenije, H.H., Clark, M.P., Schoups, G., Pfister, L., Freer, J.,
2013. Catchment properties, function, and conceptual model representation: is
there a correspondence? Hydrol. Process.. http://dx.doi.org/10.1002/hyp.9726.
Glenn, E., Morino, K., Nagler, P., Murray, R., Pearlstein, S., Hultine, K., 2012. Roles of
saltcedar (Tamarix spp.) and capillary rise in salinizing a non-flooding terrace on
a flow-regulated desert river. J. Arid Environ. 79, 56–65.
Haq, M., Akhtar, M., Muhammad, S., Paras, S., Rahmatullah, J., 2012. Techniques of
remote sensing and GIS for flood monitoring and damage assessment: a case
study of Sindh province, Pakistan. Egypt. J. Rem. Sens. Space Sci. 15, 135–141.
Jebur, M.N., Mohd Shafri, H.Z., Pradhan, B., Tehrany, M.S., 2013. Per-pixel and
object-oriented classification methods for mapping urban land cover extraction
using SPOT 5 imagery. Geocarto Int., 1–15. http://dx.doi.org/10.1080/
10106049.2013.848944.
Kanevskij, M., Pozdnoukhov, A., Timonin, V., 2009. Machine Learning for Spatial
Environmental Data: Theory, Applications and Software. Epfl Press, Lausanne,
pp. 47–52.
Keerthi, S.S., Lin, C.J., 2003. Asymptotic behaviors of support vector machines with
Gaussian kernel. Neural Comput. 15, 1667–1689.
Khu, S.T., Liong, S.Y., Babovic, V., Madsen, H., Muttil, N., 2001. Genetic programming
and its application in real-time runoff forecasting1. J. Am. Water Resour. Assoc.
37, 439–451.
Kia, M.B., Pirasteh, S., Pradhan, B., Mahmud, A.R., Sulaiman, W.N.A., Moradi, A., 2012.
An artificial neural network model for flood simulation using GIS: Johor River
Basin, Malaysia. Environ. Earth Sci. 67, 251–264.
Kim, G., Barros, A.P., 2001. Quantitative flood forecasting using multisensor data
and neural networks. J. Hydrol. 246, 45–62.
Lamovec, P., Veljanovski, T., Mikoš, M., Oštir, K., 2013. Detecting flooded areas with
machine learning techniques: case study of the Selška Sora river flash flood in
September 2007. J. Appl. Rem. Sens. 7, 073564–073564.
Lee, M.J., Kang, J.E., Jeon, S., 2012. Application of frequency ratio model and
validation for predictive flooded area susceptibility mapping using GIS. In:
Geoscience and Remote Sensing Symposium (IGARSS), Munich. pp. 895–898.
Liao, X., Carin, L., 2009. Migratory logistic regression for learning concept drift
between two data sets with application to UXO sensing. IEEE Trans. Geosci.
Rem. Sens. 47, 1454–1466.
Lin, H.T., Lin, C.J., 2003. A Study on Sigmoid Kernels for SVM and the Training of non-
PSD Kernels by SMO-type Methods. Technical Report, Department of Computer
Science and Information Engineering, National Taiwan University, pp. 32.
<http://www.csie.ntu.edu.tw/~cjlin/papers/tanh.pdf.
Maier, H.R., Jain, A., Dandy, G.C., Sudheer, K.P., 2010. Methods used for the
development of neural networks for the prediction of water resource variables
in river systems: current status and future directions. Environ. Modell. Softw.
25, 891–909.
Manandhar, B., 2010. Flood Plain Analysis and Risk Assessment of Lothar Khola, MSc
Thesis, Tribhuvan University, Phokara, Nepal, pp. 64.
Marjanovic´, M., Kovacˇevic´, M., Bajat, B., Vozˇenílek, V., 2011. Landslide susceptibility
assessment using SVM machine learning algorithm. Eng. Geol. 123, 225–234.
Mohammady, M., Pourghasemi, H.R., Pradhan, B., 2012. Landslide susceptibility
mapping at Golestan Province, Iran: a comparison between frequency ratio,
Dempster-Shafer, and weights-of-evidence models. J. Asian Earth Sci. 61, 221–
236.
Mu, T., Nandi, A.K., 2007. Breast cancer detection from FNA using SVM with
different parameter tuning systems and SOM–RBF classifier. J. Franklin Inst.
344, 285–311.
Oh, H.J., Pradhan, B., 2011. Application of a neuro-fuzzy model to landslide-
susceptibility mapping for shallow landslides in a tropical hilly area. Comput.
Geosci. 37, 1264–1276.
Ohlmacher, G.C., Davis, J.C., 2003. Using multiple logistic regression and GIS
technology to predict landslide hazard in northeast Kansas, USA. Eng. Geol. 69,
331–343.
Pourghasemi, H., Pradhan, B., Gokceoglu, C., Moezzi, K.D., 2013a. A comparative
assessment of prediction capabilities of Dempster-Shafer and weights-of-
evidence models in landslide susceptibility mapping using GIS. Geomatics
Nat. Hazards Risk 4, 93–118.
Pourghasemi, H.R., Jirandeh, A.G., Pradhan, B., Xu, C., Gokceoglu, C., 2013b. Landslide
susceptibility mapping using support vector machine and GIS at the Golestan
Province, Iran. J. Earth Syst. Sci. 122, 349–369.
Pourghasemi, H.R., Pradhan, B., Gokceoglu, C., Mohammadi, M., Moradi, H.R., 2013c.
Application of weights-of-evidence and certainty factor models and their
comparison in landslide susceptibility mapping at Haraz watershed, Iran. Arab.
J. Geosci. 6, 2351–2365.
Pradhan, B., 2010. Flood susceptible mapping and risk area delineation using
logistic regression, GIS and remote sensing. J. Spatial Hydrol. 9, 1–18.
Pradhan, B., 2011. Use of GIS-based fuzzy logic relations and its cross application to
produce landslide susceptibility maps in three test areas in Malaysia. Environ.
Earth Sci. 63, 329–349.
Pradhan, B., 2013. A comparative study on the predictive ability of the decision tree,
support vector machine and neuro-fuzzy models in landslide susceptibility
mapping using GIS. Comput. Geosci. 51, 350–365.
Pradhan, B., Buchroithner, M.F., 2010. Comparison and validation of landslide
susceptibility maps using an artificial neural network model for three test areas
in Malaysia. Environ. Eng. Geosci. 16, 107–126.
Pradhan, B., Lee, S., 2010. Landslide susceptibility assessment and factor effect
analysis: backpropagation artificial neural networks and their comparison with
frequency ratio and bivariate logistic regression modelling. Environ. Modell.
Softw. 25, 747–759.
Pradhan, B., Shafiee, M., 2009. Flood hazrad assessment for cloud prone rainy areas
in a typical tropical environment. Disaster Adv. 2, 7–15.
Pradhan, B., Youssef, A., 2011. A 100-year maximum flood susceptibility mapping
using integrated hydrological and hydrodynamic models: Kelantan River
Corridor, Malaysia. J. Flood Risk Manage. 4, 189–202.
Pradhan, B., Oh, H.J., Buchroithner, M., 2010a. Weights-of-evidence model applied
to landslide susceptibility mapping in a tropical hilly area. Geomatics Nat.
Hazards Risk. 1, 199–223.
Pradhan, B., Youssef, A.M., Varathrajoo, R., 2010b. Approaches for delineating
landslide hazard areas using different training sites in an advanced artificial
neural network model. Geospatial. Inf. Sci. 13, 93–102.
Pradhan, B., Mansor, S., Pirasteh, S., Buchroithner, M.F., 2011. Landslide hazard and
risk analyses at a landslide prone catchment area using statistical based
geospatial model. Int. J. Remote. Sens. 32, 4075–4087.
Pradhan, B., Hagemann, U., Shafapour Tehrany, M., Prechtel, N., 2014. An easy to use
ArcMap based texture analysis program for extraction of flooded areas from
TerraSAR-X satellite image. Comput. Geosci. 63, 34–43.
Refsgaard, J.C., 1997. Parameterisation, calibration and validation of distributed
hydrological models. J. Hydrol. 198, 69–97.
Regmi, N.R., Giardino, J.R., Vitek, J.D., 2010. Modeling susceptibility to landslides
using the weight of evidence approach: Western Colorado, USA.
Geomorphology 115, 172–187.
Regmi, A.D., Devkota, K.C., Yoshida, K., Pradhan, B., Pourghasemi, H.R., Kumamoto,
T., Akgun, A., 2013. Application of frequency ratio, statistical index, and
weights-of-evidence models and their comparison in landslide susceptibility
mapping in Central Nepal Himalaya. Arab. J. Geosci.. http://dx.doi.org/10.1007/
s12517-012-0807-z.
Samui, P., 2008. Slope stability analysis: a support vector machine approach.
Environ. Geology. 56, 255–267.
Smith, K., Ward, R., 1998. Floods: Physical Processes and Human Impacts. Wiley,
Chichester, pp. 382.
Song, S., Zhan, Z., Long, Z., Zhang, J., Yao, L., 2011. Comparative study of SVM
methods combined with voxel selection for object category classification on
fMRI data. PLoS ONE 6, e17191.
Tehrany, M.S., Pradhan, B., Jebu, M.N., 2013a. A comparative assessment between
object and pixel-based classification approaches for land use/land cover
mapping using SPOT 5 imagery. Geocarto Int., 1–19. http://dx.doi.org/
10.1080/10106049.2013.768300.
342 M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343
12. Tehrany, M.S., Pradhan, B., Jebur, M.N., 2013b. Spatial prediction of flood susceptible
areas using rule based decision tree (DT) and a novel ensemble bivariate and
multivariate statistical models in GIS. J. Hydrol. 504, 69–79.
Tien Bui, D., Pradhan, B., Lofman, O., Revhaug, I., 2012a. Landslide susceptibility
assessment in Vietnam using support vector machines, decision tree, and Naive
Bayes Models. Math. Probl. Eng. 26. http://dx.doi.org/10.1155/2012/974638.
Tien Bui, D., Pradhan, B., Lofman, O., Revhaug, I., Dick, O.B., 2012b. Landslide
susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive
neuro-fuzzy inference system and GIS. Comput. Geosci. 45, 199–211.
Tiwari, M.K., Chatterjee, C., 2010. Uncertainty assessment and ensemble flood
forecasting using bootstrap based artificial neural networks (BANNs). J. Hydrol.
382, 20–33.
Toth, E., Brath, A., Montanari, A., 2000. Comparison of short-term rainfall prediction
models for real-time flood forecasting. J. Hydrol. 239, 132–147.
Tunusluoglu, M., Gokceoglu, C., Nefeslioglu, H., Sonmez, H., 2008. Extraction of
potential debris source areas by logistic regression technique: a case study from
Barla, Besparmak and Kapi mountains (NW Taurids, Turkey). Environ. Geology.
54, 9–22.
Varoonchotikul, P., 2003. Flood Forecasting Using Artificial Neural Networks.
Balkema, Rotterdam, The Netherlands, pp. 101.
Vorogushyn, S., Lindenschmidt, K.E., Kreibich, H., Apel, H., Merz, B., 2012. Analysis of
a detention basin impact on dike failure probabilities and flood risk for a
channel-dike-floodplain system along the river Elbe, Germany. J. Hydrol. 436,
120–131.
Wan, S., Lei, T.C., 2009. A knowledge-based decision support system to analyze the
debris-flow problems at Chen-Yu-Lan River, Taiwan. Knowl.-Based Syst. 22,
580–588.
Wan, S., Lei, T., Chou, T.Y., 2010a. An enhanced supervised spatial decision support
system of image classification: consideration on the ancillary information of
paddy rice area. Int. J. Geogr. Inf. Sci. 24, 623–642.
Wan, S., Lei, T., Chou, T.Y., 2010b. A novel data mining technique of analysis and
classification for landslide problems. Nat. Hazards 52, 211–230.
Wan, S., Lei, T.C., Chou, T.Y., 2012. A landslide expert system: image classification
through integration of data mining approaches for multi-category analysis. Int.
J. Geogr. Inf. Sci. 26, 747–770.
Wanders, N., Bierkens, M.F., de Jong, S.M., de Roo, A., Karssenberg, D., 2013. The
benefits of using remotely sensed soil moisture in parameter identification of
large-scale hydrological models. In: EGU General Assembly Conference
Abstracts, vol. 15, pp. 10215.
Xu, C., Xu, X., Dai, F., Xiao, J., Tan, X., Yuan, R., 2012. Landslide hazard mapping using
GIS and weight of evidence model in Qingshui river watershed of 2008
Wenchuan earthquake struck region. J. Earth Sci. 23, 97–120.
Yalcin, A., 2008. GIS-based landslide susceptibility mapping using analytical
hierarchy process and bivariate statistics in Ardesen (Turkey): comparisons of
results and confirmations. Catena 72, 1–12.
Yao, X., Tham, L., Dai, F., 2008. Landslide susceptibility mapping based on support
vector machine: a case study on natural slopes of Hong Kong, China.
Geomorphology 101, 572–582.
Zaleha, K., Roswati, M.A., Iwasaki, N., 2006. Distribution of some species of
harpacticoid copepods in east coast of Peninsular Malaysia. Coast. Mar. Sci. 30,
140–145.
Zhu, X., Zhang, S., Jin, Z., Zhang, Z., Xu, Z., 2011. Missing value estimation for mixed-
attribute data sets. IEEE Trans. Knowl. Data Eng. 23, 110–121.
Zhuang, L., Dai, H., 2006. Parameter optimization of kernel-based one-class
classifier on imbalance learning. J. Comput. 1, 32–40.
Zou, Q., Zhou, J., Zhou, C., Song, L., Guo, J., 2013. Comprehensive flood risk
assessment based on set pair analysis-variable fuzzy sets model and fuzzy AHP.
Stoch. Environ. Res. Risk Assess. 27, 525–546.
M.S. Tehrany et al. / Journal of Hydrology 512 (2014) 332–343 343