Climate zones of Bangladesh are identified by using mathematical methodology of cluster analysis. Monthly data from 34 climate stations for rainfall from 1991 to 2013 are used in the cluster analysis. Five Agglomerative Hierarchical clustering measures based on mostly used six proximity measures are chosen to perform the regionalization. Besides three popular measures: K-means, Fuzzy and density based clustering techniques are applied initially to decide the most suitable method for the identification of homogeneous region. Stability of the cluster is also tested based on nine validity indices. It is decided that Ward method based on Euclidean distance, K-means, Fuzzy are the most likely to yield acceptable results in this particular case, as is often the case in climatological research. In this analysis we found seven different climate zones in Bangladesh.
As basic data, the reliability of precipitation data makes a significant impact on many results of environmental applications. In order to obtain spatially distributed precipitation data, measured points are interpolated. There are many spatial interpolation schemes, but none of them can perform best in all cases. So criteria of precision evaluation are established. This study aims to find an optimal interpolation scheme for rainfall in Ningxia. The study area is located in northwest China. Meteorological stations distribute at a low density here. Six interpolation methods have been tested after exploring data. Cross-validation was used as the criterion to evaluate the accuracy of various methods. The best results were obtained by cokriging with elevation as the second variable, while the inverse distance weighting (IDW) preform worst. Three types of model in cokriging were compared, and Gaussian model is the best.
Modeling monthly average daily diffuse radiation for dhaka, bangladesheSAT Journals
Abstract The diffuse part of solar radiation is one of the elements necessary for the design and evaluation of energy production of a solar system. However, in most cases, when radiometric measurements are made, only global radiation is available. To remedy this situation, this paper presents a model of the scattered radiation measured on a horizontal surface for the capital city of Bangladesh. The correlation established for the chosen site was compared to the work of Liu anf Jordan, Page, Collares Pereira and Rabl, Modi and Sukhatme and Gupta el al. Keywords: Diffuse Radiation, Clearness Index, Regression analysis, Horizontal Radiation.
Integration Method of Local-global SVR and Parallel Time Variant PSO in Water...TELKOMNIKA JOURNAL
Flood is one type of natural disaster that can’t be predicted, one of the main causes of flooding is the continuous rain (natural events). In terms of meteorology, the cause of flood is come from high rainfall and the high tide of the sea, resulting in increased the water level. Rainfall and water level analysis in each period, still not able to solve the existing problems. Therefore in this study, the proposed integration method of Parallel Time Variant PSO (PTVPSO) and Local-Global Support Vector Regression (SVR) is used to forecast water level. Implementation in this study combine SVR as regression method for forecast the water level, Local-Global concept take the role for the minimization for the computing time, while PTVPSO used in the SVR to obtain maximum performance and higher accurate result by optimize the parameters of SVR. Hopefully this system will be able to solve the existing problems for flood early warning system due to erratic weather.
Precipitation’s Level Prediction Based on Tree Augmented Naïve Bayes modelNooria Sukmaningtyas
At present, most of the precipitation’s level predictions use the laws of nature to build the
mathematical model which contains one or more series level to carry out the numerical simulation, as thus
to analyze the causes and consequences of the evolution. Bayesian model is one kind of the foregoing
said. In the Bayesian classification model, Naive Bayes model is known for its stability and easy to
operate, but the established precedent assumption tends to be inadmissible. So here the article proposed
a new precipitation’s level prediction model based on the tree Augmented Naïve Bayes(we called TAN
model for short hereafter), which improve the original Naïve Bayes model defects and increase the
association between the leaf nodes on the basis of the original model. And we use the Dongtai station,
Jiangsu province meteorology data to test the new precipitation model. The results show that the new
precipitation prediction model’s performance is superior to the traditional Naive Bayes model.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Calculation of solar radiation by using regression methodsmehmet şahin
Abstract. In this study, solar radiation was estimated at 53 location over Turkey with
varying climatic conditions using the Linear, Ridge, Lasso, Smoother, Partial least, KNN
and Gaussian process regression methods. The data of 2002 and 2003 years were used to
obtain regression coefficients of relevant methods. The coefficients were obtained based on
the input parameters. Input parameters were month, altitude, latitude, longitude and landsurface
temperature (LST).The values for LST were obtained from the data of the National
Oceanic and Atmospheric Administration Advanced Very High Resolution Radiometer
(NOAA-AVHRR) satellite. Solar radiation was calculated using obtained coefficients in
regression methods for 2004 year. The results were compared statistically. The most
successful method was Gaussian process regression method. The most unsuccessful method
was lasso regression method. While means bias error (MBE) value of Gaussian process
regression method was 0,274 MJ/m2, root mean square error (RMSE) value of method was
calculated as 2,260 MJ/m2. The correlation coefficient of related method was calculated as
0,941. Statistical results are consistent with the literature. Used the Gaussian process
regression method is recommended for other studies.
ANNUAL PRECIPITATION IN SOUTHERN OF MADAGASCAR: MODELING USING HIGH ORDER FUZ...ijfls
The objective of this research is to find the best conventional high order fuzzy time series model for annual precipitation series in southern Madagascar. This work consists on finding the hyper parameters (number of partition of the universe of discourse and model order) to obtain the best conventional high
order fuzzy time series model for our experimental data. In previous works, entitled spatial and temporal variability of precipitation in southern Madagascar, we subdivided the study area between 22 ° S to 30 ° S latitude and 43 ° Eto 48 ° E longitude into four zones of homogeneous precipitation. In this article, we seek to model annual precipitation data representative of one of these four areas. These data were taken between 1979 and 2017. Our approach consists on subdividing the data: data obtained from 1979 to 2001 (60%) for the training and data from 2002 to 2017 (40%) to test the model. To determine the number of partitions and model order, we fix first the number of partitions to 10 and then to 15, 20, 25,30, 35, 40, 45 and 50.For each of these values, we vary the model order from 1 to 10.Thenwe locate the model order which corresponds to the minimum of the average curve between the Mean Absolute Errors (MAE) between the training data and the test data. Thus, the orders of the candidate model are 2, 3, 5, and 6.The next step is to fix the model order with the previous values and vary the number of partitions from 3 to 50.For each couple of hyper parameter of the model (number of partitions, model order), we locate the value of number of partitions corresponding to the minimum of the average curve between the absolute mean of the errors or MAE (Mean Absolute Error) between the train and test data. We obtain the hyper-parameter pairs (37, 2), (20, 3), (35, 5) and (35, 6).The first pair gives the lowest Mean Absolute Error. As a final result, we obtain the best high order fuzzy time series model with hyperparameters umber of partition equals thirty seven and of order equals two for annual precipitation in Southern of Madagascar.
Classification accuracy analyses using Shannon’s EntropyIJERA Editor
There are many methods for determining the Classification Accuracy. In this paper significance of Entropy of
training signatures in Classification has been shown. Entropy of training signatures of the raw digital image
represents the heterogeneity of the brightness values of the pixels in different bands. This implies that an image
comprising a homogeneous lu/lc category will be associated with nearly the same reflectance values that would
result in the occurrence of a very low entropy value. On the other hand an image characterized by the
occurrence of diverse lu/lc categories will consist of largely differing reflectance values due to which the
entropy of such image would be relatively high. This concept leads to analyses of classification accuracy.
Although Entropy has been used many times in RS and GIS but its use in determination of classification
accuracy is new approach.
As basic data, the reliability of precipitation data makes a significant impact on many results of environmental applications. In order to obtain spatially distributed precipitation data, measured points are interpolated. There are many spatial interpolation schemes, but none of them can perform best in all cases. So criteria of precision evaluation are established. This study aims to find an optimal interpolation scheme for rainfall in Ningxia. The study area is located in northwest China. Meteorological stations distribute at a low density here. Six interpolation methods have been tested after exploring data. Cross-validation was used as the criterion to evaluate the accuracy of various methods. The best results were obtained by cokriging with elevation as the second variable, while the inverse distance weighting (IDW) preform worst. Three types of model in cokriging were compared, and Gaussian model is the best.
Modeling monthly average daily diffuse radiation for dhaka, bangladesheSAT Journals
Abstract The diffuse part of solar radiation is one of the elements necessary for the design and evaluation of energy production of a solar system. However, in most cases, when radiometric measurements are made, only global radiation is available. To remedy this situation, this paper presents a model of the scattered radiation measured on a horizontal surface for the capital city of Bangladesh. The correlation established for the chosen site was compared to the work of Liu anf Jordan, Page, Collares Pereira and Rabl, Modi and Sukhatme and Gupta el al. Keywords: Diffuse Radiation, Clearness Index, Regression analysis, Horizontal Radiation.
Integration Method of Local-global SVR and Parallel Time Variant PSO in Water...TELKOMNIKA JOURNAL
Flood is one type of natural disaster that can’t be predicted, one of the main causes of flooding is the continuous rain (natural events). In terms of meteorology, the cause of flood is come from high rainfall and the high tide of the sea, resulting in increased the water level. Rainfall and water level analysis in each period, still not able to solve the existing problems. Therefore in this study, the proposed integration method of Parallel Time Variant PSO (PTVPSO) and Local-Global Support Vector Regression (SVR) is used to forecast water level. Implementation in this study combine SVR as regression method for forecast the water level, Local-Global concept take the role for the minimization for the computing time, while PTVPSO used in the SVR to obtain maximum performance and higher accurate result by optimize the parameters of SVR. Hopefully this system will be able to solve the existing problems for flood early warning system due to erratic weather.
Precipitation’s Level Prediction Based on Tree Augmented Naïve Bayes modelNooria Sukmaningtyas
At present, most of the precipitation’s level predictions use the laws of nature to build the
mathematical model which contains one or more series level to carry out the numerical simulation, as thus
to analyze the causes and consequences of the evolution. Bayesian model is one kind of the foregoing
said. In the Bayesian classification model, Naive Bayes model is known for its stability and easy to
operate, but the established precedent assumption tends to be inadmissible. So here the article proposed
a new precipitation’s level prediction model based on the tree Augmented Naïve Bayes(we called TAN
model for short hereafter), which improve the original Naïve Bayes model defects and increase the
association between the leaf nodes on the basis of the original model. And we use the Dongtai station,
Jiangsu province meteorology data to test the new precipitation model. The results show that the new
precipitation prediction model’s performance is superior to the traditional Naive Bayes model.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Calculation of solar radiation by using regression methodsmehmet şahin
Abstract. In this study, solar radiation was estimated at 53 location over Turkey with
varying climatic conditions using the Linear, Ridge, Lasso, Smoother, Partial least, KNN
and Gaussian process regression methods. The data of 2002 and 2003 years were used to
obtain regression coefficients of relevant methods. The coefficients were obtained based on
the input parameters. Input parameters were month, altitude, latitude, longitude and landsurface
temperature (LST).The values for LST were obtained from the data of the National
Oceanic and Atmospheric Administration Advanced Very High Resolution Radiometer
(NOAA-AVHRR) satellite. Solar radiation was calculated using obtained coefficients in
regression methods for 2004 year. The results were compared statistically. The most
successful method was Gaussian process regression method. The most unsuccessful method
was lasso regression method. While means bias error (MBE) value of Gaussian process
regression method was 0,274 MJ/m2, root mean square error (RMSE) value of method was
calculated as 2,260 MJ/m2. The correlation coefficient of related method was calculated as
0,941. Statistical results are consistent with the literature. Used the Gaussian process
regression method is recommended for other studies.
ANNUAL PRECIPITATION IN SOUTHERN OF MADAGASCAR: MODELING USING HIGH ORDER FUZ...ijfls
The objective of this research is to find the best conventional high order fuzzy time series model for annual precipitation series in southern Madagascar. This work consists on finding the hyper parameters (number of partition of the universe of discourse and model order) to obtain the best conventional high
order fuzzy time series model for our experimental data. In previous works, entitled spatial and temporal variability of precipitation in southern Madagascar, we subdivided the study area between 22 ° S to 30 ° S latitude and 43 ° Eto 48 ° E longitude into four zones of homogeneous precipitation. In this article, we seek to model annual precipitation data representative of one of these four areas. These data were taken between 1979 and 2017. Our approach consists on subdividing the data: data obtained from 1979 to 2001 (60%) for the training and data from 2002 to 2017 (40%) to test the model. To determine the number of partitions and model order, we fix first the number of partitions to 10 and then to 15, 20, 25,30, 35, 40, 45 and 50.For each of these values, we vary the model order from 1 to 10.Thenwe locate the model order which corresponds to the minimum of the average curve between the Mean Absolute Errors (MAE) between the training data and the test data. Thus, the orders of the candidate model are 2, 3, 5, and 6.The next step is to fix the model order with the previous values and vary the number of partitions from 3 to 50.For each couple of hyper parameter of the model (number of partitions, model order), we locate the value of number of partitions corresponding to the minimum of the average curve between the absolute mean of the errors or MAE (Mean Absolute Error) between the train and test data. We obtain the hyper-parameter pairs (37, 2), (20, 3), (35, 5) and (35, 6).The first pair gives the lowest Mean Absolute Error. As a final result, we obtain the best high order fuzzy time series model with hyperparameters umber of partition equals thirty seven and of order equals two for annual precipitation in Southern of Madagascar.
Classification accuracy analyses using Shannon’s EntropyIJERA Editor
There are many methods for determining the Classification Accuracy. In this paper significance of Entropy of
training signatures in Classification has been shown. Entropy of training signatures of the raw digital image
represents the heterogeneity of the brightness values of the pixels in different bands. This implies that an image
comprising a homogeneous lu/lc category will be associated with nearly the same reflectance values that would
result in the occurrence of a very low entropy value. On the other hand an image characterized by the
occurrence of diverse lu/lc categories will consist of largely differing reflectance values due to which the
entropy of such image would be relatively high. This concept leads to analyses of classification accuracy.
Although Entropy has been used many times in RS and GIS but its use in determination of classification
accuracy is new approach.
Improved grey clustering method in risk zonation of mountain flash flood disa...Agriculture Journal IJOEAR
— Flash floods are considered one of the worst weather-related natural disasters. Flash floods are dangerous because they are sudden and highly unpredictable. Identification of the locations of high-risk areas has a major effect on the improvement of flash flood disaster control and prevention. Earlier work conducted on flood disaster risk zonation was commonly based on Digital Elevation Mode (DEM) data and statistical yearbook data and used an index, such as rainfall, topography, slope, or river distribution, with the analytic hierarchy process (AHP) method to determine the weighting. In this method, the final regional risk map was created by using ArcGIS map algebra superposition. In the present study, an improved gray clustering method is put forward to improve the comprehensive evaluation of the risk of mountain flash flood disasters by constructing the exponential whitening function and by using the information entropy weight method, which produces results that are more accurate and more reliable than those of the traditional method. This improved method can make full use of the limited information available, improving not only the resolution but also the influence of the subjective method, and produces more objective and accurate evaluation results. We obtain the risk degree by combining the information entropy weight and improved whitening function approaches in a gray clustering methodology. Additionally, a method is applied to develop models for mapping the risk grade in zones of 1436 towns and counties in Hubei Province with remotely sensed (RS) data and the ArcGIS platform. The results show that the improved approach is useful for rapidly assessing flash flood hazard and vulnerability and for completing risk assessments in mountain areas.
Austin Statistics is an open access, peer reviewed, scholarly journal dedicated to publish articles in all areas of statistics.
The aim of the journal is to provide a forum for scientists, academicians and researchers to find most recent advances in the field statistics.
Austin Statistics accepts original research articles, review articles, case reports and rapid communication on all the aspects of statistics.
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...IJDKP
ABSTRACT
The objectives of this paper were to 1) develop an empirical method for selecting relevant attributes for modelling drought and 2) select the most relevant attribute for drought modelling and predictions in the Greater Horn of Africa (GHA). Twenty four attributes from different domain areas were used for this experimental analysis. Two attribute selection algorithms were used for the current study: Principal Component Analysis (PCA) and correlation-based attribute selection (CAS). Using the PCA and CAS algorithms, the 24 attributes were ranked by their merit value. Accordingly, 15 attributes were selected for modelling drought in GHA. The average merit values for the selected attributes ranged from 0.5 to 0.9. The methodology developed here helps to avoid the uncertainty of domain experts’ attribute selection
challenges, which are unsystematic and dominated by somewhat arbitrary trial. Future research may evaluate the developed methodology using relevant classification techniques and quantify the actual information gain from the developed approach.
The document discusses applying inverse distance weighting (IDW) to interpolate air quality data from Taiwan. IDW was used to generate particulate matter (PM2.5) maps from station data for different power values. The results showed that higher power values produced more centralized, smoothed maps while lower power values provided more detailed surfaces. Statistics of the interpolated surfaces supported findings that higher power leads to smaller average values and larger standard deviations.
International Journal of Engineering Inventions (IJEI) provides a multidisciplinary passage for researchers, managers, professionals, practitioners and students around the globe to publish high quality, peer-reviewed articles on all theoretical and empirical aspects of Engineering and Science.
Covariance models for geodetic applications of collocation brief versionCarlo Iapige De Gaetani
This document summarizes a new methodology for modeling covariance functions for integrated gravity field modeling using collocation. The methodology uses linear programming and the simplex method to estimate parameters of analytical covariance function models to best fit empirical covariance functions from multiple gravity observations. This allows all available empirical covariances to be considered simultaneously, improving over standard methods that model covariances separately and propagate between observations. The results from testing this new methodology show improvements over existing software packages for modeling covariance functions for local gravity applications.
An innovative idea to discover the trend on multi dimensional spatio-temporal...eSAT Journals
Abstract Spatio-temporal data is any information regarding space and time. It is frequently updated data with 1TB/hr, are greatly challenging our ability to digest the data. Thereupon, it is unable to gain exact information from that data. So this research offers an innovative idea to discover the trend on multi-dimensional spatio-temporal datasets. Here it briefly describes the scope and relevancy of spatio-temporal data. From that, gain the depth knowledge of spatio-temporal recent research process to discover the trend. Keywords: Spatio-temporal Data, Applications of Spatio-temporal data, Problem Definition, Contributions
An innovative idea to discover the trend on multi dimensional spatio-temporal...eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
The document describes a method for normalizing well logging curves in the Daqingzi oilfield in China. Key steps include:
1. Selecting key wells that are evenly distributed in the area and have complete coring data to analyze. A standard marker bed is also selected.
2. Using a combined correction method including tendency surface analysis, frequency distribution histograms, and mean correction to standardize sonic logging curves. Tendency surface analysis is used initially but outliers are corrected using the other two methods.
3. The combined method is found to provide a 6% better fit than tendency surface analysis alone when fitting times are four. The combined method better addresses outliers caused by logging equipment differences and human errors.
The two main challenges of predicting the wind speed depend on various atmospheric factors and random variables. This paper explores the possibility of developing a wind speed prediction model using different Artificial Neural Networks (ANNs) and Categorical Regression empirical model which could be used to estimate the wind speed in Coimbatore, Tamil Nadu, India using SPSS software. The proposed Neural Network models are tested on real time wind data and enhanced with statistical capabilities. The objective is to predict accurate wind speed and to perform better in terms of minimization of errors using Multi Layer Perception Neural Network (MLPNN), Radial Basis Function Neural Network (RBFNN) and Categorical Regression (CATREG). Results from the paper have shown good agreement between the estimated and measured values of wind speed.
Gravitational search algorithm with chaotic map (gsa cm) for solving optimiza...eSAT Journals
Abstract
Gravitational Search Algorithm (GSA) is a newly heuristic algorithm inspired by nature which utilizes Newtonian gravity law and
mass interactions. It has captured much attention since it has provided higher performance in solving various optimization
problems. This study hybridizes the GSA and chaotic equations. Ten chaotic-based GSA (GSA-CM) methods, which define the
random selections by different chaotic maps, have been developed. The proposed methods have been applied to the minimization
of benchmark problems and the results have been compared. The obtained numeric results show that most of the proposed
algorithms have increased the performance of GSA and have developed its quality of solution.
Keywords: Computational Intelligence, Evolutionary Computation, Heuristic Algorithms, Chaotic Maps, Optimization
Methods.
This document discusses key properties of spatial data including projection, accuracy, scale, and resolution. It defines projection as the conversion of 3D earth coordinates to a 2D map representation using mathematical formulas. Accuracy refers to how closely a map matches real-world values, which can be measured horizontally, vertically, or relatively. Scale compares distances on a map to actual distances on earth. Resolution is the smallest feature that can be recognized on a map.
Analysis of green’s function and surface current density for rectangular micr...eSAT Journals
Abstract In this paper, Green’s function and surface current density for planar structure has been calculated. The approach makes use of the popular and rigorously used spectral domain full wave analysis method in conjunction with method of moment as numerical analysis tool. In present approach, boundary conditions are applied at patch metallization, which leads to integral equation with the involvement of green’s function in spectral domain, which includes the effect of dielectric, conductor loss, surface wave modes and space wave radiation. By applying Galerkin’s moment method integral equation are transformed to linear set of equations. Entire domain basis function is used to improve the efficiency of the solution. Keywords: Spectral domain full wave analysis, Green function, Galerkin’s moment method, Entire domain basis function, Surface current density.
Analysis of green’s function and surface current density for rectangular micr...eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
This document summarizes a study that used the fuzzy TOPSIS method to select the optimal type of spillway for a dam in northern Greece called Pigi Dam. Five alternative spillway types were evaluated based on nine criteria. The criteria were expressed as triangular fuzzy numbers to account for uncertainty. Weights for the criteria were determined using the AHP method and also expressed linguistically as fuzzy numbers. The fuzzy TOPSIS method was then used to rank the alternatives based on their distances from the ideal and negative-ideal solutions. The alternative with the highest relative closeness to the ideal solution was determined to be the optimal spillway type.
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
This document discusses applying a novel approach using multi-criterion decision analysis (MCDA) with the generalized likelihood uncertainty estimation (GLUE) method to quantify uncertainty in hydrological modeling. Specifically, it examines uncertainty in the SLURP hydrological model. Rather than considering overall Nash-Sutcliffe efficiency, the approach considers NSE values for different flow magnitudes simultaneously. The TOPSIS MCDA method is used to compute predictive intervals by considering NSE values for different flow periods simultaneously. The Kootenay Catchment case study is used to demonstrate the MCDA-GLUE approach.
This document summarizes a research paper that classified multi-date remote sensing images using NDVI values. It discusses how NDVI values were calculated from Terra satellite imagery using red and infrared band values. A similarity measure formula was proposed to classify images based on comparing NDVI values of unknown images to reference images. The formula measured similarity between image windows using sum of absolute differences of NDVI values. Five Terra images from different dates were classified into 20 reference classes using this approach.
Clustering and Classification in Support of Climatology to mine Weather Data ...MangaiK4
Abstract -Knowledge of climate data of region is essential for business, society, agriculture, pollution and energy applications. Climate is not fixed, the fluctuation in the climate can be seen from year to year.Thedata mining application help meteorological scientists to predictaccurate weather forecast and decisions and also provide more performance and reliability than any other methods. The data mining techniques applied on weather data are efficient when compare to the mathematical models used. Various techniques of data mining are applied on climate data to support weather forecasting, climate scientists, agriculture, vegetation, water resources and tourism. The aim of this paper is to provide a review report on various data mining techniques applied on weather data set in support of weather prediction and climate analysis
An exploratory analysis on half hourly electricity load patterns leading to h...ijaia
Accurate prediction of electricity demand can bring
extensive benefits to any country as the forecaste
d
values help the relevant authorities to take decisi
ons regarding electricity generation, transmission
and
distribution appropriately. The literature reveals
that, when compared to conventional time series
techniques, the improved artificial intelligent app
roaches provide better prediction accuracies. Howev
er,
the accuracy of predictions using intelligent appro
aches like neural networks are strongly influenced
by the
correct selection of inputs and the number of neuro
-forecasters used for prediction. Deshani, Hansen,
Attygalle, & Karunarathne (2014) suggested that a c
luster analysis could be performed to group similar
day types, which contribute towards selecting a bet
ter set of neuro-forecasters in neural networks. Th
e
cluster analysis was based on the daily total elect
ricity demands as their target was to predict the d
aily
total demands using neural networks. However, predi
cting half-hourly demand seems more appropriate
due to the considerable changes of electricity dema
nd observed during a particular day. As such cluste
rs
are identified considering half-hourly data within
the daily load distribution curves. Thus, this pape
r is an
improvement to Deshani et. al. (2014), which illust
rates how the half hourly demand distribution withi
n a
day, is incorporated when selecting the inputs for
the neuro-forecasters.
An exploratory analysis on half hourly electricity load patterns leading to h...acijjournal
Accurate prediction of electricity demand can bring
extensive benefits to any country as the forecaste
d
values help the relevant authorities to take decisi
ons regarding electricity generation, transmission
and
distribution appropriately. The literature reveals
that, when compared to conventional time series
techniques, the improved artificial intelligent app
roaches provide better prediction accuracies. Howev
er,
the accuracy of predictions using intelligent appro
aches like neural networks are strongly influenced
by the
correct selection of inputs and the number of neuro
-forecasters used for prediction. Deshani, Hansen,
Attygalle, & Karunarathne (2014) suggested that a c
luster analysis could be performed to group similar
day types, which contribute towards selecting a bet
ter set of neuro-forecasters in neural networks. Th
e
cluster analysis was based on the daily total elect
ricity demands as their target was to predict the d
aily
total demands using neural networks. However, predi
cting half-hourly demand seems more appropriate
due to the considerable changes of electricity dema
nd observed during a particular day. As such cluste
rs
are identified considering half-hourly data within
the daily load distribution curves. Thus, this pape
r is an
improvement to Deshani et. al. (2014), which illust
rates how the half hourly demand distribution withi
n a
day, is incorporated when selecting the inputs for
the neuro-forecasters.
This document summarizes a time series analysis of air pollution data from Richmond, Virginia conducted using R. The analysis examined particulate matter (PM2.5 and PM10), lead, carbon monoxide, and ozone over 2010-2013. Correlation between pollutants was low. Univariate time series models like ARIMA were fitted to each pollutant and compared to 2013 data. ARIMA predicted PM2.5 and lead levels accurately but not other pollutants. The analysis aimed to apply methods from a Bulgarian air pollution study to a US city.
Improved grey clustering method in risk zonation of mountain flash flood disa...Agriculture Journal IJOEAR
— Flash floods are considered one of the worst weather-related natural disasters. Flash floods are dangerous because they are sudden and highly unpredictable. Identification of the locations of high-risk areas has a major effect on the improvement of flash flood disaster control and prevention. Earlier work conducted on flood disaster risk zonation was commonly based on Digital Elevation Mode (DEM) data and statistical yearbook data and used an index, such as rainfall, topography, slope, or river distribution, with the analytic hierarchy process (AHP) method to determine the weighting. In this method, the final regional risk map was created by using ArcGIS map algebra superposition. In the present study, an improved gray clustering method is put forward to improve the comprehensive evaluation of the risk of mountain flash flood disasters by constructing the exponential whitening function and by using the information entropy weight method, which produces results that are more accurate and more reliable than those of the traditional method. This improved method can make full use of the limited information available, improving not only the resolution but also the influence of the subjective method, and produces more objective and accurate evaluation results. We obtain the risk degree by combining the information entropy weight and improved whitening function approaches in a gray clustering methodology. Additionally, a method is applied to develop models for mapping the risk grade in zones of 1436 towns and counties in Hubei Province with remotely sensed (RS) data and the ArcGIS platform. The results show that the improved approach is useful for rapidly assessing flash flood hazard and vulnerability and for completing risk assessments in mountain areas.
Austin Statistics is an open access, peer reviewed, scholarly journal dedicated to publish articles in all areas of statistics.
The aim of the journal is to provide a forum for scientists, academicians and researchers to find most recent advances in the field statistics.
Austin Statistics accepts original research articles, review articles, case reports and rapid communication on all the aspects of statistics.
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...IJDKP
ABSTRACT
The objectives of this paper were to 1) develop an empirical method for selecting relevant attributes for modelling drought and 2) select the most relevant attribute for drought modelling and predictions in the Greater Horn of Africa (GHA). Twenty four attributes from different domain areas were used for this experimental analysis. Two attribute selection algorithms were used for the current study: Principal Component Analysis (PCA) and correlation-based attribute selection (CAS). Using the PCA and CAS algorithms, the 24 attributes were ranked by their merit value. Accordingly, 15 attributes were selected for modelling drought in GHA. The average merit values for the selected attributes ranged from 0.5 to 0.9. The methodology developed here helps to avoid the uncertainty of domain experts’ attribute selection
challenges, which are unsystematic and dominated by somewhat arbitrary trial. Future research may evaluate the developed methodology using relevant classification techniques and quantify the actual information gain from the developed approach.
The document discusses applying inverse distance weighting (IDW) to interpolate air quality data from Taiwan. IDW was used to generate particulate matter (PM2.5) maps from station data for different power values. The results showed that higher power values produced more centralized, smoothed maps while lower power values provided more detailed surfaces. Statistics of the interpolated surfaces supported findings that higher power leads to smaller average values and larger standard deviations.
International Journal of Engineering Inventions (IJEI) provides a multidisciplinary passage for researchers, managers, professionals, practitioners and students around the globe to publish high quality, peer-reviewed articles on all theoretical and empirical aspects of Engineering and Science.
Covariance models for geodetic applications of collocation brief versionCarlo Iapige De Gaetani
This document summarizes a new methodology for modeling covariance functions for integrated gravity field modeling using collocation. The methodology uses linear programming and the simplex method to estimate parameters of analytical covariance function models to best fit empirical covariance functions from multiple gravity observations. This allows all available empirical covariances to be considered simultaneously, improving over standard methods that model covariances separately and propagate between observations. The results from testing this new methodology show improvements over existing software packages for modeling covariance functions for local gravity applications.
An innovative idea to discover the trend on multi dimensional spatio-temporal...eSAT Journals
Abstract Spatio-temporal data is any information regarding space and time. It is frequently updated data with 1TB/hr, are greatly challenging our ability to digest the data. Thereupon, it is unable to gain exact information from that data. So this research offers an innovative idea to discover the trend on multi-dimensional spatio-temporal datasets. Here it briefly describes the scope and relevancy of spatio-temporal data. From that, gain the depth knowledge of spatio-temporal recent research process to discover the trend. Keywords: Spatio-temporal Data, Applications of Spatio-temporal data, Problem Definition, Contributions
An innovative idea to discover the trend on multi dimensional spatio-temporal...eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
The document describes a method for normalizing well logging curves in the Daqingzi oilfield in China. Key steps include:
1. Selecting key wells that are evenly distributed in the area and have complete coring data to analyze. A standard marker bed is also selected.
2. Using a combined correction method including tendency surface analysis, frequency distribution histograms, and mean correction to standardize sonic logging curves. Tendency surface analysis is used initially but outliers are corrected using the other two methods.
3. The combined method is found to provide a 6% better fit than tendency surface analysis alone when fitting times are four. The combined method better addresses outliers caused by logging equipment differences and human errors.
The two main challenges of predicting the wind speed depend on various atmospheric factors and random variables. This paper explores the possibility of developing a wind speed prediction model using different Artificial Neural Networks (ANNs) and Categorical Regression empirical model which could be used to estimate the wind speed in Coimbatore, Tamil Nadu, India using SPSS software. The proposed Neural Network models are tested on real time wind data and enhanced with statistical capabilities. The objective is to predict accurate wind speed and to perform better in terms of minimization of errors using Multi Layer Perception Neural Network (MLPNN), Radial Basis Function Neural Network (RBFNN) and Categorical Regression (CATREG). Results from the paper have shown good agreement between the estimated and measured values of wind speed.
Gravitational search algorithm with chaotic map (gsa cm) for solving optimiza...eSAT Journals
Abstract
Gravitational Search Algorithm (GSA) is a newly heuristic algorithm inspired by nature which utilizes Newtonian gravity law and
mass interactions. It has captured much attention since it has provided higher performance in solving various optimization
problems. This study hybridizes the GSA and chaotic equations. Ten chaotic-based GSA (GSA-CM) methods, which define the
random selections by different chaotic maps, have been developed. The proposed methods have been applied to the minimization
of benchmark problems and the results have been compared. The obtained numeric results show that most of the proposed
algorithms have increased the performance of GSA and have developed its quality of solution.
Keywords: Computational Intelligence, Evolutionary Computation, Heuristic Algorithms, Chaotic Maps, Optimization
Methods.
This document discusses key properties of spatial data including projection, accuracy, scale, and resolution. It defines projection as the conversion of 3D earth coordinates to a 2D map representation using mathematical formulas. Accuracy refers to how closely a map matches real-world values, which can be measured horizontally, vertically, or relatively. Scale compares distances on a map to actual distances on earth. Resolution is the smallest feature that can be recognized on a map.
Analysis of green’s function and surface current density for rectangular micr...eSAT Journals
Abstract In this paper, Green’s function and surface current density for planar structure has been calculated. The approach makes use of the popular and rigorously used spectral domain full wave analysis method in conjunction with method of moment as numerical analysis tool. In present approach, boundary conditions are applied at patch metallization, which leads to integral equation with the involvement of green’s function in spectral domain, which includes the effect of dielectric, conductor loss, surface wave modes and space wave radiation. By applying Galerkin’s moment method integral equation are transformed to linear set of equations. Entire domain basis function is used to improve the efficiency of the solution. Keywords: Spectral domain full wave analysis, Green function, Galerkin’s moment method, Entire domain basis function, Surface current density.
Analysis of green’s function and surface current density for rectangular micr...eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
This document summarizes a study that used the fuzzy TOPSIS method to select the optimal type of spillway for a dam in northern Greece called Pigi Dam. Five alternative spillway types were evaluated based on nine criteria. The criteria were expressed as triangular fuzzy numbers to account for uncertainty. Weights for the criteria were determined using the AHP method and also expressed linguistically as fuzzy numbers. The fuzzy TOPSIS method was then used to rank the alternatives based on their distances from the ideal and negative-ideal solutions. The alternative with the highest relative closeness to the ideal solution was determined to be the optimal spillway type.
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
This document discusses applying a novel approach using multi-criterion decision analysis (MCDA) with the generalized likelihood uncertainty estimation (GLUE) method to quantify uncertainty in hydrological modeling. Specifically, it examines uncertainty in the SLURP hydrological model. Rather than considering overall Nash-Sutcliffe efficiency, the approach considers NSE values for different flow magnitudes simultaneously. The TOPSIS MCDA method is used to compute predictive intervals by considering NSE values for different flow periods simultaneously. The Kootenay Catchment case study is used to demonstrate the MCDA-GLUE approach.
This document summarizes a research paper that classified multi-date remote sensing images using NDVI values. It discusses how NDVI values were calculated from Terra satellite imagery using red and infrared band values. A similarity measure formula was proposed to classify images based on comparing NDVI values of unknown images to reference images. The formula measured similarity between image windows using sum of absolute differences of NDVI values. Five Terra images from different dates were classified into 20 reference classes using this approach.
Clustering and Classification in Support of Climatology to mine Weather Data ...MangaiK4
Abstract -Knowledge of climate data of region is essential for business, society, agriculture, pollution and energy applications. Climate is not fixed, the fluctuation in the climate can be seen from year to year.Thedata mining application help meteorological scientists to predictaccurate weather forecast and decisions and also provide more performance and reliability than any other methods. The data mining techniques applied on weather data are efficient when compare to the mathematical models used. Various techniques of data mining are applied on climate data to support weather forecasting, climate scientists, agriculture, vegetation, water resources and tourism. The aim of this paper is to provide a review report on various data mining techniques applied on weather data set in support of weather prediction and climate analysis
An exploratory analysis on half hourly electricity load patterns leading to h...ijaia
Accurate prediction of electricity demand can bring
extensive benefits to any country as the forecaste
d
values help the relevant authorities to take decisi
ons regarding electricity generation, transmission
and
distribution appropriately. The literature reveals
that, when compared to conventional time series
techniques, the improved artificial intelligent app
roaches provide better prediction accuracies. Howev
er,
the accuracy of predictions using intelligent appro
aches like neural networks are strongly influenced
by the
correct selection of inputs and the number of neuro
-forecasters used for prediction. Deshani, Hansen,
Attygalle, & Karunarathne (2014) suggested that a c
luster analysis could be performed to group similar
day types, which contribute towards selecting a bet
ter set of neuro-forecasters in neural networks. Th
e
cluster analysis was based on the daily total elect
ricity demands as their target was to predict the d
aily
total demands using neural networks. However, predi
cting half-hourly demand seems more appropriate
due to the considerable changes of electricity dema
nd observed during a particular day. As such cluste
rs
are identified considering half-hourly data within
the daily load distribution curves. Thus, this pape
r is an
improvement to Deshani et. al. (2014), which illust
rates how the half hourly demand distribution withi
n a
day, is incorporated when selecting the inputs for
the neuro-forecasters.
An exploratory analysis on half hourly electricity load patterns leading to h...acijjournal
Accurate prediction of electricity demand can bring
extensive benefits to any country as the forecaste
d
values help the relevant authorities to take decisi
ons regarding electricity generation, transmission
and
distribution appropriately. The literature reveals
that, when compared to conventional time series
techniques, the improved artificial intelligent app
roaches provide better prediction accuracies. Howev
er,
the accuracy of predictions using intelligent appro
aches like neural networks are strongly influenced
by the
correct selection of inputs and the number of neuro
-forecasters used for prediction. Deshani, Hansen,
Attygalle, & Karunarathne (2014) suggested that a c
luster analysis could be performed to group similar
day types, which contribute towards selecting a bet
ter set of neuro-forecasters in neural networks. Th
e
cluster analysis was based on the daily total elect
ricity demands as their target was to predict the d
aily
total demands using neural networks. However, predi
cting half-hourly demand seems more appropriate
due to the considerable changes of electricity dema
nd observed during a particular day. As such cluste
rs
are identified considering half-hourly data within
the daily load distribution curves. Thus, this pape
r is an
improvement to Deshani et. al. (2014), which illust
rates how the half hourly demand distribution withi
n a
day, is incorporated when selecting the inputs for
the neuro-forecasters.
This document summarizes a time series analysis of air pollution data from Richmond, Virginia conducted using R. The analysis examined particulate matter (PM2.5 and PM10), lead, carbon monoxide, and ozone over 2010-2013. Correlation between pollutants was low. Univariate time series models like ARIMA were fitted to each pollutant and compared to 2013 data. ARIMA predicted PM2.5 and lead levels accurately but not other pollutants. The analysis aimed to apply methods from a Bulgarian air pollution study to a US city.
Time Series Data Analysis for Forecasting – A Literature ReviewIJMER
This document summarizes literature on using statistical and data mining techniques for time series forecasting, with a focus on weather prediction. Section 2 discusses various statistical techniques used in literature such as ARIMA models, exponential smoothing models, and spectral analysis methods for time series rainfall and weather forecasting. Section 3 discusses data mining techniques used for time series forecasting, including neural networks and evolutionary computation methods. Several studies applying neural networks to weather prediction are summarized.
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...csandit
Accurate prediction of electricity demand can bring extensive benefits to any country as the
forecast values help the relevant authorities to take decisions regarding electricity generation,
transmission and distribution much appropriately. The literature reveals that, when compared
to conventional time series techniques, the improved artificial intelligent approaches provide
better prediction accuracies. However, the accuracy of predictions using intelligent approaches
like neural networks are strongly influenced by the correct selection of inputs and the number of
neuro-forecasters used for prediction. This research shows how a cluster analysis performed to
group similar day types, could contribute towards selecting a better set of neuro-forecasters in
neural networks. Daily total electricity demands for five years were considered for the analysis
and each date was assigned to one of the thirteen day-types, in a Sri Lankan context. As a
stochastic trend could be seen over the years, prior to performing the k-means clustering, the
trend was removed by taking the first difference of the series. Three different clusters were
found using Silhouette plots, and thus three neuro-forecasters were used for predictions. This
paper illustrates the proposed modified neural network procedure using electricity demand
data.
Spectroscopy or hyperspectral imaging consists in the acquisition, analysis, and extraction of the spectral information measured on a specific region or object using an airborne or satellite device. Hyperspectral imaging has become an active field of research recently. One way of analysing such data is through clustering. However, due to the high dimensionality of the data and the small distance between the different material signatures, clustering such a data is a challenging task.In this paper, we empirically compared five clustering techniques in different hyperspectral data sets. The considered clustering techniques are K-means, K-medoids, fuzzy Cmeans, hierarchical, and density-based spatial clustering of applications with noise. Four data sets are used to achieve this purpose which is Botswana, Kennedy space centre, Pavia, and Pavia University. Beside the accuracy, we adopted four more similarity measures: Rand statistics, Jaccard coefficient, Fowlkes-Mallows index, and Hubert index. According to accuracy, we found that fuzzy C-means clustering is doing better on Botswana and Pavia data sets, K-means and K-medoids are giving better results on Kennedy space centre data set, and for Pavia University the hierarchical clustering is better
International Journal of Engineering and Science Invention (IJESI)inventionjournals
This document discusses multidimensional clustering methods for data mining and their industrial applications. It begins with an introduction to clustering, including definitions and goals. Popular clustering algorithms are described, such as K-means, fuzzy C-means, hierarchical clustering, and mixture of Gaussians. Distance measures and their importance in clustering are covered. The K-means and fuzzy C-means algorithms are explained in detail. Examples are provided to illustrate fuzzy C-means clustering. Finally, applications of clustering algorithms in fields such as marketing, biology, and earth sciences are mentioned.
Forecasting of electric consumption in a semiconductor plant using time serie...Alexander Decker
This document summarizes a study that used time series methods to forecast electricity consumption in a semiconductor plant. The study analyzed 36 months of historical electricity consumption data from 2010-2012 to select the best forecasting model. Single exponential smoothing was found to have the lowest Mean Absolute Percentage Error (MAPE) of 5.60% and was determined to be the best forecasting method. The selected model will be used to forecast future electricity consumption for the plant.
Introduction of Lateral Decision Conveying Approach Based Multi Criteria Asse...ijsrd.com
Out lining of potentiality is regarded as the foundation step towards the conservation and management of any resource. Continuous and substantial extraction of ground water has already minimized the quantity of such a useful life supporting flow resource. Delineation of ground water potentiality (GWP) is essential not only to endure the present civilization from the prevailing shadow of water crisis but also to increase the sustainability of water resource development for the generation to come. In the present study efforts are made to scientifically delineate the GWP of selected study area by multi criteria assessment (MCA) of several hydro-geomorphic data and facts. While determining the individual weight and sub weight for MCA, through the study a new approach has been developed in order to strike a balance between accuracy and intricacy.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
A study of region based segmentation methods for mammogramseSAT Journals
Abstract Breast Cancer is one of the most common diseases that are found in women. The number of women getting affected by cancer is increasing year by year. Detecting cancer in the late stages, leads to very complicated surgeries and the chance of death is very high. Early detection of Breast Cancer helps in less complicated procedures and early recovery. Many tests have been found so as to detect cancer. Some of these tests are mammography; ultrasound etc.Mammography is a method that helps in early detection of Breast cancer. But finding the mass and its spread from a mammographic image is very difficult. Expert radiologists are needed for accurate reading of a mammogram. Researchers have been working for years for algorithms that help for easy detection and segmentation of breast masses. Feature extraction and classification have also been done extensively so that the studied cases can be compared to diagnose the new cases. Segmentation of cancerous mass regions from the breast tissues is a difficult process. Many algorithms have been proposed for this. Some of these algorithms are region growing, watershed segmentation, clustering etc. Region Growing Method is based on two major factors which is the seed point selection and then the stopping criteria. Watershed Segmentation on the other hand is based on the basic geographical concept of watersheds and catchment basins, and uses a technique called as flooding. A study of these two major region based methods such as Region Growing and Watershed Segmentation are compared and detailed in this paper. Keywords: Mammography, Mass Detection, Segmentation, Region growing, and Watershed Segmentation.
The document compares the predictive performance of classification trees and logistic regression models in determining whether individuals are insured or not based on demographic and socioeconomic characteristics. It first describes the data and techniques used, including classification trees, cost complexity pruning, logistic regression, bagging, and random forests. It then describes the research method of using different portions of the data for model training and testing. The results show that logistic regression performs as well as classification trees on nonlinear data and better on linear data. Both methods select income as an important predictor, while logistic regression favors dummy variables and classification trees favor continuous variables. Random forests have the highest predictive accuracy overall.
A hybrid approach for analysis of dynamic changes in spatial dataijdms
Any geographic location undergoes changes over a period of time. These changes can be observed by
naked eye, only if they are huge in number spread over a small area. However, when the changes are small
and spread over a large area, it is very difficult to observe or extract the changes. Presently, there are few
methods available for tackling these types of problems, such as GRID, DBSCAN etc. However, these
existing mechanisms are not adequate for finding an accurate changes or observation which is essential
with respect to most important geometrical changes such as deforestations and land grabbing etc.,. This
paper proposes new mechanism to solve the above problem. In this proposed method, spatial image
changes are compared over a period of time taken by the satellite. Partitioning the satellite image in to
grids, employed in the proposed hybrid method, provides finer details of the image which are responsible
for improving the precision of clustering compared to whole image manipulation, used in DBSCAN, at a
time .The simplicity of DBSCAN explored while processing portioned grid portion.
This document compares hierarchical and non-hierarchical clustering algorithms. It summarizes four clustering algorithms: K-Means, K-Medoids, Farthest First Clustering (hierarchical algorithms), and DBSCAN (non-hierarchical algorithm). It describes the methodology of each algorithm and provides pseudocode. It also describes the datasets used to evaluate the performance of the algorithms and the evaluation metrics. The goal is to compare the performance of the clustering methods on different datasets.
An improvement in k mean clustering algorithm using better time and accuracyijpla
This document summarizes a research paper that proposes an improved K-means clustering algorithm to enhance accuracy and reduce computation time. The standard K-means algorithm randomly selects initial cluster centroids, affecting results. The proposed algorithm systematically determines initial centroids based on data point distances. It assigns data to the closest initial centroid to generate initial clusters. Iteratively, it calculates new centroids and reassigns data only if distances decrease, reducing unnecessary computations. Experiments on various datasets show the proposed algorithm achieves higher accuracy faster than standard K-means.
Computer model simulations are widely used in the investigation of complex hydrological systems. In particular, hydrological models are tools that help both to better understand hydrological processes and to predict extreme events such as floods and droughts. Usually, model parameters need to be estimated through calibration, in order to constrain model outputs to observed variables.
Relevant model parameters used for calibration are usually selected based on expert knowledge of the modeller or by using a local one-at-a-time (OAT) sensitivity analysis (SA). However, in case of complex models those approaches may not result in proper identification of the most sensitive parameters for model calibration. In particular local OAT SA methods are only effective for assessing the relative importance of input factors when the model is linear, monotonic, and additive, which is rarely the case for complex environmental models. In contrast Global Sensitivity Analysis (GSA)
is a formal method for statistical evaluation of relevant parameters that contribute significantly to model performance. GSA techniques explore the entire feasible space of each model parameter, and they do not require any assumptions on the model nature (such as linearity or additivity).
In this work we apply the GSA to LISFLOOD, a fully-distributed hydrological model used for flood forecasting at Pan-European scale within the European Flood Awareness System (EFAS). Two case studies are considered, snowmelt- and evapotranspiration-driven catchments, to identify sensitive parameters for both types of hydrological regimes. Results of the GSA will then be used for selecting parameters that need to be estimated during model calibration. Considering the large
number of parameters of a fully-distributed model, a two-step GSA framework is applied. First, we implement the computationally efficient screening method of Morris. This method requires a limited number of simulations and produces a qualitative ranking and selection of important factors. As a second step, we apply the variance-based method of Sobol, only to the subset of factors determined as important during the previous screening. The method of Sobol provides quantitative estimates for first order and total order sensitivity indexes of input factors.
The calibration results after the GSA will be described for both case studies and compared against those obtained by using only prior expert knowledge
The document summarizes research on medical image segmentation algorithms. It discusses k-means clustering, fuzzy c-means clustering, and proposes enhancements to these algorithms. Specifically, it introduces an enhanced k-means algorithm that improves initial cluster center selection. It also presents a kernelized fuzzy c-means approach that maps data points into a feature space to perform clustering. The algorithms are tested on MRI brain images and evaluated based on segmentation accuracy. The enhanced methods aim to produce more precise segmentations for medical applications such as diagnosis and treatment planning.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
COMPARISON OF HIERARCHICAL AGGLOMERATIVE ALGORITHMS FOR CLUSTERING MEDICAL DO...ijseajournal
Extensive amount of data stored in medical documents require developing methods that help users to find
what they are looking for effectively by organizing large amounts of information into a small number of
meaningful clusters. The produced clusters contain groups of objects which are more similar to each other
than to the members of any other group. Thus, the aim of high-quality document clustering algorithms is to
determine a set of clusters in which the inter-cluster similarity is minimized and intra-cluster similarity is
maximized. The most important feature in many clustering algorithms is treating the clustering problem as
an optimization process, that is, maximizing or minimizing a particular clustering criterion function
defined over the whole clustering solution. The only real difference between agglomerative algorithms is
how they choose which clusters to merge. The main purpose of this paper is to compare different
agglomerative algorithms based on the evaluation of the clusters quality produced by different hierarchical
agglomerative clustering algorithms using different criterion functions for the problem of clustering
medical documents. Our experimental results showed that the agglomerative algorithm that uses I1 as its
criterion function for choosing which clusters to merge produced better clusters quality than the other
criterion functions in term of entropy and purity as external measures.
Similar to Defining Homogenous Climate zones of Bangladesh using Cluster Analysis (20)
Evaluation of Agro-morphological Performances of Hybrid Varieties of Chili Pe...Premier Publishers
In Benin, chilli pepper is a widely consumed as vegetable whose production requires the use of performant varieties. This work assessed, at Parakou and Malanville, the performance of six F1 hybrids of chilli including five imported (Laali, Laser, Nandi, Kranti, Nandita) and one local (De cayenne), in completely randomized block design at four replications and 15 plants per elementary plot. Agro-morphological data were collected and submitted to analysis of variance and factor analysis of mixed data. The results showed the effects of variety, location and their interactions were highly significant for most of the growth, earliness and yield traits. Imported hybrid varieties showed the best performances compared to the local one. Multivariate analysis revealed that 'De cayenne' was earlier, short in size, thin-stemmed, red fruits and less yielding (≈ 1 t.ha-1). The imported hybrids LaaliF1 and KrantiF1 were of strong vegetative vigor, more yielding (> 6 t.ha-1) by developing larger, long and hard fruits. Other hybrids showed intermediate performances. This study highlighted the importance of imported hybrids in improving yield and preservation of chili fruits. However, stability and adaptation analyses to local conditions are necessary for their adoption.
An Empirical Approach for the Variation in Capital Market Price Changes Premier Publishers
The chances of an investor in the stock market depends mainly on some certain decisions in respect to equilibrium prices, which is the condition of a system competing favorably and effectively. This paper considered a stochastic model which was latter transformed to non-linear ordinary differential equation where stock volatility was used as a key parameter. The analytical solution was obtained which determined the equilibrium prices. A theorem was developed and proved to show that the proposed mathematical model follows a normal distribution since it has a symmetric property. Finally, graphical results were presented and the effects of the relevant parameters were discussed.
Influence of Nitrogen and Spacing on Growth and Yield of Chia (Salvia hispani...Premier Publishers
Chia is an emerging cash crop in Kenya and its production is inhibited by lack of agronomic management information. A field experiment was conducted in February-June and May-August 2021, to determine the influence of nitrogen and spacing on growth and yield of Chia. A randomized complete block design with a split plot arrangement was used with four nitrogen rates as the main plots (0, 40, 80, 120 kg N ha-1) and three spacing (30 cm x 15 cm (s1), 30 cm x 30 cm (s2), 50 cm x 50 cm (s3)). Application of 120 kg N ha-1 significantly increased (p≤0.05) vegetative growth and seed yield of Chia. Stem height, branches, stem diameter and leaves increased by 23-28%, 11-13%, 43-55% and 59-88% respectively. Spacing s3 significantly increased (p≤0.05) vegetative growth. An increase of 27-74%, 36-45% and 73-107% was recorded in number of leaves, stem diameter and dry weight, respectively. Chia yield per plant was significantly higher (p≤0.05) in s3. However, when expressed per unit area, s1 significantly produced higher yields. The study recommends 120 kg N ha-1 or higher nitrogen rates and a closer spacing of 15 cm x 30 cm as the best option for Chia production in Kenya.
Enhancing Social Capital During the Pandemic: A Case of the Rural Women in Bu...Premier Publishers
The document discusses a case study of enhancing social capital among rural women in Bukidnon Province, Philippines during the COVID-19 pandemic through a livelihood project. Key findings include:
1) Technical trainings provided by the project increased the women's knowledge, allowing them to generate additional household income through vegetable gardening during the pandemic.
2) The women's social capital, as measured by groups/networks, trust, and cooperation, increased by 15.5% from 2019 to 2020 through increased participation in their association.
3) Main occupations, income sources, and ethnicity influenced the women's social capital. The project enhanced social ties that empowered the rural women economically and socially despite challenges of the pandemic.
Impact of Provision of Litigation Supports through Forensic Investigations on...Premier Publishers
This paper presents an argument through the fraud triangle theory that the provision of litigation supports through forensic audits and investigations in relation to corporate fraud cases is adequate for effective prosecution of perpetrators as well as corporate fraud prevention. To support this argument, this study operationalized provision of litigation supports through forensic audit and investigations, data mining for trends and patterns, and fraud data collection and preparation. A sample of 500 respondents was drawn from the population of professional accountants and legal practitioners in Nigeria. Questionnaire was used as the instrument for data collection and this was mailed to the respective respondents. Resulting responses were analyzed using the OLS multiple regression techniques via the SPSS statistical software. The results reveal that the provision of litigation supports through forensic audits and investigations, fraud data mining for trends and patterns and fraud data collection and preparation for court proceedings have a positive and significant impact on corporate fraud prevention in Nigeria. This study therefore recommends that regulators should promote the provision of litigation supports through forensic audits and investigations in relation to corporate fraud cases in publicly listed firms in Nigeria, as this will help provide reports that are acceptable in court proceedings.
Improving the Efficiency of Ratio Estimators by Calibration WeightingsPremier Publishers
It is observed that the performances of most improved ratio estimators depend on some optimality conditions that need to be satisfied to guarantee better estimator. This paper develops a new approach to ratio estimation that produces a more efficient class of ratio estimators that do not depend on any optimality conditions for optimum performance using calibration weightings. The relative performances of the proposed calibration ratio estimators are compared with a corresponding global [Generalized Regression (GREG)] estimator. Results of analysis showed that the proposed calibration ratio estimators are substantially superior to the traditional GREG-estimator with relatively small bias, mean square error, average length of confidence interval and coverage probability. In general, the proposed calibration ratio estimators are more efficient than all existing estimators considered in the study.
Urban Liveability in the Context of Sustainable Development: A Perspective fr...Premier Publishers
Urbanization and quality of urban life are mutually related and however it varies geographically and regionally. With unprecedented growth of urban centres, challenge against urban development is more in terms of how to enhance quality of urban life and liveability. Making sense of and measuring urban liveability of urban places has become a crucial step in the context of sustainable development paradigm. Geographical regions depict variations in nature of urban development and consequently level of urban liveability. The coastal regain of West Bengal faces unusual challenges caused by increasing urbanization, uncontrolled growth, and expansion of economic activities like tourism and changing environmental quality. The present study offers a perspective on urban liveability of urban places located in coastal region comprising of Purba Medinipur and South 24 Parganas districts. The study uses the liveability standards covering four major pillars- institutional, social, economic and physical and their indicators. This leads to develop a City Liveability Index to rank urban places of the region, higher the index values better the urban liveability. The data for the purpose is collected from various secondary sources. Study finds that the eastern coastal region of the country covering state of West Bengal depicts variations in index of liveability determined by physical, economic, social and institutional indicators.
Transcript Level of Genes Involved in “Rebaudioside A” Biosynthesis Pathway u...Premier Publishers
Stevia rebaudiana Bertoni is a plant which has recently been used widely as a sweetener. This medicinal plant has some components such as diterpenoid glycosides called steviol glycosides [SGs]. Rebaudioside A is a diterpenoid steviol glycoside which is 300 times sweeter than table sugar. This study was done to investigate the effect of GA3 (50 mg/L) on the expression of 14 genes involved in Rebaudioside A biosynthesis pathway in Stevia rebaudiana under in vitro conditions. The expression of DXS remarkably decreased by day 3. Also, probably because of the negative feedback of GA3 on MEP-drived isoprenes, GGDS transcript level reached its lowest amount after GA3 treatment. The abundance of DXR, CMS, CMK, MCS, and CDPS transcripts showed a significant increase at various days after this treatment. A significant drop in the expression levels of KS and UGT85C2 is detected during the first day. However, expression changes of HDR and KD were not remarkable. Results revealed that the level of transcript of UGT74G1 and UGT76G1 up regulated significantly 4 and 2 times higher than control, respectively. However, more research needs to shed more light on the mechanism of GA3 on gene expression of MEP pathway.
Multivariate Analysis of Tea (Camellia sinensis (L.) O. Kuntze) Clones on Mor...Premier Publishers
Information on genetic variability for biochemical characters is a prerequisite for improvement of tea quality. Thirteen introduced tea clones characterized with objective; assessing tea clones based on morphological characters at Melko and Gera research stations. The study was conducted during 2017/18 cropping season on experimental plots in RCBD with three replications. Data recorded on morphological traits like days from pruning to harvest, height to first branch, stem diameter, leaf serration density, leaf length, leaf width, leaf size, petiole length, leaf ratio, internode length, shoot length, number of shoot, canopy diameter, hundred shoot weight, fresh leaf yield per tree. Cluster analysis of morphological trait grouped into four clusters indicated, the existence of divergence among the tested clones. The maximum inter-cluster distance was between clusters I and IV (35.27) while the minimum inter cluster distance was observed between clusters I and II (7.8).Principal components analysis showed that the first five principal components with eigenvalues greater than one accounted 86.45% for 15 morphological traits. Generally, the study indicated presence of variability for several morphological traits. However, high morphological variation between clones is not a guarantee for a high genetic variation; therefore, molecular studies need to be considered as complementary to biochemical studies.
Causes, Consequences and Remedies of Juvenile Delinquency in the Context of S...Premier Publishers
This research work was designed to examine nature of juvenile offences committed by juveniles, causes of juvenile delinquency, consequences of juvenile delinquency and remedies for juvenile delinquency in the context of Sub-Saharan Africa with specific reference to Eritrea. Left unchecked, juvenile delinquents on the streets engage in petty theft, take alcohol or drugs, rape women, rob people at night involve themselves in criminal gangs and threaten the public at night. To shed light on the problem of juvenile delinquency in the Sub-Saharan region data was collected through primary and secondary sources. A sample size of 70 juvenile delinquents was selected from among 112 juvenile delinquents in remand at the Asmara Juvenile Rehabilitation Center in the Eritrean capital. The study was carried out through coded self-administered questionnaires administered to a sample of 70 juvenile delinquents. The survey evidence indicates that the majority of the juvenile respondents come either from families constructed by unmarried couples or separated or divorced parents where largely the father is missing in the home or dead. The findings also indicate that children born out of wedlock, families led by single mothers, lack of fatherly role models, poor parental-child relationships and negative peer group influence as dominant causes of juvenile infractions. The implication is that broken and stressed families are highly likely to be the breeding grounds for juvenile delinquency. The survey evidence indicates that stealing, truancy or absenteeism from school, rowdy or unruly behavior at school, free-riding in public transportation, damaging the book of fellow students and beating other young persons are the most common forms of juvenile offenses. It is therefore, recommended that parents and guardians should exercise proper parental supervision and give adequate care to transmit positive societal values to children. In addition, the government, the police, prosecution and courts, non-government organizations, parents, teachers, religious leaders, education administrators and other stakeholders should develop a child justice system that strives to prevent children from entering deeper into the criminal justice process.
The Knowledge of and Attitude to and Beliefs about Causes and Treatments of M...Premier Publishers
Stigma and discrimination associated with mental illness are a common occurrence in the Sub-Saharan region including Eritrea. Numerous studies from Sub-Saharan Africa suggest that stigma and discrimination are major problems in the community, with negative attitudes and behavior towards people with mental illness being widespread. In order to assess the whether such negative attitudes persist in the context of Eritrea this study explored the knowledge and perceptions of 90 Eritrean university students at the College of Business and Economics, the University of Asmara regarding the causes and remedies of mental illness A qualitative method involving coded self-administered questionnaires administered to a sample of 90 university students to collecting data at the end of 2019. The survey evidence points that almost 50% of the respondents had contact with a mentally ill person suggesting that the significant number of the respondents experienced a first-hand encounter and knowledge of mental illness in their family and community. The findings show an overall greater science-based understanding of the causes of mental illness to be followed by recommended psychiatric treatments. The survey evidence indicates that the top three leading causes of mental illness in the context of Eritrea according to the respondents are brain disease (76%), bad events in the life of the mentally ill person (66%) and substance abuse or alcohol taking, smoking, taking drugs like hashish. (54%). The majority of the respondents have a very sympathetic and positive outlook towards mentally ill persons suggesting that mentally illness does not simply affect a chosen individual rather it can happen to anybody regardless of economic class, social status, ethnicity race and religion. Medical interventions cited by the majority of the respondents as being effective treatments for mental illness centered on the idea that hospitals and clinics for treatment and even cures for psychiatric disease. Changing perceptions of mental illnesses in Eritrea that paralleled the very caring and sympathetic attitudes of the sample university students would require raising public awareness regarding mental illness through education, using the mass media to raise public awareness, integrating mental health into the primary health care system, decentralizing mental health care services to increase access to treatment and providing affordable service to maintain positive treatment outcomes.
Effect of Phosphorus and Zinc on the Growth, Nodulation and Yield of Soybean ...Premier Publishers
This study investigated the effects of phosphorus and zinc on the growth, nodulation, and yield of two soybean varieties in Nigeria. Phosphorus application significantly affected growth, nodulation, yield, and some yield components, with 60 kg P2O5/ha giving the highest growth and yield. Phosphorus also increased nodulation, with 30 kg P2O5/ha providing the highest nodulation. Zinc application did not significantly affect most growth characters or nodulation, except for reducing plant height. Phosphorus increased soybean yield significantly to 1.9 t/ha compared to the control of 1.7 t/ha. Protein and oil contents were not significantly affected by phosphorus but were by zinc
Influence of Harvest Stage on Yield and Yield Components of Orange Fleshed Sw...Premier Publishers
A field experiment was conducted at Adami Tullu Agricultural Research Center in 2018 under rainfed condition with supplementary irrigation to determine the influence of harvest stage on vine yield and tuberous root yield of orange fleshed sweet potato varieties. The experiment consisted of four harvest stages (105, 120, 135 and 150 days after planting) and Kulfo, Tulla and Guntute varieties. A 4 X 3 factorial experiment arranged in randomized complete block design with three replications was used. Interaction of harvest stage and variety significantly influenced above ground fresh biomass, vine length, marketable tuberous root weight per hectare, commercial harvest index and harvest index. The highest mean values of above ground fresh biomass (66.12 t/ha) and marketable tuberous root weight (56.39 t/ha) were produced by Guntute variety harvested at 135 days after planting. Based on the results, it can be recommended that, farmers of the study area can grow Guntute variety by harvesting at 135 days after planting to obtain optimum vine and tuberous root yields.
Performance evaluation of upland rice (Oryza sativa L.) and variability study...Premier Publishers
This study evaluated 13 upland rice varieties over two locations in Ethiopia for yield and other traits. Significant differences were found among varieties for several traits. The highest yielding varieties were Chewaka, Hiddassie, and Fogera 1. Chewaka yielded 5395.8 kg/ha on average, 25.8-35% more than the check. Most varieties matured within 120-130 days. High heritability was found for days to heading, panicle length, and grain yield, indicating these traits can be easily improved through selection. Grain yield also had high genetic variation and heritability with genetic advance, suggesting yield can be improved through selection. This study identified variability that can be used
Response of Hot Pepper (Capsicum Annuum L.) to Deficit Irrigation in Bennatse...Premier Publishers
This study was conducted at Enchete kebele in Benna-Tsemay Woreda, South Omo Zone to evaluate the response of hot pepper to deficit irrigation on yield and water productivity under furrow irrigation system. The experiment comprised four treatments (100 % of ETc, 85% of ETc, 70 % of ETc and 50% of ETc), respectively. The experiment was laid out in RCBD and replicated four times. The two years combined yield results indicated that, the maximum total yield (20.38 t/ha) was obtained from 100% ETc while minimum yield (12.92 t/ha) was obtained from 50% of ETc deficit irrigation level. The highest WUE 5.22 kg/ha mm-1 was obtained from 50% of ETc. Treatment of 100% ETc irrigation application had highest benefit cost ratio (4.5) than all others treatments. Applying 50% of ETc reduce the yield by 37% when compared to 100 % ETc. Accordingly, to achieve maximum hot pepper yield in areas where water is not scarce, applying 100% ETc irrigation water application level throughout whole growing season under furrow irrigation system is recommended. But, in the study area water scarcity is the major limiting factor for crop production. So, it is possible to get better yield and water productivity of hot pepper when we apply 85% ETc irrigation water throughout growing season under furrow irrigation system.
Harnessing the Power of Agricultural Waste: A Study of Sabo Market, Ikorodu, ...Premier Publishers
Nigeria is still burdened with huge responsibilities of waste disposal because the potential for benefits of proper waste management is yet to be harnessed. The paper evaluates the capacity of the Sabo Cattle market in producing the required quantities of waste from animal dung alongside decomposed fruits with a view to generating renewable energy possibilities for lighting, security and other business activities of the market. It is estimated that about 998 million tons of agricultural waste is produced yearly in the country with organic wastes amounting to 80 percent of the total solid wastes. This can be categorized into biodegradable and non-biodegradable wastes. The paper evaluates the capacity of the Sabo Cattle market in producing the required quantities of waste from animal dung alongside decomposed fruits with a view to generating renewable energy possibilities for lighting, security and other business activities of the market. The Sabo market was treated as a study case with the adoption of in-depth examinations of the facility, animals and products for sale and waste generated. A combination of experimental, interviews (qualitative) and design simulation (for final phase) was adopted to extract, verify and analyse the data generated from the study. Animal waste samples were subjected to compositional and fibre analysis with results showing that the sample has high potency for biogas production. Biodegradable Wastes are human and animal excreta, agricultural and all degradable wastes. Availability of high quantity of waste generated being organic in Sabo market allows the use of anaerobic digestion to be proposed as a waste to energy technology due to its feasibility for conversion of moist biodegradable wastes into biogas. The study found that at peak supply period during the Islamic festivities, a conservative 300tonnes of animal waste is generated during the week which translates to over 800kilowatts of electricity.
Influence of Conferences and Job Rotation on Job Productivity of Library Staf...Premier Publishers
The general purpose of this study is to investigate the influence of conferences and job rotation on job productivity of library staff in tertiary institutions in Imo State, Nigeria. The survey research design was used for this study using questionnaire as an instrument for data collection. This study covered the entire population of 661. Out of these, 501 copies of the questionnaire representing 75.8% were duly completed and returned for analysis. Student’s t-test was used to analyze the research questions. The finding showed that conferences had no significant influence on the job productivity of library staff in tertiary institutions in Imo State, Nigeria (F cal= 7.86; t-vale =6.177; p >0.005). Finding also showed that job rotation significantly influences job productivity of library staff in tertiary institutions in Imo State, Nigeria (F-cal value= 18.65; t-value = 16.225; P<0.05). This study recommended that, government should ensure that library staff participate in conferences with themes and topics that are relevant to the job they perform and also ensure that there should be proper evaluation and feedback mechanism which aimed to ensuring control and minimize abuse of their development opportunities. Again, there should be written statement of objectives in order to sustain job rotation programmes. Also, that training and development needs of library staff must be identified and analyzed before embarking on job rotation processes as this would help to build skills, competences, specialization and high job productivity.
Scanning Electron Microscopic Structure and Composition of Urinary Calculi of...Premier Publishers
This document summarizes a study on the scanning electron microscopic structure and chemical composition of urinary calculi (stones) found in geriatric dogs. Microscopic examination of urine samples revealed increased numbers of blood cells, epithelial cells, pus cells, casts, bacteria and crystals of various shapes, predominantly struvite, calcium oxalate dihydrate and monohydrate, and ammonium urate. Scanning electron microscopy showed perpendicular columnar strata of struvite crystals and wavy phases of uric acid. Chemical analysis identified calcium phosphate, calcium oxalate and urea stones. The study characterized the microscopic and electron microscopic appearance of crystals and chemical composition of urinary calculi in geriatric dogs.
Gentrification and its Effects on Minority Communities – A Comparative Case S...Premier Publishers
This paper does a comparative analysis of four global cities and their minority districts which have been experiencing the same structural pressure of gentrification. The main contribution of this paper is providing a detailed comparison of four micro geographies worldwide and the impacts of gentrification on them: Barrio Logan in San Diego, Bo-Kaap in Cape Town, the Mission District in San Francisco, and the Rudolfsheim-Fünfhaus District in Vienna. All four cities have been experiencing the displacement of minority communities due to increases in property values. These cities were chosen because their governments enacted different policies to temper the gentrification process. It was found that cities which implemented social housing and cultural inclusionary policies were more successful in maintaining the cultural and demographic make-up of the districts.
Oil and Fatty Acid Composition Analysis of Ethiopian Mustard (Brasicacarinata...Premier Publishers
The experiments was conducted at Holetta Agricultural Research Center, to analyze forty nine Ethiopian Mustard land races for oil and fatty acid composition traits The experiment was carried out in a simple lattice design. The analysis of variance showed that there were highly significant differences among genotypes for all oil and fatty acid traits compared. The significant difference indicates the existence of genetic variability among the land races which is important for improvement
বাংলাদেশের অর্থনৈতিক সমীক্ষা ২০২৪ [Bangladesh Economic Review 2024 Bangla.pdf] কম্পিউটার , ট্যাব ও স্মার্ট ফোন ভার্সন সহ সম্পূর্ণ বাংলা ই-বুক বা pdf বই " সুচিপত্র ...বুকমার্ক মেনু 🔖 ও হাইপার লিংক মেনু 📝👆 যুক্ত ..
আমাদের সবার জন্য খুব খুব গুরুত্বপূর্ণ একটি বই ..বিসিএস, ব্যাংক, ইউনিভার্সিটি ভর্তি ও যে কোন প্রতিযোগিতা মূলক পরীক্ষার জন্য এর খুব ইম্পরট্যান্ট একটি বিষয় ...তাছাড়া বাংলাদেশের সাম্প্রতিক যে কোন ডাটা বা তথ্য এই বইতে পাবেন ...
তাই একজন নাগরিক হিসাবে এই তথ্য গুলো আপনার জানা প্রয়োজন ...।
বিসিএস ও ব্যাংক এর লিখিত পরীক্ষা ...+এছাড়া মাধ্যমিক ও উচ্চমাধ্যমিকের স্টুডেন্টদের জন্য অনেক কাজে আসবে ...
This presentation includes basic of PCOS their pathology and treatment and also Ayurveda correlation of PCOS and Ayurvedic line of treatment mentioned in classics.
How to Make a Field Mandatory in Odoo 17Celine George
In Odoo, making a field required can be done through both Python code and XML views. When you set the required attribute to True in Python code, it makes the field required across all views where it's used. Conversely, when you set the required attribute in XML views, it makes the field required only in the context of that particular view.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
Main Java[All of the Base Concepts}.docxadhitya5119
This is part 1 of my Java Learning Journey. This Contains Custom methods, classes, constructors, packages, multithreading , try- catch block, finally block and more.
How to Manage Your Lost Opportunities in Odoo 17 CRMCeline George
Odoo 17 CRM allows us to track why we lose sales opportunities with "Lost Reasons." This helps analyze our sales process and identify areas for improvement. Here's how to configure lost reasons in Odoo 17 CRM
Walmart Business+ and Spark Good for Nonprofits.pdfTechSoup
"Learn about all the ways Walmart supports nonprofit organizations.
You will hear from Liz Willett, the Head of Nonprofits, and hear about what Walmart is doing to help nonprofits, including Walmart Business and Spark Good. Walmart Business+ is a new offer for nonprofits that offers discounts and also streamlines nonprofits order and expense tracking, saving time and money.
The webinar may also give some examples on how nonprofits can best leverage Walmart Business+.
The event will cover the following::
Walmart Business + (https://business.walmart.com/plus) is a new shopping experience for nonprofits, schools, and local business customers that connects an exclusive online shopping experience to stores. Benefits include free delivery and shipping, a 'Spend Analytics” feature, special discounts, deals and tax-exempt shopping.
Special TechSoup offer for a free 180 days membership, and up to $150 in discounts on eligible orders.
Spark Good (walmart.com/sparkgood) is a charitable platform that enables nonprofits to receive donations directly from customers and associates.
Answers about how you can do more with Walmart!"
2. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Doulah and Islam. 120
Climate Data
The investigation has been carried out using daily records
of one important climatic variable, rainfall, observed at 34
ground based stations of Bangladesh Meteorological
Department (BMD) distributed over the country during the
time period 1991-2013 (http://www.data.gov.bd/).
Although Bangladesh Meteorological Department (BMD)
has thirty-six (36) ground based stations, but only data of
thirty-four (34) stations has been taken in this research. At
initial stage, quality of rainfall data is checked by verifying
the following criteria (Erin, 1984; Masoodian, 2005)
(i) Non-existence of dates
(ii) Negative monthly rainfall
(iii) Monthly winter rainfall>100mm
(iv) Weather stations > 35% missing data
(v) Stations with gaps three or more years in between
series
If any of the above mentioned point is true for any dataset,
it is identified as erroneous data. So, two BMD stations are
discarded after following the preceding conditions
considering data period from 1991 to 2013. R-based
program is used to detect homogenous climate zones.
METHODOLOGY
For clustering purposes there are two widely used
methods: the hierarchical and the non-hierarchical
(partitional). The hierarchical clustering process can be
categorized as divisive when a large data set is divided
into several small groups and, agglomerative when a small
data set are put together to create a larger cluster (Dyeret,
1975; Gan et al. 2007; Sarah et al. 2011). There are so
many descriptive statistics available in the literature
(Doulah, 2018) for evaluating the data that we have
applied the most frequently used measures in our analysis
first and then we have used the clustering techniques.
Agglomerative Algorithms
Some of the agglomerative algorithms are: single linkage,
complete linkage, average linkage, centroid and Ward’s
method. Several proximity measures like Euclidean
distance, Minkowski distance, Manhattan distance,
maximum distance, correlation based distance and
Canberra distance are used. The partitioned clustering
process is based on recover the natural grouping present
in the data thought a single partition. The partitioned
algorithms are divided as: K-means, Fuzzy and model
based clustering techniques (Hossen et al. 2015; Han &
Kamber, 2006; Johnson & Wichern, 1998).
Table 1: Some of the agglomerative algorithms
Methods Statistic Explanation
Single
Linkage 12
,
min ( , )i j
i j
D d x y This is the distance between the closest members of the two clusters.
Complete
Linkage 12
,
max ( , )i j
i j
D d x y This is the distance between the members that are farthest apart (most
dissimilar)
Average
Linkage 12
1 1
1
( , )
k l
i j
i j
D d x y
kl
This method involves looking at the distances between all pairs and
averages all of these distances. This is also called UPGMA-Un-
Weighted Pair Group Mean Averaging.
Centroid
Method 12 ( , )D d x y This involves finding the mean vector location for each of the clusters
and taking the distance between the two centroids.
Ward
Method
12
2. .
.
k l
D x y
k l
This method minimizes the total within-cluster variance. Those clusters
are combined whose merger results in minimum information loss (ESS
criterion)
Distance Measures
The distances are normally used to measure the similarity
or dissimilarity between two data objects. Though there
are various distance measure available in the literature
(Hossen & Doulah, 2016; Meila, 2007; Yashwantl &
Sananse, 2015), commonly used six distance measures
are considered here. A simple description of distance
measures are given below:
Non-hierarchical Algorithms
K-means clustering
K-means clustering intends to partition n objects into k
clusters in which each object belongs to the cluster with
the nearest mean. K-Means is relatively an efficient
method (Gong & Richman, 1995; Nathan & McMahon,
1990). However, we need to specify the number of
clusters, in advance and the final results are sensitive to
initialization and often terminates at a local optimum.
3. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Int. J. Stat. Math. 121
Table 2: Some of the distance measures
Distance Statistic
Euclidean 2
( , ) ( )i id x y x y
Manhattan
1
( , )
p
i i
i
d x y x y
Minkowski 1/
1
( , )
mmp
i i
i
d x y x y
Maximum ( , ) max i id x y x y
Correlation
1
2 2
1 1
( )( )
( , ) 1
( ) ( )
p
i i
i
cor p n
i i
i i
x x y y
d x y
x x y y
Canberra
1
( , )
p
i i
i i i
x y
d x y
x y
Algorithm
1. Clusters the data into k groups where k is predefined.
2. Select k points at random as cluster centers.
3. Assign objects to their closest cluster center according
to the Euclidean distance function.
4. Calculate the centroid or mean of all objects in each
cluster.
5. Repeat steps 2, 3 and 4 until the same points are
assigned to each cluster in consecutive rounds.
Fuzzy clustering
The Fuzzy clustering is a clustering algorithm developed
by Dunn, and later on improved by Bezdek (Luxburg,
2010). It is useful when the required numbers of clusters
are pre-determined; thus, the algorithm tries to put each of
the data points to one of the clusters. What makes FCM
different is that it does not decide the absolute membership
of a data point to a given cluster; instead, it calculates the
likelihood (the degree of membership) that a data point will
belong to that cluster. Hence, depending on the accuracy
of the clustering that is required in practice, appropriate
tolerance measures can be put in place. Since the
absolute membership is not calculated, FCM can be
extremely fast because the number of iterations required
to achieve a specific clustering exercise corresponds to
the required accuracy.
Model-Based clustering
The model-based clustering framework consists of three
major steps (Baldwin & Lakshmivarahan, 2002; Everitt,
1993):
(a) Initialize the EM algorithm using the partitions from
model-based agglomerative hierarchical clustering.
(b) Estimate the parameters using the EM algorithm;
(c) Choose the model and the number of clusters
according to the BIC.
In this method, a model is hypothesized for each cluster to
find the best fit of data for a given model. Also, this method
locates the clusters by clustering the density function.
Thus, it reflects the spatial distribution of the data points.
This method also provides a way to determine the number
of clusters. That was based on standard statistics, taking
outlier or noise into account. It, therefore, yields robust
clustering methods.
Validity Indices
In the literature of data clustering, a lot of clustering
algorithms have been proposed for different applications
and different sizes of data. But clustering a dataset is an
unsupervised process; there are no predefined classes
and no examples that can show that the clusters found by
the clustering algorithms are valid (Hardy, 1996; Luxburg,
2010). To compare the clustering results of difference
clustering algorithms, it is necessary to develop some
validity criteria. Also, if the number of clusters is not given
in the clustering algorithms, it is a highly nontrivial task to
find the optimal number of clusters in the data set. To do
this, we need some cluster validity methods. The notation
& meaning of the validity indices are: n = number of
observations, p= number of variables, q= number of
clusters, X = ijx , 1,2,......,i n ; 1,2,.....,j p ; =
n p data matrix of p variables measured on n
independent observations, x = centroid of data matrix X
, kn = number of objects in cluster kC ,
ix = p -dimensional vector of observations of the
th
i object
in cluster kC ,
qW =
1
( )( )
k
q
T
i k i k
k i c
x c x c
is the within-group
dispersion matrix for data clustered into q clusters,
qB =
1
( )( )
p
T
k k k
k
n c x c x
is the between-group
dispersion matrix for data clustered into q clusters,
T =Total Sum of Squares,
2
S =
1
1 p
j
j j
BGSS
p TSS
,
jBGSS =
2
1
( )
p
jk kj
k
n c x
and
jTSS =
2
1
( )
p
jij
i
x x
. The following cluster validity
methods are given in Table 3 below:
4. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Doulah and Islam. 122
Table 3: Some of the validity indices
Validity Index Statistic Criteria for selection
Krzanowski and Lai ( )
1
q
q
DIFF
KL q
DIFF
Maximum value of the index
Calinski and Harabasz
( ) / ( 1)
( )
( ) / ( )
q
q
trace B q
CH q
trace W n q
Maximum value of the index
Scott and Symons
det( )
cot log
det( )q
T
S t n
W
Maximum difference Between hierarchy levels of the index
Marriot
2
det( )qMarriot q W Max. value of second differences between levels of the index
TrCovW cov (cov( ))qTr W trace W Maximum difference between hierarchy levels of the index
TraceW ( )qTraceW trace W Maximum value of absolute second differences between
levels of the index
Friedman and Rubin
1
( )q qFriedman trace W B
Maximum difference between hierarchy levels of the index
Rubin
det( )
det( )q
T
Rubin
W
Max. value of second differences between levels of the index
Ratkowsky 1/2
S
Ratkowsky
q
Maximum value of the index
To settle the cluster number is a difficult task since there is
not a specific method for this purpose and the number is
the result of the assignation of training clusters until the
optimal value is found. Some of the indexes to be used for
establishing the number of clusters can also be employed
to validate cluster quality.
RESULTS AND DISCUSSIONS
The statistical analysis for the monthly rainfall data of 34
meteorological stations are summarized in Table 4, where
the mean, standard deviation (SD), coefficient of variation
(CV), skewness (S) and kurtosis (K) are given.
Chuadanga, Rajshahi and Ishurdi stations were less
monthly rainfall affected station.
Table 4: Descriptive statistics of selected meteorological stations
S/No Stations Mean Standard Deviation (SD) Coefficient of Variation (CV)Skewness (S) Kurtosis (K)
1 Barisal 170.14130 182.63744 107.3445632 1.231543 1.694753
2 Bhola 180.69565 196.6325455 108.8197437 1.174785 1.173446
3 Bogra 141.34782 160.1997005 113.3372227 1.254129 1.168203
4 Chandpur 165.47463 175.7854562 106.2310567 1.238886 1.718038
5 Chittagong 245.56521 289.6960779 117.9711365 1.42098 1.660857
6 chuadanga 125.63768 144.921468 115.3487287 1.599033 3.050739
7 Comilla 172.57608 181.6644199 105.2662759 1.246822 1.446507
8 Cox's Bazar 315.66666 366.0148796 115.9498035 1.103031 0.226135
9 Dhaka 167.45652 175.4360228 104.7651181 1.126478 0.823015
10 Dinajpur 163.86231 196.6004048 119.9790203 1.25614 1.160553
11 Faridpur 143.28623 148.4809432 103.6254086 1.104658 0.958157
12 Feni 240.85869 258.866027 107.4763053 0.956138 -0.03495
13 Hatiya 260.87681 300.1593141 115.0578744 1.076978 0.453877
14 Ishurdi 120.08333 132.3241074 110.1935662 1.314394 1.715892
15 Jessore 139.82608 152.5011076 109.0648469 1.352668 2.405591
16 Khepupara 238.42029 258.0383992 108.2283724 0.921879 -0.10036
17 Khulna 148.85144 156.7208394 105.2867407 1.140018 1.149058
18 Kutubdia 260.01087 319.3049666 122.8044686 1.661191 3.375483
19 M.court 248.76449 265.0863303 106.5611605 0.928332 -0.07512
20 Madaripur 157.97826 166.8337209 105.6054928 1.15376 1.174079
5. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Int. J. Stat. Math. 123
Table 4 Continue: Descriptive statistics of selected meteorological stations
S/No Stations Mean Standard Deviation (SD) Coefficient of Variation (CV)Skewness (S) Kurtosis (K)
21 Mongla 160.46739 169.7316889 105.773321 1.075138 1.189396
22 Mymensingh 182.43840 195.008102 106.8898301 1.055405 0.518139
23 Patuakhali 214.17391 241.5987481 112.8049372 1.15553 0.703598
24 Rajshahi 116.77898 134.3791315 115.0713297 1.416645 2.283301
25 Rangamati 213.88043 234.8292769 109.794651 1.337556 1.671646
26 Rangpur 183.99275 203.9673948 110.8562108 1.044524 0.442857
27 Sandwip 301.01087 383.7351879 127.4821698 2.278171 9.364013
28 Satkhira 145.73550 147.8694718 101.4642722 0.877968 -0.13951
29 Sitakunda 260.95289 291.0339461 111.5273859 1.218129 0.953242
30 Srimangal 193.93115 192.6867235 99.3583105 1.050778 0.752041
31 sydpur 178.93478 215.9816055 120.7040925 1.271583 0.923776
32 Sylhet 323.86594 324.7227743 100.2645639 0.85881 -0.05229
33 Tangail 151.64855 157.5703353 103.9049398 1.076241 0.772751
34 Teknaf 367.02173 454.7473299 123.9020149 1.213534 0.588181
Hierarchical Clustering methods
Now we mentioned below the dendrogram of several linkage methods based on different distance measures for the
monthly rainfall data of 34 meteorological stations.
Single Linkage
Euclidean Distance Minkowski Distance Manhattan Distance
Correlation method Maximum Canbera
Figure 1: Dendrogram of Single linkage for selected rainfall station for different distance measure
6. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Doulah and Islam. 124
Complete Linkage
Euclidean Distance Minkowski Distance Manhattan Distance
Correlation method Maximum Canbera
Figure 2: Dendrogram of Complete linkage for selected rainfall station for different distance measure
Average Linkage
Euclidean Distance Minkowski Distance Manhattan Distance
Correlation method Maximum Canbera
Figure 3: Dendrogram of Average linkage for selected rainfall station for different distance measure
7. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Int. J. Stat. Math. 125
Ward.D
Euclidean Distance Minkowski Distance Manhattan Distance
Correlation method Maximum Canbera
Figure 4: Dendrogram of Ward linkage for selected rainfall station for different distance measure
Centroid
Euclidean Distance Minkowski Distance Manhattan Distance
Correlation method Maximum Canbera
Figure 5: Dendrogram of Centroid linkage for selected rainfall station for different distance measure
To sum up the afore-depicted dendrogram from Figure 1-
5 of all agglomerative hierarchical clustering (single
linkage, complete linkage, average linkage, ward and
centroid) based on the proximity measures (Euclidean
distance, Minkowski distance, Manhattan distance,
correlation method, maximum distance and Canberra
distance) have identified homogeneous climate zones in
Bangladesh. Here, we have got the patent homogeneous
climate zones in Bangladesh based on Ward method with
proximity measures. Therefore, we conclude that Ward
method is the best in this perspective.
8. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Doulah and Islam. 126
The seven homogeneous climate zones in Bangladesh are
shown below:
Cluster 1: Rangpur, Sydpur, Dinajpur
Cluster 2: Satkhira, Khulna, Mongla, Ishurdi, chuadanga,
Rajshahi, Jessore
Cluster 3: Barisal, Bhola, Chandpur, Madaripur,
Srimangal, Comilla, Dhaka, Faridpur,
Mymensingh, Bogra, Tangail,
Cluster 4: Sandwip, Cox’s bazar, Teknaf
Cluster 5: Sylhet
Cluster 6: Hatiya, Khepupara, Patuakhali, Feni, M.court
Cluster 7: Kutubdia, Rangamati, Chittagong, Sitakunda
Nonhierarchical Clustering methods
The results of Nonhierarchical methods for the monthly
rainfall data of 34 meteorological stations are shown
below:
Figure 6: K-means clustering
Figure 7: Fuzzy clustering
9. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Int. J. Stat. Math. 127
Figure 8: Model based clustering
Reviewing the weather stations in the seven clusters, it is
apparent that from Figure 6-8, k-means, Fuzzy and Model
based clustering methods gave results generally
consistent with the linkage hierarchical methods. Weather
stations with common or compatible geographical
locations cluster. They also depicted the seven
homogeneous climate zones in Bangladesh.
Validity Indices
Many clustering algorithms have been designed, and thus
it is important to decide how to choose a good clustering
algorithm for a given data set and how to evaluate a
clustering method. In these circumstances, one of the
techniques, validity indices may help to check the perfect
selection of cluster size. Validity Indices can be used for
defining the number of clusters for 34 meteorological
stations of rainfall. The following validity indices results are
shown in the given below:
Table 5: Different validity indices values for selecting the number of cluster
No of Cluster
Index Name 2 3 4 5 6 7 8 9 10
krzanowski and Lai 2.53 1.40 1.5065 0.9954 0.0126 213.7057 0.8814 0.9856 1.1411
Calinski and Harabasz 23.15 12.68 8.9764 7.1303 5.8794 77.3498 67.5873 60.2785 54.5556
Scott and Symons 181.4 312.26 466.4679 726.1218 896.2206 1367.598 1520.601 1645.157 1755.115
Marriot 0 -4.6E+54 -6.7E+54 4.2E+54 -3.7E+54 5.5E+54 2.0E+53 8.7E+52 0.0E+00
TrCovW 0 777.564 365.859 376.821 170.214 789.408 143.292 502.706 155.105
TraceW 0 0 18.684 0.504 20.313 4973.39 4951.4 0.6 2.543
Friedman and Rubin 0 3.0336 2.4694 3.1761 2.2267 218.1186 5.7366 4.9847 4.2741
Rubin 0 0 -0.0023 0.0001 -0.0025 1.6127 -1.5764 0.0008 -0.001
Ratkowsky 0.19 0.1622 0.1447 0.1328 0.1233 0.3005 0.2823 0.2673 0.2545
10. Defining Homogenous Climate zones of Bangladesh using Cluster Analysis
Doulah and Islam. 128
Figure 9: Bar plot of different validity indices values
Here we checked the validity of the cluster of climate
variable, rainfall, by using well-recognized nine validity
indices. In this paper from Table 5 & Figure 9 we found that
there are seven clusters in our dataset. Therefore, it is to
be concluded that there are seven homogenous climate
zones in Bangladesh.
CONCLUSION
Cluster Analysis is an unsupervised machine learning
method. It offers a way to partition a dataset into subsets
that share common patterns. Notably, there are many
cluster analysis algorithms to choose from, each making
certain assumptions about the data and about how cluster
should be formed. In this study, we applied 5-
agglomerative hierarchical clustering technique based on
6-proximity measure and other popular 3-clustering
technique, 9- cluster validity index. Although many of the
previous studies did not use objective validation methods
that are well-justified or did not use validation methods at
all, previous studies on the subtyping of weather stations,
all employed a single clustering method. Here, formal
methods of cluster validation examine how well a cluster
fits a dataset. The goal of this study is to identify the similar
weather station from a group of weather stations with
rainfall data by using cluster analysis. To sum up the whole
discussion we conclude that ward method, K-means and
Fuzzy clustering methods are the best methods among all
other methods and Bangladesh has seven homogenous
climate zones for analyzing rainfall data.
ACKNOWLEDGEMENTS
The author would like to thank the anonymous reviewers
for their helpful comments to enhance the quality of this
paper.
REFERENCES
Baldwin ME, Lakshmivarahan S. (2002). Rainfall
classification using histogram analysis an example of
data mining in meteorology, Technical Report, 4:342-
357.
Dyeret TGJ. (1975). Assignment of stations into
homogeneous group, Quarterly Journal of
Meteorological Society, 101:1005-1012.
Doulah MSU. (2018). Alternative Measures of Standard
Deviation Coefficient of Variation and Standard Error,
International Journal of Statistics and Applications,
8(6):309-315.
Erin S (1984). Climatology and its methods. 3rd edn.
Istanbul.
Everitt BS (1993). Cluster analysis. Edward Arnold,
London.