Data Mining (DM) is a powerful and relatively new field that offers various techniques for analyzing real-world problems. Within DM, environmental mining is an essential and interesting research area: it makes it possible to extract fundamental insights and knowledge from massive volumes of environmental data. Water quality describes the condition of water in the environment. It represents the concentration and state (dissolved or particulate) of some or all of the organic and inorganic material present in the water, together with certain physical characteristics of the water. Dissolved Oxygen (DO) is one of the important aspects of water quality. DO is the quantity of gaseous oxygen (O2) incorporated into the water, and it is essential for keeping aquatic organisms alive. The DO level in water can be measured by various methods, and data mining techniques are used to estimate the DO level in different types of water. A number of DM methods have been used to analyze the DO level, such as Multi-Layer Perceptron, Multivariate Linear Regression, Factor Analysis, and Feed Forward Neural Network. This survey discusses in detail the methods used specifically for the analysis of DO level.
Impact on Aquatic Environment for Water Pollution in the Vahirab River | theijes
Water is a valued natural resource for the existence of all living organisms. The Vahirab river is a prominent and vital source of water from Jessore to Khulna. The prime cause of the deterioration and pollution of the river's water quality is effluent discharged from industries, municipal sewage, household wastes, clinical wastes and oils. The purpose of this study is to investigate the impact of this wastewater on the river and thus to provide an updated report on the state of water quality of the river Vahirab. Water samples were collected from different areas of the river and analyzed for physico-chemical parameters and fresh water elements from July 2013 to November 2013. The values of temperature, turbidity, pH, chloride, hardness, DO, BOD, COD, Fe, As, Pb, Cd and Cr in the river water were measured. During the study period the temperature, turbidity, pH, hardness, chloride, DO, BOD and COD were found to be 28°C to 31.5°C, 37 to 947 NTU, 7.2 to 8.01, 92 to 140 ppm, 9 to 34 ppm, 122 to 5.51 mg/L, 0.22 to 5.79 mg/L and 3.80 to 10.80 mg/L respectively. The concentrations of Fe, Pb, As, Cd and Cr were found to be 0.10 to 2.60 ppm, 0.004 to 0.025 ppm, 0.001 to 0.004 ppm, 0.00015 to 0.0094 ppm and 0.001 to 0.006 ppm respectively. The results show that the values of pH, Cl-, hardness, As, Pb, Cd and Cr were within the standard limits, but the values of temperature, turbidity, DO, BOD, COD and Fe were not within the acceptable limits recommended by the WHO and BDS guidelines. The results of this investigation will help the people of the research area to be cautious when using water from the Vahirab river.
Assessment on the Ecosystem Service Functions of Nansi Lake in China | IJERA Editor
The assessment of ecosystem service functions is one of the focus areas of modern ecological and environmental research. As a typical shallow macrophytic lake in China, Nansi Lake is selected as the study area. Based on the indicator system and assessment models established in this research, the ecosystem service functions of Nansi Lake are assessed. The results show that the ecosystem service functions of drinking water source area, aquatic product supply, ecological habitat, entertainment and landscape, and water purification of the lakeside zone are assessed as medium (2.2), good (3.5), medium (3), medium (2.55) and medium (3) respectively, and the overall ecosystem service function of Nansi Lake can be considered "Medium". Eutrophication control and ecological restoration of the lakeside wetland need to be enhanced in the future.
Physico-chemical Analysis of Groundwater near Dumping Site of Gorakhpur City | ijtsrd
Groundwater is the water present beneath Earth's surface in rock and soil pore spaces and in the fractures of rock formations. About 30 percent of all readily available freshwater in the world is groundwater. A unit of rock or an unconsolidated deposit is called an aquifer when it can yield a usable quantity of water. The depth at which soil pore spaces or fractures and voids in rock become completely saturated with water is called the water table. Groundwater is recharged from the surface; it may discharge from the surface naturally at springs and seeps, and can form oases or wetlands. Groundwater is also often withdrawn for agricultural, municipal, and industrial use by constructing and operating extraction wells. The study of the distribution and movement of groundwater is hydrogeology, also called groundwater hydrology. Typically, groundwater is thought of as water flowing through shallow aquifers, but, in the technical sense, it can also contain soil moisture, permafrost (frozen soil), immobile water in very low permeability bedrock, and deep geothermal or oil formation water. Groundwater is hypothesized to provide lubrication that can possibly influence the movement of faults. It is likely that much of Earth's subsurface contains some water, which may be mixed with other fluids in some instances. Groundwater is often cheaper, more convenient and less vulnerable to pollution than surface water. Therefore, it is commonly used for public water supplies. For example, groundwater provides the largest source of usable water storage in the United States, and California annually withdraws the largest amount of groundwater of all the states. Underground reservoirs contain far more water than the capacity of all surface reservoirs and lakes, including the Great Lakes. Many municipal water supplies are derived solely from groundwater. Over 2 billion people rely on it as their primary water source worldwide. Use of groundwater has related environmental issues. For example, polluted groundwater is less visible and more difficult to clean up than pollution in rivers and lakes. Groundwater pollution most often results from improper disposal of wastes on land. Major sources include industrial and household chemicals and garbage landfills, excessive fertilizers and pesticides used in agriculture, industrial waste lagoons, tailings and process wastewater from mines, industrial fracking, oil field brine pits, leaking underground oil storage tanks and pipelines, sewage sludge and septic systems. Additionally, groundwater is susceptible to saltwater intrusion in coastal areas and can cause land subsidence when extracted unsustainably, leading to sinking cities and loss in elevation. These issues are made more complicated by sea level rise and other effects of climate change, which will affect the water cycle. Kavita Tripathi | Anil Kumar Dwivedi "Physico-chemical Analysis of Groundwater near Dumping Site of Gorakhpur City" Published in International Journal of
Inland Surface Water Analysis: A comparative study and their Environmental Im... | IJERA Editor
Water is the most essential basic amenity of human life. Water for domestic and commercial purposes is available in two forms: ground water and inland surface water. Because the proportion of fresh water is small, it is very important that whatever fraction we use meets its designated best use. In the present paper, different regions of Haryana are analysed with respect to the physicochemical and biological parameters given by the Central Pollution Control Board (CPCB), which directly indicate the designated best use of the water for each class, along with the BIS standards for drinking water. The aim of this study is to form a sound picture of the utility of inland surface water in different regions of Haryana. The results revealed that the surface water quality varied spatially. Water at most of the locations is not suitable for drinking purposes as per the BIS guidelines. It is further suggested that some kind of treatment for TDS is immediately required in the studied areas to avoid water-borne health problems among residents.
The peer-reviewed International Journal of Engineering Inventions (IJEI) was started with a mission to encourage contributions to research in science and technology, and to encourage and motivate researchers in challenging areas of science and technology.
International Journal of Mathematics and Statistics Invention (IJMSI) is an international journal intended for professionals and researchers in all fields of computer science and electronics. IJMSI publishes research articles and reviews across the whole field of mathematics and statistics, covering new teaching methods, assessment, validation and the impact of new technologies, and it will continue to provide information on the latest trends and developments in this ever-expanding subject. Papers are selected through double peer review to ensure originality, relevance, and readability. The articles published in the journal can be accessed online.
General water quality has been studied comparatively and extensively in some major rivers of West Java, Indonesia and Dhaka, Bangladesh. A water quality assessment based on physicochemical investigation, along with heavy metal concentrations in water and sediments, is presented. The results indicate that most sampling sites in the rivers of Dhaka are severely impaired in comparison with the rivers of West Java. The pollution gap in the rivers of Dhaka is also evident across seasons, with pollution more pronounced in winter than in the rainy season. All rivers were severely polluted with NOx, PO₄³⁻ and Escherichia coli (E. coli). The heavy metal concentrations of Al and Mn exceeded the international guidelines, whereas Cu, Zn and Pb were found to be below them at most of the sampling points. Cd and Fe approached the threshold limit in Dhaka.
Influence of Sulfate, Phosphate, Ammonia and Dissolved Oxygen on Biochemical ... | BRNSS Publication Hub
The present study was done to ascertain the level of biochemical oxygen demand (BOD) of Marlimund Lake, situated in Ooty, and how it is related to other parameters such as water temperature, dissolved oxygen, phosphate, sulfate, iron, and free ammonia. Four sampling sites were selected and studied over a period of 13 months, from February 2016 to February 2017. The results were computed from the best model fits using Curve Expert Version 4.2. The water temperature ranged from 8.1°C to 18.7°C, dissolved oxygen 3.468–6.976 mg/l, phosphate 0.1–1.92 mg/l, sulfate 1–18 mg/l, free ammonia 0.12–6.01 mg/l, and BOD 2.178–5.040 mg/l. BOD was found to be significantly related to dissolved oxygen (r = 0.5690291) by a 4th degree polynomial fit, to phosphate (r = 0.7095253) by a rational function fit, and to free ammonia (r = 0.7395016) by an MMF model fit. The relation with sulfate was found to be nonsignificant (r = 0.2565396) by a geometric fit model, and water temperature (r = 0.4595060) shows a sinusoidal fit.
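The r values quoted above come from nonlinear curve fits in Curve Expert. As a much simpler point of comparison, the sketch below computes the ordinary (linear) Pearson correlation coefficient between paired readings. It is not the fitting procedure used in the study, and the DO and BOD readings are hypothetical values chosen only to fall inside the reported ranges.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Linear Pearson correlation coefficient between two equal-length samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (covariance numerator)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Hypothetical paired readings in mg/l (not data from the study)
do_mg_l = [3.5, 4.2, 5.0, 5.8, 6.9]
bod_mg_l = [5.0, 4.4, 3.9, 3.1, 2.2]

r = pearson_r(do_mg_l, bod_mg_l)
print(f"r = {r:.3f}")
```

A value of r close to -1 or +1 indicates a strong linear relation; a nonlinear model such as a polynomial fit can capture relations that a single linear coefficient understates.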
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Study on Physico-Chemical Parameters of Harsi Reservoir Dabra, Gwalior Distri... | ijsrd.com
This study aimed to estimate the physico-chemical characteristics of Harsi reservoir. Harsi reservoir, located in Dabra, Gwalior district, Madhya Pradesh, is constructed on the Parwati River. Physico-chemical parameters such as water temperature, depth, transparency, electrical conductivity, turbidity, total dissolved solids, pH, dissolved oxygen, free carbon dioxide, total alkalinity, total hardness, chlorides, sulphates, nitrate, nitrite, phosphate, silicates, ammonia, BOD, COD, calcium, magnesium, sodium and potassium were analyzed monthly from January 2011 to December 2011. The results indicated that, on the basis of its physico-chemical parameters, the water can be used for drinking, domestic use, irrigation and pisciculture.
An Investigation into Brain Tumor Segmentation Techniques | IIRindia
A tumor is an anomalous mass in the brain which can be cancerous. Such anomalous growth within the restricted space inside the skull can cause problems. Detecting brain tumors from medical images such as CT or MRI scans involves segmentation (division into parts) for analysis and can be a challenging task. Accurate segmentation of brain images is essential for properly distinguishing tumor and non-tumor areas in clinical analysis. This paper describes segmentation algorithms for brain images, their advantages and disadvantages, and a comparison of the algorithms.
The agricultural sector is the backbone of our country and plays a vital role in the overall economic growth of our nation. India uses about 59% of its total area for agricultural purposes, and the sector contributes about 17% of GDP. Advanced techniques and improvements in agriculture will certainly increase the efficiency of certain farming activities. In this paper we introduce a concept for smart farming which uses wireless sensor web technology with a web-based application. This will play a crucial role in helping farmers. It aims to improve the facilities available to farmers by focusing on measuring crop production. With the help of data mining techniques and algorithms such as K-nearest neighbour and decision trees, we will gather all data related to farming and update it frequently, so that farmers and consumers get accurate knowledge of the respective crops and of suitable farming equipment. Existing systems are not very efficient at displaying such data characteristics. Our main aim is to enhance growth in the agriculture sector and make the existing system smarter, so that decision-makers can plan the expansion of agricultural activities and empower the different forces in the existing agriculture sector.
Similar to A Survey on the Analysis of Dissolved Oxygen Level in Water using Data Mining Techniques
Kidney Failure Due to Diabetics – Detection using Classification Algorithm in... | IIRindia
Data mining is an effective process for analysing chosen data from various points of view and for summarising those views into useful information. There are several types of data mining algorithms, such as classification, regression, segmentation, association and sequence analysis algorithms. A classification algorithm can be used to partition a given data set and predict one or more discrete variables based on the other attributes in the data set. The ID3 (Iterative Dichotomiser 3) algorithm begins with the original data set S as the root node. On each iteration, it goes through every unused attribute of S and calculates the entropy H(S) (or information gain IG(A)) of that attribute. It then selects the attribute with the smallest entropy (or largest information gain) value. The prime objective of this paper is to analyze data on kidney disorders due to diabetes, using a classification technique to predict the class accurately.
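The attribute-selection step of ID3 described above can be sketched as follows. This is a minimal illustration rather than the paper's implementation: the function names, the attribute names (`glucose`, `bp`) and the four toy records are hypothetical.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy H(S), in bits, of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, attribute):
    """IG(A): H(S) minus the weighted entropy of the subsets produced by splitting on A."""
    subsets = {}
    for row, label in zip(rows, labels):
        subsets.setdefault(row[attribute], []).append(label)
    remainder = sum(len(s) / len(labels) * entropy(s) for s in subsets.values())
    return entropy(labels) - remainder

def best_attribute(rows, labels, attributes):
    """Pick the attribute with the largest information gain, as ID3 does at each node."""
    return max(attributes, key=lambda a: information_gain(rows, labels, a))

# Hypothetical toy records (not data from the paper)
rows = [
    {"glucose": "high", "bp": "high"},
    {"glucose": "high", "bp": "normal"},
    {"glucose": "normal", "bp": "high"},
    {"glucose": "normal", "bp": "normal"},
]
labels = ["yes", "yes", "no", "no"]

print(best_attribute(rows, labels, ["glucose", "bp"]))
```

In this toy set, splitting on `glucose` separates the two classes completely, so it has the larger information gain; a full ID3 implementation would recurse on each resulting subset with the remaining attributes.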
Silhouette Threshold Based Text Clustering for Log Analysis | IIRindia
Automated log analysis has been a dominant subject of interest to both industry and academia. The heterogeneous nature of system logs, the disparate sources of logs (infrastructure, networks, databases and applications) and their underlying structures and formats make the challenge harder. In this paper I present less frequently used document clustering techniques to dynamically assign real-time log events (e.g. errors, warnings) to specific categories that are pre-built from a corpus of log archives. This kind of syntactic log categorization can be exploited for automatic log monitoring, priority flagging and dynamic solution recommendation systems. I propose practical strategies to cluster and correlate high-volume log archives and high-velocity real-time log events, in terms of both solution quality and computational efficiency. First, I compare two traditional partitional document clustering approaches for categorizing a high-dimensional log corpus. To select a suitable model for the problem, Entropy, Purity and Silhouette Index are used to evaluate these learning approaches. Then I propose computationally efficient approaches to generate a vector space model for the real-time log events. To dynamically relate them to the categories from the corpus, I suggest combining a critical distance measure with a least distance approach. In addition, I introduce and evaluate three different critical distance measures to ascertain whether a real-time event belongs to a totally new category that is unobserved in the corpus.
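The Silhouette Index mentioned above scores how well each point sits in its cluster: for a point with mean distance a to its own cluster and smallest mean distance b to any other cluster, the score is (b - a) / max(a, b). The sketch below is a minimal illustration under stated assumptions: it uses Euclidean distance on small 2-D points, whereas the paper works with log-event vectors and its exact distance measure is not given here; the two example clusterings are hypothetical.

```python
from math import dist

def silhouette(point, own_cluster, other_clusters):
    """Silhouette score of one point: (b - a) / max(a, b)."""
    # Other members of the point's own cluster (assumes no duplicate coordinates)
    peers = [p for p in own_cluster if p != point]
    if not peers:
        return 0.0  # common convention for single-member clusters
    a = sum(dist(point, p) for p in peers) / len(peers)
    # Mean distance to the nearest other cluster
    b = min(sum(dist(point, p) for p in c) / len(c) for c in other_clusters)
    return (b - a) / max(a, b)

def mean_silhouette(clusters):
    """Average silhouette score over every point in every cluster."""
    scores = []
    for i, cluster in enumerate(clusters):
        rest = clusters[:i] + clusters[i + 1:]
        scores.extend(silhouette(p, cluster, rest) for p in cluster)
    return sum(scores) / len(scores)

# Hypothetical clusterings of the same four points
well_separated = [[(0, 0), (0, 1)], [(5, 5), (5, 6)]]
badly_mixed = [[(0, 0), (5, 5)], [(0, 1), (5, 6)]]

print(mean_silhouette(well_separated), mean_silhouette(badly_mixed))
```

Scores near 1 indicate compact, well-separated clusters, while negative scores suggest points assigned to the wrong cluster, which is why the index can be used to compare candidate clusterings.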
Analysis and Representation of Igbo Text Document for a Text-Based SystemIIRindia
The advancement in Information Technology (IT) has assisted in inculcating the three Nigeria major languages in text-based application such as text mining, information retrieval and natural language processing. The interest of this paper is the Igbo language, which uses compounding as a common type of word formation and as well has many vocabularies of compound words. The issues of collocation, word ordering and compounding play high role in Igbo language. The ambiguity in dealing with these compound words has made the representation of Igbo language text document very difficult because this cannot be addressed using the most common and standard approach of the Bag-Of-Words (BOW) model of text representation, which ignores the word order and relation. However, this cause for a concern and the need to develop an improved model to capture this situation. This paper presents the analysis of Igbo language text document, considering its compounding nature and describes its representation with the Word-based N-gram model to properly prepare it for any text-based application. The result shows that Bigram and Trigram n-gram text representation models provide more semantic information as well addresses the issues of compounding, word ordering and collocations which are the major language peculiarities in Igbo. They are likely to give better performance when used in any Igbo text-based system.
A Survey on E-Learning System with Data MiningIIRindia
E-learning process has been widely used in university campus and educational institutions are playing vital role to enhance the skill set of students. Modern E-learning done by many electronic devices, such as smartphones, Tabs, and so on, on existing E-learning tools is insufficient to achieve the purpose of online training of education. This paper presents a survey of online e-Learning authoring tools for creating and integrating reusable e-learning tool for generation and enhancing existing learning resources with them. The work concentrates on evaluation of the existing e-learning tools a, and authoring tools that have shown good performance in the past for online learners. This survey work takes more than 20 online tools that deal with the educational sector mechanism, for the purpose of observations, and the outcome were analyzed. The findings of this paper are the main reason for developing a new tool, and it shows that educators can enhance existing learning resources by adding assessment resources, if suitable authoring tools are provided. Finally, the different factors that assure the reusability of the created new e-learning tool has been analysed in this paper.E-learning environment is a guide for both students and tutorial management system. The useful on the e-learning system for apart from students and distance learning students. The purpose of using e-learning environment for online education system, developed in data mining for more number of clustering servers and resource chain has been good.
Image Segmentation Based Survey on the Lung Cancer MRI ImagesIIRindia
Educational data mining (EDM) creates high impact in the field of academic domain. The methods used in this topic are playing a major advanced key role in increasing knowledge among students. EDM explores and gives ideas in understanding behavioral patterns of students to choose a correct path for choosing their carrier. This survey focuses on such category and it discusses on various techniques involved in making educational data mining for their knowledge improvement. Also, it discusses about different types of EDM tools and techniques in this article. Among the different tools and techniques, best categories are suggested for real world usage.
The Preface Layer for Auditing Sensual Interacts of Primary Distress Conceali...IIRindia
Resting anterior brain electrical activity, self-report measures of Behavioral Approach System (BAS) and Behavioral Inhibition System (BIS) strength, and common levels of Positive Affect (PA) and Negative Affect (NA) were composed from 46 unselected undergraduates two split occasions Electroencephalogram (EEG) measures of prefrontal asymmetry and the self-report measures showed excellent internal reliability, steadiness and tolerable test-retest stability. Strong connection betweens the unconstrained facial emotional expressions and the full of feeling states correlated cerebrum movement. When seeing dreadful as contrasted with unbiased faces, members showed larger amounts of actuation inside the privilege average prefrontal cortex (PFC). To propose a multimodal method to deal with assess Efficient Practical near Infrared Spectroscopy (EPNIS) signals and EEG signals for full of feeling state identification. Outcomes demonstrate that proposed technique with EPNIS enhances execution over EPNIS methodologies. Based on
Feature Based Underwater Fish Recognition Using SVM ClassifierIIRindia
An approach for underwater fish recognition based on wavelet transform is presented in this paper. This approach decomposes the input image into sub-bands by using the multi resolutional analysis known as Discrete Wavelet Transform (DWT). As each sub-band in the decomposed image contains useful information about the image, the mean values of every sub-band are assumed as features. This approach is tested on Underwater Photography - A Fish Database. The database contains 7953 pictures of 1458 different species. The database is considered for the classification based on Support Vector machine (SVM) classifier. The result shows that maximum recognition accuracy of 90.74% is achieved by the wavelet features.
A Survey on Educational Data Mining TechniquesIIRindia
Educational data mining (EDM) creates high impact in the field of academic domain. The methods used in this topic are playing a major advanced key role in increasing knowledge among students. EDM explores and gives ideas in understanding behavioral patterns of students to choose a correct path for choosing their carrier. This survey focuses on such category and it discusses on various techniques involved in making educational data mining for their knowledge improvement. Also, it discusses about different types of EDM tools and techniques in this article. Among the different tools and techniques, best categories are suggested for real world usage.
The objective of this research work is focused on the right cluster creation of lung cancer data and analyzed the efficiency of k-Means and k-Medoids algorithms. This research work would help the developers to identify the characteristics and flow of algorithms. In this research work is pertinent for the department of oncology in cancer centers. This implementation helps the oncologist to make decision with lesser execution time of the algorithm.It is also enhances the medical care applications. This work is very suitable for selection of cluster development algorithm for lung cancer data analysis.Clustering is an important technique in data mining which is applied in many fields including medical diagnosis to find diseases. It is the process of grouping data, where grouping is recognized by discovering similarities between data based on their features. In this research work, the lung cancer data is used to find the performance of clustering algorithms via its computational time. Considering a limited number attributes of lung cancer data, the algorithmic steps are applied to get results and compare the performance of algorithms. The partition based clustering algorithms k-Means and k-Mediods are selected to analyze the lung cancer data.The efficiency of both the algorithms is analyzed based on the results produced by this approach. The finest outcome of the performance of the algorithm is reported for the chosen data concept.
A Study on MRI Liver Image Segmentation using Fuzzy Connected and Watershed T...IIRindia
A comparison study between automatic and interactive methods for liver segmentation from contrast-enhanced MRI images is ocean. A collection of 20 clinical images with reference segmentations was provided to train and tune algorithms in advance. Employed algorithms include statistical shape models, atlas registration, level-sets, graph-cuts and rule-based systems. All results were compared to refer five error measures that highlight different aspects of segmentation accuracy. The measures were combined according to a specific scoring system relating the obtained values to human expert variability. In general, interactive methods like Fuzzy Connected and Watershed Methods reached higher average scores than automatic approaches and featured a better consistency of segmentation quality. However, the best automatic methods (mainly based on statistical shape models with some additional free deformation) could compete well on the majority of test images. The study provides an insight in performance of different segmentation approaches under real-world conditions and highlights achievements and limitations of current image analysis techniques. In this paper only Fuzzy Connected and Watershed Methods are discussed.
A Clustering Based Collaborative and Pattern based Filtering approach for Big...IIRindia
With web services developing and aggregating in application range, benefit revelation has turned into a hot issue for benefit organization and service management. Service clustering gives a promising approach to part the entire seeking space into little areas in order to limit the disclosure time successfully. In any case, semantic data is a basic component amid the entire arranging process. Current industrialized Web Service Portrayal Language (WSPL) does not contain enough data for benefit depiction. Thusly, a service clustering technique has been proposed, which upgrades unique WSPL report with semantic data by methods for Connected Open Information (COI). Examination based genuine service information has been performed, and correlation with comparable techniques has additionally been given to exhibit the adequacy of the strategy. It is demonstrated that using semantic data from COI improves the exactness of service grouping. Furthermore, it shapes a sound base for promote thorough preparing with semantic data.
Hadoop and Hive Inspecting Maintenance of Mobile Application for Groceries Ex...IIRindia
Numerous movable applications on secure groceries expenditure and e-health have designed recently. Health aware clients respect such applications for secure groceries expenditure, particularly to avoid irritating groceries and added substances. However, there is the lack of a complete database including organized or unstructured information to help such applications. In the paper propose the Multiple Scoring Frameworks (MSF), a healthy groceries expenditure search service for movable applications using Hadoop and MapReduce (MR). The MSF works in a procedure behind a portable application to give a search service for data on groceries and groceries added substances. MSF works with similar logic from a web search engine (WSE) and it crawls over Web sources cataloguing important data for possible utilize in reacting to questions from movable applications. MSF outline and advancement are featured in the paper during its framework design, inquiry understanding, its utilization of the Hadoop/MapReduce infrastructure, and activity contents.
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...IIRindia
Educational Data mining(EDM)is a prominent field concerned with developing methods for exploring the unique and increasingly large scale data that come from educational settings and using those methods to better understand students in which they learn. It has been proved in various studies and by the previous study by the authors that data mining techniques find widespread applications in the educational decision making process for improving the performance of students in higher educational institutions. Classification techniques assumes significant importance in the machine learning tasks and are mostly employed in the prediction related problems. In machine learning problems, feature selection techniques are used to reduce the attributes of the class variables by removing the redundant and irrelevant features from the dataset. The aim of this research work is to compares the performance of various feature selection techniques is done using WEKA tool in the prediction of students’ performance in the final semester examination using different classification algorithms. Particularly J48, Naïve Bayes, Bayes Net, IBk, OneR, and JRip are used in this research work. The dataset for the study were collected from the student’s performance report of a private college in Tamil Nadu state of India. The effectiveness of various feature selection algorithms was compared with six classifiers and the results are discussed. The results of this study shows that the accuracy of IBK is 99.680% which is found to be
A Review of Edge Detection Techniques for Image SegmentationIIRindia
Edge detection is a key stride in Image investigation. Edges characterize the limits between areas in a image, which assists with division and article acknowledgment.Edge discovery is a image preparing method for finding the limits of articles inside Image. It works by distinguishing irregular in brilliance and utilized for Image division and information extraction in zones, for example, Image preparing, PC vision and Image vision. There are likely more algorithms in a writing of upgrading and distinguishing edges than whatever other single subject.In this paper, the principle is to concentrate most usually utilized edge methods for Image segmentation.
Leanness Assessment using Fuzzy Logic Approach: A Case of Indian Horn Manufac...IIRindia
Lean principles are being implemented by many industries today that focus on improving the efficiency of the operations for reducing the waste, efforts and consumption. Organizations implementing lean principles can be assessed using the some tools. This paper attempts to assess the lean implementation in a leading Horn manufacturing industry in South India. The twofold objectives are set to be achieved through this paper. First is to find the leanness level of a manufacturing organization for which a horn manufacturing company has been selected as the case company. Second is to find the critical obstacles for the lean implementation. The fuzzy logic computation method is used to extract the perceptions about the particular variables by using linguistic values and then match it with fuzzy numbers to compute the precise value of the leanness level of the organization. Based on the results obtained from this analysis, it was found that the case study company has performed in the lean to vey lean range and the weaker areas have been identified to improve the performance further.
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...IIRindia
Health care has millions of centric data to discover the essential data is more important. In data mining the discovery of hidden information can be more innovative and useful for much necessity constraint in the field of forecasting, patient’s behavior, executive information system, e-governance the data mining tools and technique play a vital role. In Parkinson health care domain the hidden concept predicts the possibility of likelihood of the disease and also ensures the important feature attribute. The explicit patterns are converted to implicit by applying various algorithms i.e., association, clustering, classification to arrive at the full potential of the medical data. In this research work Parkinson dataset have been used with different classifiers to estimate the accuracy, sensitivity, specificity, kappa and roc characteristics. The proposed weighted empirical optimization algorithm is compared with other classifiers to be efficient in terms of accuracy and other related measures. The proposed model exhibited utmost accuracy of 87.17% with a robust kappa statistics measurement and roc degree indicated the strong stability of the model when compared to other classifiers. The total penalty cost generated by the proposed model is less when compared with the penalty cost of other classifiers in addition to accuracy and other performance measures.
Survey on Segmentation Techniques for Spinal Cord ImagesIIRindia
Medical imaging is a technique which is used to expose the interior part of the body, to diagnose the diseases and to treat them as well. Different modalities are used to process the medical images. It helps the human specialists to make diagnosis ailments. In this paper, we surveyed segmentation on the spinal cord images using different techniques such as Data mining, Support vector machine, Neural Networks and Genetic Algorithm which are applied to find the disorders and syndromes affected in the spinal cord system. As a result, we have gained knowledge in an identified disarrays and ailments affected in lumbar vertebra, thoracolumbar vertebra and spinal canal. Finally how the Disc Similarity Index values are generated in each method is also analysed.
An Approach for Breast Cancer Classification using Neural NetworksIIRindia
Breast Cancer,an increasing predominant death causing disease among women has become a social concern. Early detection and efficient treatment helps to reduce the breastcancerrisk.AdaptiveResonanceTheory(ART1),anunsupervised neural network has become an efficient tool in the classification of breast cancer as Benign(non dangerous tumour) or Malignant (dangerous tumour). 400 instances were pre processed to convert real data into binary data and the classification was carried out using ART1 network. The results of the classified data and the physician diagnosed data were compared and the standard performance measures accuracy, sensitivity and specificity were computed. The results show that the simulation results are analogous to the clinical results.
Water scarcity is the lack of fresh water resources to meet the standard water demand. There are two type of water scarcity. One is physical. The other is economic water scarcity.
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)MdTanvirMahtab2
This presentation is about the working procedure of Shahjalal Fertilizer Company Limited (SFCL). A Govt. owned Company of Bangladesh Chemical Industries Corporation under Ministry of Industries.
Welcome to WIPAC Monthly the magazine brought to you by the LinkedIn Group Water Industry Process Automation & Control.
In this month's edition, along with this month's industry news to celebrate the 13 years since the group was created we have articles including
A case study of the used of Advanced Process Control at the Wastewater Treatment works at Lleida in Spain
A look back on an article on smart wastewater networks in order to see how the industry has measured up in the interim around the adoption of Digital Transformation in the Water Industry.
Hierarchical Digital Twin of a Naval Power SystemKerry Sado
A hierarchical digital twin of a Naval DC power system has been developed and experimentally verified. Similar to other state-of-the-art digital twins, this technology creates a digital replica of the physical system executed in real-time or faster, which can modify hardware controls. However, its advantage stems from distributing computational efforts by utilizing a hierarchical structure composed of lower-level digital twin blocks and a higher-level system digital twin. Each digital twin block is associated with a physical subsystem of the hardware and communicates with a singular system digital twin, which creates a system-level response. By extracting information from each level of the hierarchy, power system controls of the hardware were reconfigured autonomously. This hierarchical digital twin development offers several advantages over other digital twins, particularly in the field of naval power systems. The hierarchical structure allows for greater computational efficiency and scalability while the ability to autonomously reconfigure hardware controls offers increased flexibility and responsiveness. The hierarchical decomposition and models utilized were well aligned with the physical twin, as indicated by the maximum deviations between the developed digital twin hierarchy and the hardware.
A Survey on the Analysis of Dissolved Oxygen Level in Water using Data Mining Techniques
Integrated Intelligent Research (IIR) International Journal of Data Mining Techniques and Applications
Volume: 06, Issue: 02, December 2017, Page No. 43-51
ISSN: 2278-2419
R. Arunkumar¹ and T. Velmurugan²
¹Research Scholar, ²Associate Professor,
PG and Research Department of Computer Science,
D.G. Vaishnav College, Arumbakkam, Chennai-600106, India.
Abstract: Data Mining (DM) is a powerful and relatively new field that offers various techniques for analysing current real-world problems. Within DM, environmental mining is one of the essential and interesting research areas. DM makes it possible to extract fundamental insights and knowledge from massive volumes of environmental data. Water quality describes the condition of water in the environment. It represents the concentration and state (dissolved or particulate) of some or all of the organic and inorganic material present in the water, together with certain physical characteristics of the water. Dissolved Oxygen (DO) is one of the important aspects of water quality. DO is the quantity of gaseous oxygen (O2) incorporated into the water, and it is essential for keeping aquatic organisms alive. The DO level in water can be determined by various methods, and data mining techniques are used to estimate the DO level in different types of water. A number of DM methods have been used to analyze the DO level, such as Multi-Layer Perceptron, Multivariate Linear Regression, Factor Analysis, and Feed Forward Neural Network. This survey discusses such methods in detail, particularly those used for the analysis of the DO level. Finally, it suggests the best DM method for finding the DO level in water by means of a comparative analysis.
Keywords: Dissolved Oxygen, Multi-Layer
Perceptron, Multivariate Linear Regression,
Factor Analysis, Feed Forward Neural
Network.
I. INTRODUCTION
Organisms and the non-biotic environment of the ecosystem are among the significant factors in the water environment. Their condition reflects the effect that human use of water resources has on the ecological environment, and it also illustrates the relation between human activity and the water environment. Water has various parameters, such as Total Dissolved Solids (TDS), Fluoride, Chloride, Dissolved Oxygen, and Sulfate. These primary parameters determine the quality of water. In this analysis, DO is the parameter chosen from among them. DO is the amount of free, or non-compound, oxygen gas (O2) available in the water. Oxygen that is not bonded to any other element is called non-compound oxygen or free oxygen, and DO is the presence of these free O2 molecules within the water. The DO count does not include the oxygen bonded in the water molecule itself (H2O). Oxygen is a primary parameter for evaluating water quality because of its influence on the organisms living within a water body. Oxygen enters the water by direct absorption from the atmosphere or through plant photosynthesis. Figure 1 shows how DO enters the water from the atmosphere. DO is necessary for the survival of fish, invertebrates, bacteria, and underwater plants. It is also needed for decomposing organic matter. The amount of DO needed varies from creature to creature. Bottom feeders, crabs, oysters and worms need a small amount of oxygen (1-6 mg/L), while shallow-water fish require higher levels (4-15 mg/L). Microorganisms (microbes) such as bacteria and fungi also need DO. The DO level is one of the major indicators used to determine water quality. At the bottom of a water body, microorganisms use DO to decay organic material.
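The ranges quoted above can be captured in a small lookup, shown here as a minimal sketch. The group keys (`bottom_dwellers`, `shallow_water_fish`) and the function name are hypothetical labels introduced for illustration; only the numeric ranges come from the text.

```python
from typing import List

# DO ranges in mg/L as quoted in the text; the dictionary keys are
# hypothetical labels chosen for this illustration.
DO_RANGES_MG_L = {
    "bottom_dwellers": (1.0, 6.0),      # bottom feeders, crabs, oysters, worms
    "shallow_water_fish": (4.0, 15.0),
}


def groups_in_range(do_mg_l: float) -> List[str]:
    """Return the groups whose quoted DO range contains the measured value."""
    return [
        name
        for name, (low, high) in DO_RANGES_MG_L.items()
        if low <= do_mg_l <= high
    ]


print(groups_in_range(5.0))
print(groups_in_range(2.0))
```

A reading of 5 mg/L falls inside both quoted ranges, whereas 2 mg/L falls only inside the range given for bottom-dwelling organisms.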
Microbial decomposition is a significant contributor to nutrient recycling. However, in a body of water with irregular or no turnover (also known as stratification), the oxygen at lower water levels is used up more quickly if there is an excess of decaying organic material (from dying algae and other organisms). The DO level in water depends on atmospheric pressure, water temperature, salinity and other factors. Concentrations of DO are constantly affected by diffusion and aeration, photosynthesis, respiration and decomposition. DO enters the water from the air or as a by-product of plants. From the surrounding atmosphere, oxygen can diffuse slowly across the water's surface or be mixed in rapidly through aeration, whether natural or man-made. Natural aeration of water can be caused by wind (creating waves), fast-flowing water, and groundwater discharge. Man-made causes of aeration range from an aquarium air pump to a hand-turned waterwheel to a large dam.
Figure 1: Dissolved Oxygen Enters into the Water
Photosynthesis by algae, seaweed, phytoplankton and other aquatic plants also produces DO in water. It occurs mostly at the surface of the water, but a large portion of the process also happens under water. Because aquatic photosynthesis is light-dependent, most DO is produced during daylight and only a smaller amount is produced at night. Given the required light, carbon dioxide (CO2) is readily absorbed by water, and the oxygen formed as a by-product remains dissolved in the water. The basic reaction of aquatic photosynthesis is given in Equation 1.
CO2 + H2O → (CH2O) + O2 (1)
In a stable body of water, dissolved oxygen remains at 100% air saturation, which means that the water holds as many dissolved gas molecules as it can at equilibrium. Until the water reaches complete equilibrium, it slowly absorbs oxygen and other gases from the atmosphere. This absorption takes place through aeration. In deeper waters, DO levels fall below 100% because of the respiration of aquatic organisms and microbial decomposition. Water bodies are polluted by organic or inorganic substances. When the rate of DO consumption is greater than the rate at which oxygen is replenished in the water, the DO content can approach 0; at that point anaerobic bacteria multiply actively and the water quality deteriorates further. Monitoring the DO content of water is therefore of great significance for environmental monitoring and for the growth of the aquaculture industry.
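The balance between oxygen supply and consumption described above can be illustrated with a minimal sketch. It assumes a first-order reaeration term and a constant consumption rate, which is a common simplification and is not taken from the surveyed papers. The function names, the rate values and the saturation concentration of 9.0 mg/L are all hypothetical.

```python
def percent_saturation(do_mg_l, do_sat_mg_l):
    """DO expressed as a percentage of the air-saturation concentration."""
    return 100.0 * do_mg_l / do_sat_mg_l


def simulate_do(c0, c_sat, k_reaeration, consumption, hours, dt=0.1):
    """Euler integration of dC/dt = k_reaeration * (c_sat - C) - consumption.

    Concentrations are in mg/L, k_reaeration in 1/h, consumption in mg/L/h.
    DO is clipped at zero because a concentration cannot be negative.
    """
    c = c0
    steps = int(round(hours / dt))
    for _ in range(steps):
        c += dt * (k_reaeration * (c_sat - c) - consumption)
        c = max(c, 0.0)
    return c


C_SAT = 9.0  # hypothetical saturation concentration (mg/L)

# Light load: consumption is small enough for reaeration to balance it.
light = simulate_do(C_SAT, C_SAT, k_reaeration=0.05, consumption=0.1, hours=200)
# Heavy load: consumption exceeds what reaeration can resupply.
heavy = simulate_do(C_SAT, C_SAT, k_reaeration=0.05, consumption=0.6, hours=200)

print(round(light, 2), round(percent_saturation(light, C_SAT), 1))
print(round(heavy, 2), round(percent_saturation(heavy, C_SAT), 1))
```

Under these assumed rates the lightly loaded water settles below 100% saturation but stays oxygenated, while the heavily loaded water is driven to zero DO, which is the condition under which the text says anaerobic bacteria take over.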
The paper is organized as follows. Section 2 discusses the different methods used in environmental mining. Section 3 explores the detection of dissolved oxygen in water. Section 4 concludes with the environmental mining techniques best suited to detecting DO in water.
II. METHODS OF ENVIRONMENTAL MINING
Data mining is a process for discovering
hidden facts and understandable patterns in
huge amounts of data. The goal of DM is to
select the input database, develop new
algorithms and deliver the statistical output
needed to discover potential knowledge. The
novelty and comprehensibility of mining
results are evident in the environmental
domain. Environmental mining helps to
maintain the state of the environment and to
keep the ecosystem normal. Researchers have
used different methods to determine the state
of the environment and its parameters, such as
DO, BOD (biochemical oxygen demand) and
wind temperature. M. Kanevski et al.
proposed a hybrid approach that combines
machine learning algorithms (MLA) and
geostatistics, based on a decision-making
process and probabilistic mapping [1]. This
approach introduced two new non-stationary
models, SVRRSGS (support vector regression
residual sequential Gaussian simulation) and
NNRSGS (neural network residual sequential
Gaussian simulation), for the analysis and
mapping of spatially distributed data. The
method is used to remove large-scale spatial
structures and to reduce the computational
cost. Chen Zhang et al. implemented a
multi-hierarchy fuzzy method for assessing the
water environment security status of the Luan
River [2]. They divided water environment
security into three aspects and assessed the
security level of each. The three aspects are
‘water quality situation’, ‘ecology situation’
and ‘pollution situation’, and the results for
these aspects are ‘very safe’, ‘safe’ and
‘normal’ respectively. The overall evaluation
result for water security is ‘safe’. Pierre
Accorsi et al. proposed a tool named
HydroQual, which combines spatiotemporal
data mining and visualization techniques [3].
The visual mapping offers three kinds of view:
a geographical view, a clustering view and a
temporal patterns view. The proposed tool
makes the visual analysis of river water quality
easier, and the authors report promising
results.
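The multi-hierarchy fuzzy assessment of [2]
described above can be pictured with a small
sketch of a weighted fuzzy evaluation. This is
only an assumed, generic form of the technique:
the grade set, the membership degrees and the
aspect weights below are hypothetical values
chosen so that the outcome matches the grades
quoted in the text, and they do not come from
the original study.

```python
GRADES = ("very safe", "safe", "normal")  # hypothetical grade set


def grade_of(membership):
    """Maximum-membership principle: pick the grade with the largest degree."""
    best = max(range(len(GRADES)), key=lambda k: membership[k])
    return GRADES[best]


def combine(weights, memberships):
    """Weighted average of the aspect-level membership vectors."""
    return [
        sum(w * m[k] for w, m in zip(weights, memberships))
        for k in range(len(GRADES))
    ]


# Hypothetical membership degrees of each aspect in each grade.
aspects = {
    "water quality situation": [0.6, 0.3, 0.1],
    "ecology situation": [0.2, 0.6, 0.2],
    "pollution situation": [0.1, 0.3, 0.6],
}
weights = [0.4, 0.3, 0.3]  # hypothetical aspect weights, summing to 1

aspect_grades = {name: grade_of(m) for name, m in aspects.items()}
overall = combine(weights, list(aspects.values()))
print(aspect_grades)
print(grade_of(overall))
```

The lower level assigns a grade to each aspect,
and the upper level combines the aspect vectors
into one overall grade, which is the idea behind
a multi-hierarchy evaluation.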
G. Shoba and G. Shobha discussed the major
data mining approaches for predicting water
quality [4]. The researchers explored various
data mining techniques, such as ANN, back
propagation, MLP and decision trees, used to
analyze environmental water quality. The
research paper “Evaluation of multivariate
linear regression and artificial neural networks
in prediction of water quality parameters” was
presented by Hamid Zare Abyaneh [5]. He
implemented multivariate linear regression
(MLR) and ANN models for predicting water
quality. These models predicted the BOD and
chemical oxygen demand (COD) parameters in
a wastewater treatment plant, and the water
quality was determined from these parameters.
D. M. Hamby reviewed parameter sensitivity
techniques used to analyze environmental
models [6]. The researcher discussed various
sensitivity analysis methods, such as the
one-at-a-time method and the direct method.
He used correlation analysis to find the
relationship between dependent and
independent variables, and regression analysis
to complete the sensitivity measures. Nitin
Muttil and Kwok-Wing Chau discussed a
hybrid technique based on ANN and genetic
programming (GP) for predicting algal blooms
in Tolo Harbor, Hong Kong [7]. The results
reproduce the long-term trends of algal
biomass reasonably well.
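The one-at-a-time sensitivity method mentioned
above in connection with [6] can be sketched as
follows. Each input is perturbed by a small
relative amount while the others are held at
their baseline values. The function names, the
toy model and its coefficients are hypothetical
and serve only to exercise the procedure.

```python
def one_at_a_time_sensitivity(model, baseline, delta=0.01):
    """Relative change in output per relative change in each input.

    Assumes the baseline output and every baseline input are non-zero.
    """
    y0 = model(**baseline)
    result = {}
    for name, value in baseline.items():
        perturbed = dict(baseline)
        perturbed[name] = value * (1.0 + delta)  # change one input only
        y1 = model(**perturbed)
        result[name] = ((y1 - y0) / y0) / delta
    return result


def toy_model(temperature, flow, load):
    """Hypothetical linear model of DO, used only for demonstration."""
    return 14.0 - 0.3 * temperature + 0.5 * flow - 0.1 * load


baseline = {"temperature": 20.0, "flow": 2.0, "load": 10.0}
sensitivity = one_at_a_time_sensitivity(toy_model, baseline)
for name, value in sensitivity.items():
    print(f"{name}: {value:+.3f}")
```

Inputs with a larger absolute value in the result
have more influence on the model output near the
chosen baseline.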
3. DETECTION OF DISSOLVED
OXYGEN IN WATER
Dissolved oxygen is one of the important
water quality parameters and is used to
determine whether a water body can sustain
life. The water parameters BOD and COD are
determined after DO detection. These
parameters are used to determine the organic
components in water. Researchers have
measured DO in different types of water in the
environment. Campo et al. proposed a
technique based on a fiber-optic oxygen sensor
to measure DO in water [8]. Using the
Stern-Volmer ratio, they determined the
oxygen concentration in water. The detection
limit is 0.05 mg/l and the sensor showed high
stability, but one major disadvantage of this
method is its long response time, which is a
significant limitation in some applications.
The following researchers discussed the DO
level in rivers. The research paper “Simulation
of the concentration of dissolved oxygen in
river waters using artificial neural networks”
was presented by Fabiana Costa de Araujo
Schütz et al. [9]. They proposed an ANN
technique to calculate the DO in the river
Algeria. The simulation method observes and
predicts the concentration of DO in water. The
predicted DO concentration depends on the
number of variables present in the model. The
results show an average error of 11, and the
predicted available oxygen in the river waters
was 42%. V. Simeonov et al. proposed a hybrid
technique that combines cluster analysis (CA),
principal component analysis (PCA) and
multiple regression analysis (MRA) [10]. They
evaluated dissolved oxygen, BOD and other
surface water quality parameters in the major
rivers of Northern Greece. Better information
can be obtained from this kind of multivariate
statistical assessment. The hybrid method is
able to assess large and complex databases.
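The Stern-Volmer ratio used by the optical
sensor of [8] relates the luminescence intensity
without oxygen (I0) to the intensity with oxygen
(I) through I0 / I = 1 + K_SV * [O2]. A minimal
sketch of the inversion is given below. The
function name, the intensity readings and the
quenching constant are hypothetical; a real
sensor would need calibration and temperature
compensation.

```python
def oxygen_from_intensity(i0, i, k_sv):
    """Invert the Stern-Volmer relation I0 / I = 1 + k_sv * [O2].

    i0   -- luminescence intensity with no oxygen present
    i    -- measured intensity in the sample
    k_sv -- Stern-Volmer quenching constant (here in L/mg)
    """
    if i0 <= 0 or i <= 0 or k_sv <= 0:
        raise ValueError("intensities and k_sv must be positive")
    return (i0 / i - 1.0) / k_sv


# Hypothetical reading: intensity drops from 1000 to 800 counts.
do_mg_per_l = oxygen_from_intensity(i0=1000.0, i=800.0, k_sv=0.05)
print(f"{do_mg_per_l:.2f} mg/L")
```

Because oxygen quenches the luminescence, a
lower measured intensity corresponds to a higher
DO concentration.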
Ehsan Olyaie et al. discussed a combination of
three different Artificial Intelligence (AI)
methods, providing a new hybrid technique
used to estimate the concentration of DO at a
Delaware River station [11]. The methods are:
(1) two types of artificial neural network,
namely the multilayer perceptron (MLP) and
the radial basis function (RBF) network;
(2) linear genetic programming (LGP); and
(3) the support vector machine (SVM). ANNs,
LGP and SVM displayed good forecasting
accuracy for low values of DO but could not
maintain their accuracy for high values of DO.
The results show that the SVM method is
superior to the ANN and LGP methods in DO
forecasting. Samira Nemati et al. applied
data-driven techniques, namely the MLR,
ANFIS and ANN methods, to evaluate the DO
in the Tai Po River [12]. The values reported
for the measured and estimated DO with the
MLR model, the ANFIS model and the ANN
model are 0.550 and 0.681, 0.791 and 0.645,
and 0.796 and 0.798 respectively. Durdu Omer
Faruk proposed a hybrid technique that
combines the ARIMA method with a
feed-forward BP network trained with an
optimized conjugate training algorithm [13].
The results show that the values obtained for
boron, DO and water temperature (WT) are
0.902, 0.893 and 0.909 respectively. Using
these water parameters, he predicted the water
quality of the Buyuk Menderes River. Kunwar
P. Singh et al. used an effective tool to predict
the water quality of the Gomti River in India
[14]. The researchers combined an ANN
model, namely an FFNN, with the BP learning
algorithm to measure the DO and biochemical
oxygen demand concentration levels. The
hybrid method improves the understanding of
river pollution trends. Stewart A. Rounds et al.
proposed an MLR model and an FFNN model
to calculate the dissolved oxygen concentration
in the Tualatin River at Oswego Dam [15]. The
MLR model failed to capture long-term
patterns in the DO data, whereas the FFNN
model provided better performance. The
calculated mean absolute error (MAE) is less
than 0.9 mg/L.
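To make the MLR baseline and the MAE measure
discussed above concrete, the sketch below fits
a least-squares line with a single predictor
(the simplest special case of MLR) and computes
the MAE of its predictions. The temperature and
DO readings are hypothetical values chosen for
illustration and are not data from any of the
surveyed rivers.

```python
def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_absolute_error(observed, predicted):
    """Average absolute difference between observations and predictions."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


# Hypothetical readings: water temperature (deg C) and DO (mg/L).
temperature = [5.0, 10.0, 15.0, 20.0, 25.0]
do_observed = [12.8, 11.3, 10.1, 9.1, 8.3]

slope, intercept = fit_line(temperature, do_observed)
do_predicted = [intercept + slope * t for t in temperature]
mae = mean_absolute_error(do_observed, do_predicted)
print(f"slope={slope:.3f} intercept={intercept:.2f} MAE={mae:.3f} mg/L")
```

The same MAE function can be applied to the
output of any other model, which is how
different techniques are usually compared on a
common test set.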
The researchers below measured the dissolved
oxygen level in stream water. Ozgur Kisi and
Murat Ay proposed the radial basis neural
network (RBNN) and adaptive neuro-fuzzy
inference system (ANFIS) methods to estimate
the DO concentration at the Fountain Creek
stream gauging station [16]. The researchers
evaluated DO on a daily basis using the RBNN
and ANFIS methods. RBNN is slightly better
than the ANFIS method. The ANFIS method
can be good for DO estimation when only
temperature data is available. The multilayer
perceptron (MLP) and radial basis neural
network techniques were combined by Murat
Ay and Ozgur Kisi [17]. This hybrid method
was used to estimate the DO concentration at
the upstream and downstream stations on
Foundation Creek, Colorado. The researchers
compared the MLP and RBNN models with a
multiple linear regression model.
Researchers have also measured the DO
concentration underwater, in coastal water and
in reservoirs. Yu Zhao et al. developed an
optical sensor based on the principle of
fluorescence quenching [18]. Photoelectric
detection technology, wired and wireless data
communication technology and a number of
instruments placed in various locations are
combined, and the information is sent to the
primary monitoring upper machine. The
information detected underwater is further
processed by the primary machine. It can
measure the DO concentration in water on-line
and in real time. The detection process is
simple and fast. Kwok-wing Chau and Nitin
Muttil applied box plots and multivariate
statistical analysis to measure the DO level in
the coastal waters of Tolo Harbor, Hong Kong
[19]. H. Vicente et al. proposed an ANN-based
model, namely a feed-forward neural network
(FFNN) trained with the back propagation (BP)
learning algorithm [20]. They used it to predict
DO, wind temperature, etc. in the Monte Novo
Reservoir. The values for the training set
(0.995-0.998) and for the test set (0.994-0.996)
show a good match between them.
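A feed-forward network trained with back
propagation, as used in several of the studies
above, can be sketched in a few lines. The code
below is a generic, minimal example with one
hidden layer of sigmoid units and a linear
output unit. It is not the architecture of any
surveyed paper, and the scaled
temperature-to-DO samples, layer size, learning
rate and epoch count are all hypothetical.

```python
import math
import random


def build_network(n_in, n_hidden, seed=0):
    rng = random.Random(seed)
    # Each hidden row holds n_in weights followed by a bias.
    w_hidden = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                for _ in range(n_hidden)]
    # Output weights: one per hidden unit followed by a bias.
    w_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
    return w_hidden, w_out


def forward(net, x):
    w_hidden, w_out = net
    hidden = []
    for row in w_hidden:
        z = sum(w * xi for w, xi in zip(row, x)) + row[-1]
        hidden.append(1.0 / (1.0 + math.exp(-z)))  # sigmoid activation
    y = sum(w * h for w, h in zip(w_out, hidden)) + w_out[-1]
    return hidden, y


def train(net, samples, lr=0.2, epochs=3000):
    w_hidden, w_out = net
    for _ in range(epochs):
        for x, target in samples:
            hidden, y = forward(net, x)
            err = y - target  # gradient of 0.5 * (y - target)**2 w.r.t. y
            # Propagate the error back to the hidden layer before
            # the output weights are changed.
            delta_hidden = [err * w_out[j] * h * (1.0 - h)
                            for j, h in enumerate(hidden)]
            for j, h in enumerate(hidden):
                w_out[j] -= lr * err * h
            w_out[-1] -= lr * err
            for j, row in enumerate(w_hidden):
                for i, xi in enumerate(x):
                    row[i] -= lr * delta_hidden[j] * xi
                row[-1] -= lr * delta_hidden[j]


def mse(net, samples):
    return sum((forward(net, x)[1] - t) ** 2 for x, t in samples) / len(samples)


# Hypothetical samples: scaled temperature in [0, 1] -> scaled DO in [0, 1].
samples = [([0.0], 1.0), ([0.25], 0.67), ([0.5], 0.40),
           ([0.75], 0.18), ([1.0], 0.0)]
net = build_network(n_in=1, n_hidden=3)
mse_before = mse(net, samples)
train(net, samples)
mse_after = mse(net, samples)
print(f"MSE before training: {mse_before:.4f}, after: {mse_after:.4f}")
```

Scaling inputs and targets to a common range, as
done here, is a usual precaution that keeps the
gradient steps well behaved.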
The following researchers determined water
quality using the DO level in waters. A. H.
Pejman et al. used various multivariate
statistical techniques, such as cluster analysis,
principal component analysis (PCA) and factor
analysis (FA), to evaluate the spatial and
seasonal variations of surface water quality
[21]. Using cluster analysis, eight sampling
stations were grouped into three clusters based
on the selected criteria. The PCA and FA
methods identified the factors responsible for
water quality variations. S. Shrestha and F.
Kazama discussed multivariate statistical
techniques for evaluating complex temporal
and spatial datasets [22]. They used different
kinds of analysis, namely cluster analysis,
PCA, FA and discriminant analysis (DA), to
examine various water quality parameters such
as DO, BOD and water temperature (WT) in
the Fuji River. Hierarchical cluster analysis
grouped the 13 sampling sites into three
clusters according to the similarity of their
water quality characteristics: less polluted
(LP), medium polluted (MP) and highly
polluted (HP) sites in the river. The total
variance explained in the water quality datasets
of the LP, MP and HP areas is 73.18%, 77.61%
and 65.39% respectively. DA gave the best
result both spatially and temporally. F.
Karimipour et al. proposed the use of a
geospatial information system (GIS) and
geospatial data mining [23]. The researchers
used these methods to extract information
about DO and to establish its importance for
water quality. They studied DO, pH and TDS
(total dissolved solids) levels in North-West
Iran. The results show that as pollution
increases, the DO level decreases. A very low
DO level is harmful to aquatic life. An increase
in TDS together with a decrease in DO implies
water pollution. Sundarambal Palani et al.
proposed an ANN technique to predict water
quality [24]. Water quality parameters such as
DO, salinity and temperature were evaluated
using general regression neural network
(GRNN) and multilayer perceptron methods.
GRNN was better for DO forecasting.
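The grouping of sampling sites by hierarchical
cluster analysis described above can be sketched
as follows. This is a generic average-linkage
agglomerative procedure on Euclidean distances.
The linkage choice, the six sites and their DO
and BOD values are assumptions for illustration,
and the surveyed studies may have used a
different linkage and standardized variables.

```python
import math


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def agglomerative_clusters(points, n_clusters):
    """Average-linkage agglomerative clustering.

    Returns a list of clusters, each a sorted list of point indices.
    """
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Mean pairwise distance between the two clusters.
                total = sum(euclidean(points[i], points[j])
                            for i in clusters[a] for j in clusters[b])
                dist = total / (len(clusters[a]) * len(clusters[b]))
                if best is None or dist < best[0]:
                    best = (dist, a, b)
        _, a, b = best
        clusters[a] = sorted(clusters[a] + clusters[b])  # merge closest pair
        del clusters[b]
    return clusters


# Hypothetical sites described by (DO, BOD) in mg/L.
sites = [
    (8.1, 1.2), (7.9, 1.5),  # high DO, low BOD
    (5.5, 4.0), (5.2, 4.4),  # intermediate
    (2.1, 9.0), (1.8, 9.6),  # low DO, high BOD
]
clusters = agglomerative_clusters(sites, n_clusters=3)
print(clusters)
```

With these values the three groups correspond to
what the text calls less polluted, medium
polluted and highly polluted sites.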
The research paper “Spatial variation and
source apportionment of water pollution in
Qiantang River (China) using statistical
techniques” was presented by Fang Huang et
al. [25]. They discussed a fuzzy-based
technique to evaluate different water
parameters (DO, BOD, fluoride, etc.) and
determine the water quality of the Qiantang
River, China. Tom D’heygere et al. used a
genetic algorithm to select the input variables
of decision trees [26]. Different sets of input
variables (DO, pH, temperature, etc.) were
used to compare the predictive power of
decision trees for benthic macroinvertebrates.
They reduced the input variables from 15 to
between 2 and 8 without affecting the
predictive power of the decision trees.
Xiaohu Wen et al. proposed a three-layer
back propagation technique to estimate water
quality parameters such as DO concentration,
pH, calcium and chloride [27]. The ANN
model is a good choice for evaluating water
quality parameters. Table 1 summarizes the
methods and techniques proposed by different
researchers for determining the DO
concentration in water and the quality of the
water. Most of the proposed approaches
integrate two or more different techniques into
a single technique, called a hybrid.
Table 1: Results Comparison
| Paper Ref. No. | Author Name | Proposed Method | Results: Accuracy | Results: Output |
|---|---|---|---|---|
| [8] | J.C. Campo et al. | Fiber optic oxygen sensor | | High stability with longer response time |
| [9] | Fabiana Costa de Araujo Schütz et al. | ANN | -- | 42% |
| [10] | V. Simeonov et al. | CA, PCA, MRA | | Better information can be obtained |
| [11] | Ehsan Olyaie et al. | MLP, RBF, LGP, SVM | | SVM method provides best performance |
| [12] | Samira Nemati et al. | MLR, ANFIS, ANN | | ANN model showed good performance |
| [13] | Durdu Omer Faruk | ARIMA, FF, BP with optimized conjugated training algorithm | | The hybrid technology gives better accuracy compared with ARIMA and ANN models for water quality predictions. |
| [14] | Kunwar P. Singh et al. | FFNN, BP | | Find out water quality using DO and BOD concentration. |
| [15] | Stewart A. Rounds et al. | MLR, FFNN | | The ANN model provides better performance compared to MLR model. |
| [16] | Ozgur KISI and Murat AY | RBNN, ANFIS | | RBNN gave good result compared to ANFIS |
| [17] | Murat Ay and Ozgur Kisi | MLP, RBNN | | MLP and RBNN hybrid model gives good result compared to MLR model |
| [18] | Yu Zhao et al. | Fluorescence quenching | | Measure the DO level under water in real time |
| [19] | Kwok-wing Chau and Nitin Muttil | Box plots and Multivariate Statistical Analysis | | Steps to control the pollution |
| [20] | H. Vicente et al. | FFNN, BP Learning Algorithm | -- | During training: 0.995-0.998; test result: 0.994-0.996 |
| [21] | A. H. Pejman et al. | PCA, FA | | DO and other parameters evaluated significantly |
| [22] | S. Shrestha and F. Kazama | CA, PCA, FA, DA | 6 parameters: 85%; 7 parameters: 81% | -- |
| [23] | F. Karimipour et al. | GIS and geospatial data mining | | Analyze the DO, pH and TDS |
| [24] | Sundarambal Palani et al. | GRNN, MLP | R2 0.8 to 0.9 | -- |
| [25] | Fang Huang et al. | Fuzzy based FA and UNMIX model | | Pollution level categorized into LP, MP, and HP by various water quality parameters. |
| [26] | Tom D’heygere et al. | Genetic algorithm, Decision tree | | The data trends are identified easily |
| [27] | Xiaohu Wen et al. | BP | | Find out various water quality parameters such as DO, pH |
Table 1 shows the performance of the
techniques applied by various researchers. The
results indicate that hybrid techniques are well
suited to evaluating the DO level in water.
4. CONCLUSION
The capacity of water to support aquatic
organisms is characterized by its dissolved
oxygen. The concentration of DO has
traditionally been used to characterize the
water quality of water systems. The modeling
of water quality parameters is a very important
part of the analysis of any aquatic system.
Aquatic life can be distressed if the DO level is
too high or too low. In this research work,
various DM methods and techniques proposed
by different researchers are analyzed. Many
existing DM methods and some hybrid
methods are taken into account in the analysis.
From the perspectives of the different
researchers, it is found that most of the hybrid
techniques yield the best results in determining
the concentration of DO in water. Hence, this
work concludes that hybrid methods perform
well in analyzing the DO level in water
compared with the existing individual
methods.
References
[1] Kanevski, M., Roman Parkin, Aleksey
Pozdnukhov, Vadim Timonin, Michel
Maignan, V. Demyanov, and
Stéphane Canu, "Environmental data
mining and modeling based on machine
learning algorithms and
geostatistics", Environmental Modelling &
Software, Vol. 19(9), 2004, pp. 845-855.
[2] Zhang, Chen, Wanhong Liu, Zhaoshen Su,
and Liyi Wang, "Index system and
method for assessing water environment
security of Luan River-Tianjin water
diversion project", IEEE 4th Int. Conf.
on Bioinformatics and Biomedical
Engineering, 2010, pp. 1-5.
[3] Accorsi, Pierre, Nathalie Lalande,
Mickaël Fabrègue, Agnès Braud, Pascal
Poncelet, Arnaud Sallaberry, Sandra
Bringay, Maguelonne Teisseire,
Flavie Cernesson, and Florence Le
Ber, "HydroQual: visual analysis of river
water quality", IEEE Conf. on Visual
Analytics Science and Technology, 2014,
pp. 123-132.
[4] Shoba, G., and G. Shobha, "Water Quality
Prediction Using Data Mining techniques:
A Survey", Int. Journal of Engineering
and Computer Science, Vol.3(6), 2014,
pp. 6299-6306.
[5] Abyaneh, Hamid Zare, "Evaluation of
multivariate linear regression and
artificial neural networks in prediction of
water quality parameters", Journal of
Environmental Health Science and
Engineering, Vol.12(1), 2014, pp. 40.
[6] Hamby, D. M., "A review of techniques for
parameter sensitivity analysis of
environmental models", Environmental
monitoring and assessment, Vol. 32(2),
1994, pp. 135-154.
[7] Muttil, Nitin, and Kwok-Wing Chau,
"Neural network and genetic
programming for modeling coastal algal
blooms", Int. Journal of Environment and
Pollution, Vol. 28(3-4), 2006, pp. 223-
238.
[8] Campo, J. C., M. A. Perez, M. Gonzalez,
and F. J. Ferrero, "An optrode type sensor
to measure dissolved oxygen in water",
IEEE Conf. on Instrumentation and
Measurement Technology, Vol. 3, 1999,
pp. 1537-1540.
[9] Araujo Schütz, Fabiana Costa, Vera Lucia
Antunes de Lima, Eduardo Eyng, Adriano
de Andrade Bresolin, and Fernando Schütz,
"Simulation of the concentration of
dissolved oxygen in river waters using
artificial neural networks", IEEE 11th Int.
Conf. on Natural Computation, 2015, pp.
1252-1257.
[10] Simeonov, V., J. A. Stratis, C. Samara, G.
Zachariadis, D. Voutsa, A. Anthemidis,
M. Sofoniou, and Th. Kouimtzis,
"Assessment of the surface water quality
in Northern Greece", Water research,
2003, pp. 4119-4124.
[11] Olyaie, Ehsan, Hamid Zare Abyaneh, and
Ali Danandeh Mehr, "A comparative
analysis among computational
intelligence techniques for dissolved
oxygen prediction in Delaware
River", Geoscience Frontiers, Vol. 8(3),
2017, pp. 517-527.
[12] Nemati, Samira, Mohammad
Hasan Fazelifard, Özlem Terzi, and
Mohammad Ali Ghorbani, "Estimation of
dissolved oxygen using data-driven
techniques in the Tai Po River, Hong
Kong", Environmental Earth Sciences,
Vol. 74(5), 2015, pp. 4065-4073.
[13] Faruk, Durdu Ömer, "A hybrid neural
network and ARIMA model for water
quality time series prediction",
Engineering Applications of Artificial
Intelligence, Vol. 23(4), 2010, pp. 586-
594.
[14] Singh, Kunwar P., Ankita Basant, Amrita
Malik, and Gunja Jain, "Artificial neural
network modeling of the river water
quality-a case study", Ecological
Modelling, Vol. 220(6), 2009, pp. 888-
895.
[15] Rounds, Stewart A., "Development of a
neural network model for dissolved
oxygen in the Tualatin River, Oregon",
Proc. of the 2nd Federal Interagency
Hydrologic Modeling Conf., 2002.
[16] Kisi, Ozgur, and Murat Ay, "Comparison
of ANN and ANFIS techniques in
modeling dissolved oxygen", 16th Int.
Water Technology Conference, Vol. 2,
2012, pp. 141.
[17] Ay, Murat, and Ozgur Kisi, "Modeling of
dissolved oxygen concentration using
different neural network techniques in
Foundation Creek, El Paso County,
Colorado", Journal of Environmental
Engineering, Vol. 138(6), 2011, pp. 654-
662.
[18] Zhao, Yu, Li Sun, and Meng-fei Li, "The
research about detection of dissolved
oxygen in water based on C8051F040",
IEEE Int. Conf. on Information
Engineering and Computer Science, 2009,
pp. 1-4.
[19] Chau, Kwok-wing, and Nitin Muttil, "Data
mining and multivariate statistical
analysis for ecological system in coastal
waters", Journal of Hydroinformatics,
Vol. 9(4), 2007, pp. 305-317.
[20] Vicente, Henrique, Catarina Couto, José
Machado, António Abelha, and José
Neves, "Prediction of water quality
parameters in a reservoir using artificial
neural networks", Int. Journal of Design
& Nature and Ecodynamics, Vol. 7(3),
2012, pp. 310-319.
[21] Pejman, A. H., G. R. Nabi Bidhendi, A. R.
Karbassi, N. Mehrdadi, and M.
Esmaeili Bidhendi, "Evaluation of spatial
and seasonal variations in surface water
quality using multivariate statistical
techniques", Int. Journal of Environmental
Science & Technology, 2009, pp. 467-
476.
[22] Shrestha, S., and F. Kazama,
"Assessment of surface water quality
using multivariate statistical techniques:
A case study of the Fuji river basin,
Japan", Environmental Modelling &
Software, Vol. 22, 2007, pp. 464-475.
[23] Karimipour, Farid, Mahmoud Reza
Delavar, and M. Kinaie, "Water quality
management using GIS data mining",
Journal of Environmental informatics,
Vol. 5(2), 2005, pp. 61-71.
[24] Palani, Sundarambal, Shie-Yui Liong, and
Pavel Tkalich, "An ANN application for
water quality forecasting", Marine
Pollution Bulletin, Vol. 56(9), 2008, pp.
1586-1597.
[25] Huang, Fang, Xiaoquan Wang, Liping
Lou, Zhiqing Zhou, and Jiaping Wu,
"Spatial variation and source
apportionment of water pollution in
Qiantang River (China) using statistical
techniques", Water research, Vol. 44(5),
2010, pp. 1562-1572.
[26] D'heygere, Tom, Peter L. M. Goethals, and
Niels De Pauw, "Use of genetic algorithms
to select input variables in decision tree
models for the prediction of benthic
macroinvertebrates", Ecological
Modelling, Vol. 160(3), 2003, pp. 291-
300.
[27] Wen, Xiaohu, Jing Fang, Meina Diao, and
Chuanqi Zhang, "Artificial neural
network modeling of dissolved oxygen in
the Heihe River, Northwestern China",
Environmental monitoring and
assessment, Vol. 185(5), 2013, pp. 4361-
4371.