SlideShare a Scribd company logo
1 of 3
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1500
Survey of Estimation of Crop Yield using Agriculture Data
Anu priya K R
Dept of computer Science and Engineering, New Horizon College of Engineering, Bangalore, Karnataka, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - Data mining techniques are increasingly being
used in finding useful pattern from large data sets .Huge
amount of data is being produced day by day from various
sources and are stored in data sets .halfofthepeopleinIndia
depends on agriculture for their maintenance .Data mining
plays a major role in estimation of crop production
depending upon the various data sets such as soil verify,
rainfall, humidity, temperature, cities etc .Thispaperfocuses
on analysing agriculture data ,pre-processing the data to
remove the noise and find the optimal parametertoincrease
the crop yield which is more accurate and optimal .This
paper also makes an effort to understand the different
complications to eliminate the noise from the initial dataset
,know the problems of various algorithms used in earlier
approaches.
Key Words: Data cleaning; Big data; Data preprocessing
Optimal yield; Noise elimination.
1. INTRODUCTION
Big data deal with large amount of data which cannot be
accommodated by a normal computer in terms of storage,
processing and computation and data analysis technique
[1].As the size of data is increasing every moment which
causes complexity in terms of volume, variety, velocity,
veracity as a result it requires more storage, computation
and speed should be more[2].
Data mining on this big data bit a way of removing noise
from the initial dataset which is called pre-processing using
some of the pre-processing techniques.Rapid fluctuationare
common in market in such scenario it is difficult for the
farmers to choose the type of crop to be grown in land or
estimate the yield. Data mining is the procedure of using
huge data set to infer important hidden knowledge.
Data mining is divided into 7 methods:
 Data Cleaning
 Data integration
 Data selection
 Data transformation
 Data mining
 Pattern estimation
 Knowledge display
Figure 1.shows the different stages in data mining
Along with this in today's applications mostly all of them
involve crowd sourcing and semantic input for improved
data analysis in such an environment the tracing and
analysing data provenance of the data discovery and
integration of several parallel and distributed computing
generated data is the scope for implementing newer
exploratory data analysis and interpretation methodswhere
in the, integration of heterogeneous data, and developing
new models for massive data computation is possible.
The main purpose of data cleaning is to manipulate and
transform raw data so that the informationcontentenfolded
in the data set can be exposed, or made more easily
accessible. The quality of data is very pivotal in many
organizations as it maintains the basic characteristics of
cleaned data. So it's necessary to obtain a proper sample
data for knowledge extraction. Preprocessing canproducea
smaller data set than the original, which allows us to
improve efficiency in the data mining process. This
performance includes data reduction techniques such as
feature selection, sampling, instance selection and
discretization.
There are many methods for cleaning noisyandinconsistent
data like:
 Filter-based Method
 Imputation Method
 Hybrid method
 Wrapper method
 Global imputation of stray and non-missing
attribute
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1501
 Embedded Method
Data mining involves six common classes of tasks
 Detection of anomalies (detection of outliers /
change / deviation): documentation of rare data
records, which may be stimulants or data errors
that require further study.
 Learning associationrules(dependencemodeling):
observing for relationships between variables. For
example, a shop cancollectdata oncustomer buying
habits. Using the rules of the learning association,
the shop can regulate which products are regularly
bought together and use this information for
marketing purposes. This is sometimes mentioned
to as an study of the market basket.
 Grouping: the task of determining groups and
structures in the data that in one wayoranotherare
"similar", without using structures known in the
data.
 Classification: it is the task of simplifying the
known structure to apply it to innovative data. For
example, an email program mighttrytocategorizea
message as "legitimate" or "spam".
 Regression: try to find a function that models the
data with the lowest error, to estimate the
relationships between data or data sets.
 Summary: provides a more solid illustration of the
data set, including viewing and reporting.
LITERATURE SURVEY
Mohammad Motiur Rahman Naheena Haq, Rashedur M
Rahman have done an analysis on forecasting the rice yield
of Bangladesh. The study has shown that we can have good
relationship designed among the rice yield and the climatic
variable of a given area within certain threshold. This study
has helped the people concerned withagriculturesuchasthe
farmer etc to make decisions to diminish their loss in the
following years.
Jayaram Hariharakrishnan, Mohanavalli .S, Srividya,
Sundhara Kumar K.B carried survey on various data
cleaning methods such as Filter method, Imputationmethod
and Wrapper method. It focused on how efficient this data
cleaning techniques where in terms of blank spaces and
missing values. It comparesdifferentdata miningtechniques
and suggest which data cleaning technique is best
Ratul Dey, Sanjay Chakraborty , this paper help us
understand how incremental DBSCAN clustering algorithm
and Convex-Hull can be used to predict the weather of the
upcoming days. In this they have shown that these
algorithms are suitable for active database where the
weather data keeps changing regularly. The accuracy of the
techniques used is calculated based upon the hit and the
miss times.
S.Kanaga Suba Raja,Rishi .R,Sundarsan.E,Srijitfocusesontry
to predict crop yield and value by analyzing patterns in past
data.The technique used is Sliding Window and Non linear
Regression to predict several features that influence the
agriculture production such as rainfall,Temprature etc..The
proposed system helps to decrease the harm faced by the
farmers and would recover the crop yield.
Jharna Majumdar, Sneha Naraseeyappa and Shilpa Ankalaki,
the survey tells us that data mining techniques are very
necessary approach for estimation of various crop yields.
This paper focuses on the analysis agricultural dataset and
find optimal solution for maximizing the crop production.
PAM, CLARA etc are some of the different data mining
techniques that are been used.
III. PROPOSED SYSTEM
Several data mining techniques are carried on the input
dataset to accomplish the finest performance yielding
method. Clustering method can obtain better results.
DBSCAN algorithm can be used to estimate the crop yield by
considering various datasets and forming the clusters for
making decisions.
DBSAN is resistant to noise and can handle clusters of
various shapes and sizes. The results obtained are more
optimal and efficient hence farmers can make better
decision.
IV. CONCLUSIONS
A good clustering algorithm should be able to forecast the
crop yield which is more effective and optimal compared to
other clustering algorithms. There are numerous clustering
algorithms for predicting crop yield based upon the various
agricultural data set.
Thus executing the above system would further lead to the
agricultural growth of our country. The DBSCAN algorithm
helps us to forecast best range of best temperature, worst
temperature and rain fall to achieve higher production of
different crops. Clustering methods are associated using
quality metrics. Rendering to the analyses of clustering
quality metrics, DBSCAN gives the better clustering quality
REFERENCES
1. "A survey on pre-processing and postprocessing
techniques in data mining." Tomar, Divya, and Sonali
Al1;arwal International Journal of Database Theory &
Application 7.4 (2014).
2."Enhancing data analysis with noise removal." Xiong, Hui,
et al, IEEE Transactions on Knowledge andData Engineering
18.3 (2006): 304- 319.
3. “ Survey of Pre-processing Techniques for Mining Big
Data” JayaramHariharakrishnan*, Mohanavalli.S*,Srividya*,
Sundhara Kumar K.B** Department of Information
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1502
Technology, SSN College of Engineering Kalavakkam, Tamil
Nadu, India.IEEE[2017].
4. “Analysis of agriculturedata usingdata miningtechniques:
application of big data” Jharna Majumdar* , Sneha
Naraseeyappa and Shilpa Ankalaki [2017]
5. “Demand Based Crop Recommender system for Farmers”
S.Kanaga Suba Raja,Rishi.R,Sundarsan.E,Srijit.VDepartment
of Information Technology , Easwari Engineering College,
Chennai, India.[2017]
6. “Convex-Hull & DBSCAN Clustering to Predict Future
Weather” Ratul Dey , Sanjay Chakraborty Computer Science
& Engineering ,Institute of Engineering &
Management[2015]
7. “Application of Data Mining Tools forRiceYieldPrediction
on Clustered Regions of Bangladesh “.Mohammad Motiur
Rahman , Naheena Haq, Rashedur M Rahman Electrical and
Computer Engineering Department North South University
Dhaka, Bangladesh [2014]

More Related Content

What's hot

The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Science
theijes
 
Application of data mining tools for
Application of data mining tools forApplication of data mining tools for
Application of data mining tools for
IJDKP
 
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
Editor IJMTER
 

What's hot (20)

The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Science
 
IRJET-Precision Farming and Big Data
IRJET-Precision Farming and Big DataIRJET-Precision Farming and Big Data
IRJET-Precision Farming and Big Data
 
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUESGI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
 
IRJET- Improving the Performance of Smart Heterogeneous Big Data
IRJET- Improving the Performance of Smart Heterogeneous Big DataIRJET- Improving the Performance of Smart Heterogeneous Big Data
IRJET- Improving the Performance of Smart Heterogeneous Big Data
 
Application of data mining tools for
Application of data mining tools forApplication of data mining tools for
Application of data mining tools for
 
An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...
 
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop ClusterIRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
 
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
 
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
 
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its Analysis
 
Multidimensional schema for agricultural data warehouse
Multidimensional schema for agricultural data warehouseMultidimensional schema for agricultural data warehouse
Multidimensional schema for agricultural data warehouse
 
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
 
An efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysisAn efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysis
 
Enhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging areaEnhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging area
 
IRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease PredictionIRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease Prediction
 
Comparative study of frequent item set in data mining
Comparative study of frequent item set in data miningComparative study of frequent item set in data mining
Comparative study of frequent item set in data mining
 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
 
Data mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updatedData mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updated
 

Similar to IRJET- Survey of Estimation of Crop Yield using Agriculture Data

Augmentation of Customer’s Profile Dataset Using Genetic Algorithm
Augmentation of Customer’s Profile Dataset Using Genetic AlgorithmAugmentation of Customer’s Profile Dataset Using Genetic Algorithm
Augmentation of Customer’s Profile Dataset Using Genetic Algorithm
RSIS International
 
Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...
eSAT Journals
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
ijdpsjournal
 

Similar to IRJET- Survey of Estimation of Crop Yield using Agriculture Data (20)

Supervise Machine Learning Approach for Crop Yield Prediction in Agriculture ...
Supervise Machine Learning Approach for Crop Yield Prediction in Agriculture ...Supervise Machine Learning Approach for Crop Yield Prediction in Agriculture ...
Supervise Machine Learning Approach for Crop Yield Prediction in Agriculture ...
 
IRJET- Agricultural Productivity System
IRJET- Agricultural Productivity SystemIRJET- Agricultural Productivity System
IRJET- Agricultural Productivity System
 
Augmentation of Customer’s Profile Dataset Using Genetic Algorithm
Augmentation of Customer’s Profile Dataset Using Genetic AlgorithmAugmentation of Customer’s Profile Dataset Using Genetic Algorithm
Augmentation of Customer’s Profile Dataset Using Genetic Algorithm
 
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICSA STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
A STUDY OF TRADITIONAL DATA ANALYSIS AND SENSOR DATA ANALYTICS
 
Crop yield prediction.pdf
Crop yield prediction.pdfCrop yield prediction.pdf
Crop yield prediction.pdf
 
Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...
 
Agricultural Product Price and Crop Cultivation Prediction based on Data Scie...
Agricultural Product Price and Crop Cultivation Prediction based on Data Scie...Agricultural Product Price and Crop Cultivation Prediction based on Data Scie...
Agricultural Product Price and Crop Cultivation Prediction based on Data Scie...
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
 
IRJET- Fault Detection and Prediction of Failure using Vibration Analysis
IRJET-	 Fault Detection and Prediction of Failure using Vibration AnalysisIRJET-	 Fault Detection and Prediction of Failure using Vibration Analysis
IRJET- Fault Detection and Prediction of Failure using Vibration Analysis
 
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
 
IRJET- Customer Online Buying Prediction using Frequent Item Set Mining
IRJET- Customer Online Buying Prediction using Frequent Item Set MiningIRJET- Customer Online Buying Prediction using Frequent Item Set Mining
IRJET- Customer Online Buying Prediction using Frequent Item Set Mining
 
V33119122
V33119122V33119122
V33119122
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniques
 
A study and survey on various progressive duplicate detection mechanisms
A study and survey on various progressive duplicate detection mechanismsA study and survey on various progressive duplicate detection mechanisms
A study and survey on various progressive duplicate detection mechanisms
 
Comparison Between WEKA and Salford System in Data Mining Software
Comparison Between WEKA and Salford System in Data Mining SoftwareComparison Between WEKA and Salford System in Data Mining Software
Comparison Between WEKA and Salford System in Data Mining Software
 
IRJET- A Review of Data Mining and its Methods Used in Manufacturing and How ...
IRJET- A Review of Data Mining and its Methods Used in Manufacturing and How ...IRJET- A Review of Data Mining and its Methods Used in Manufacturing and How ...
IRJET- A Review of Data Mining and its Methods Used in Manufacturing and How ...
 
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
 
IRJET- Agricultural Data Modeling and Yield Forecasting using Data Mining...
IRJET-  	  Agricultural Data Modeling and Yield Forecasting using Data Mining...IRJET-  	  Agricultural Data Modeling and Yield Forecasting using Data Mining...
IRJET- Agricultural Data Modeling and Yield Forecasting using Data Mining...
 
IRJET- Comparative Analysis of Data Mining Classification Techniques for Hear...
IRJET- Comparative Analysis of Data Mining Classification Techniques for Hear...IRJET- Comparative Analysis of Data Mining Classification Techniques for Hear...
IRJET- Comparative Analysis of Data Mining Classification Techniques for Hear...
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
 

More from IRJET Journal

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Christo Ananth
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
rknatarajan
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 

Recently uploaded (20)

result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 

IRJET- Survey of Estimation of Crop Yield using Agriculture Data

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1500 Survey of Estimation of Crop Yield using Agriculture Data Anu priya K R Dept of computer Science and Engineering, New Horizon College of Engineering, Bangalore, Karnataka, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - Data mining techniques are increasingly being used in finding useful pattern from large data sets .Huge amount of data is being produced day by day from various sources and are stored in data sets .halfofthepeopleinIndia depends on agriculture for their maintenance .Data mining plays a major role in estimation of crop production depending upon the various data sets such as soil verify, rainfall, humidity, temperature, cities etc .Thispaperfocuses on analysing agriculture data ,pre-processing the data to remove the noise and find the optimal parametertoincrease the crop yield which is more accurate and optimal .This paper also makes an effort to understand the different complications to eliminate the noise from the initial dataset ,know the problems of various algorithms used in earlier approaches. Key Words: Data cleaning; Big data; Data preprocessing Optimal yield; Noise elimination. 1. INTRODUCTION Big data deal with large amount of data which cannot be accommodated by a normal computer in terms of storage, processing and computation and data analysis technique [1].As the size of data is increasing every moment which causes complexity in terms of volume, variety, velocity, veracity as a result it requires more storage, computation and speed should be more[2]. Data mining on this big data bit a way of removing noise from the initial dataset which is called pre-processing using some of the pre-processing techniques.Rapid fluctuationare common in market in such scenario it is difficult for the farmers to choose the type of crop to be grown in land or estimate the yield. Data mining is the procedure of using huge data set to infer important hidden knowledge. Data mining is divided into 7 methods:  Data Cleaning  Data integration  Data selection  Data transformation  Data mining  Pattern estimation  Knowledge display Figure 1.shows the different stages in data mining Along with this in today's applications mostly all of them involve crowd sourcing and semantic input for improved data analysis in such an environment the tracing and analysing data provenance of the data discovery and integration of several parallel and distributed computing generated data is the scope for implementing newer exploratory data analysis and interpretation methodswhere in the, integration of heterogeneous data, and developing new models for massive data computation is possible. The main purpose of data cleaning is to manipulate and transform raw data so that the informationcontentenfolded in the data set can be exposed, or made more easily accessible. The quality of data is very pivotal in many organizations as it maintains the basic characteristics of cleaned data. So it's necessary to obtain a proper sample data for knowledge extraction. Preprocessing canproducea smaller data set than the original, which allows us to improve efficiency in the data mining process. This performance includes data reduction techniques such as feature selection, sampling, instance selection and discretization. There are many methods for cleaning noisyandinconsistent data like:  Filter-based Method  Imputation Method  Hybrid method  Wrapper method  Global imputation of stray and non-missing attribute
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1501  Embedded Method Data mining involves six common classes of tasks  Detection of anomalies (detection of outliers / change / deviation): documentation of rare data records, which may be stimulants or data errors that require further study.  Learning associationrules(dependencemodeling): observing for relationships between variables. For example, a shop cancollectdata oncustomer buying habits. Using the rules of the learning association, the shop can regulate which products are regularly bought together and use this information for marketing purposes. This is sometimes mentioned to as an study of the market basket.  Grouping: the task of determining groups and structures in the data that in one wayoranotherare "similar", without using structures known in the data.  Classification: it is the task of simplifying the known structure to apply it to innovative data. For example, an email program mighttrytocategorizea message as "legitimate" or "spam".  Regression: try to find a function that models the data with the lowest error, to estimate the relationships between data or data sets.  Summary: provides a more solid illustration of the data set, including viewing and reporting. LITERATURE SURVEY Mohammad Motiur Rahman Naheena Haq, Rashedur M Rahman have done an analysis on forecasting the rice yield of Bangladesh. The study has shown that we can have good relationship designed among the rice yield and the climatic variable of a given area within certain threshold. This study has helped the people concerned withagriculturesuchasthe farmer etc to make decisions to diminish their loss in the following years. Jayaram Hariharakrishnan, Mohanavalli .S, Srividya, Sundhara Kumar K.B carried survey on various data cleaning methods such as Filter method, Imputationmethod and Wrapper method. It focused on how efficient this data cleaning techniques where in terms of blank spaces and missing values. It comparesdifferentdata miningtechniques and suggest which data cleaning technique is best Ratul Dey, Sanjay Chakraborty , this paper help us understand how incremental DBSCAN clustering algorithm and Convex-Hull can be used to predict the weather of the upcoming days. In this they have shown that these algorithms are suitable for active database where the weather data keeps changing regularly. The accuracy of the techniques used is calculated based upon the hit and the miss times. S.Kanaga Suba Raja,Rishi .R,Sundarsan.E,Srijitfocusesontry to predict crop yield and value by analyzing patterns in past data.The technique used is Sliding Window and Non linear Regression to predict several features that influence the agriculture production such as rainfall,Temprature etc..The proposed system helps to decrease the harm faced by the farmers and would recover the crop yield. Jharna Majumdar, Sneha Naraseeyappa and Shilpa Ankalaki, the survey tells us that data mining techniques are very necessary approach for estimation of various crop yields. This paper focuses on the analysis agricultural dataset and find optimal solution for maximizing the crop production. PAM, CLARA etc are some of the different data mining techniques that are been used. III. PROPOSED SYSTEM Several data mining techniques are carried on the input dataset to accomplish the finest performance yielding method. Clustering method can obtain better results. DBSCAN algorithm can be used to estimate the crop yield by considering various datasets and forming the clusters for making decisions. DBSAN is resistant to noise and can handle clusters of various shapes and sizes. The results obtained are more optimal and efficient hence farmers can make better decision. IV. CONCLUSIONS A good clustering algorithm should be able to forecast the crop yield which is more effective and optimal compared to other clustering algorithms. There are numerous clustering algorithms for predicting crop yield based upon the various agricultural data set. Thus executing the above system would further lead to the agricultural growth of our country. The DBSCAN algorithm helps us to forecast best range of best temperature, worst temperature and rain fall to achieve higher production of different crops. Clustering methods are associated using quality metrics. Rendering to the analyses of clustering quality metrics, DBSCAN gives the better clustering quality REFERENCES 1. "A survey on pre-processing and postprocessing techniques in data mining." Tomar, Divya, and Sonali Al1;arwal International Journal of Database Theory & Application 7.4 (2014). 2."Enhancing data analysis with noise removal." Xiong, Hui, et al, IEEE Transactions on Knowledge andData Engineering 18.3 (2006): 304- 319. 3. “ Survey of Pre-processing Techniques for Mining Big Data” JayaramHariharakrishnan*, Mohanavalli.S*,Srividya*, Sundhara Kumar K.B** Department of Information
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1502 Technology, SSN College of Engineering Kalavakkam, Tamil Nadu, India.IEEE[2017]. 4. “Analysis of agriculturedata usingdata miningtechniques: application of big data” Jharna Majumdar* , Sneha Naraseeyappa and Shilpa Ankalaki [2017] 5. “Demand Based Crop Recommender system for Farmers” S.Kanaga Suba Raja,Rishi.R,Sundarsan.E,Srijit.VDepartment of Information Technology , Easwari Engineering College, Chennai, India.[2017] 6. “Convex-Hull & DBSCAN Clustering to Predict Future Weather” Ratul Dey , Sanjay Chakraborty Computer Science & Engineering ,Institute of Engineering & Management[2015] 7. “Application of Data Mining Tools forRiceYieldPrediction on Clustered Regions of Bangladesh “.Mohammad Motiur Rahman , Naheena Haq, Rashedur M Rahman Electrical and Computer Engineering Department North South University Dhaka, Bangladesh [2014]