SlideShare a Scribd company logo
1 of 3
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1786
HOUSE COST PREDICTION USING DATA SCIENCE AND MACHINE
LEARNING
Anuj V. Kumar1, Anshuman Kumar2, Satish S. Tiwari3, Sneha G. Gobade4, Prof. Amita Suke5
1,2,3,4,5CSE, Wainganga College of Engineering and Management, Nagpur, India.
-------------------------------------------------------------------------***----------------------------------------------------------------------
Abstract- This project is based on data science and machine
learning to predict the cost of house by collected data for
housing price to analyze data and predict the cost of new
house. Predicting housing cost based on three important
factor which include physical condition, Concept for building
and location. First we explore the data for analyzing by
plotting different graph and next, split the data into training
set and testing set and using different regression method to
predict the price. There are various method to such this and
each method has pros and cons. I think regression is one of
the most important method because it gives all more insight
about the data by using different regression technique to
improve the performance of the model.
Keywords: data science, machine learning, linear
regression, multiple linear regression, MSE, RMSE,
polynomial regression, K-NN regression, R2 square,
adjusted R2 square.
I. INTRODUCTION
In day to day life population is increasing, everyone
wants house for living. So, this is very difficult to predict
the cost of house, so there is a need of the system to
predict the cost of price in future. The system help the
developer to the determine the selling price of house and
help customer to arrange the right time to purchase a
house. Regression is best method of prediction to predict
the cost of house. To predict the cost of house price person
usually tries to locate similar properties his or her locate
similar properties his or her neighborhood and based on
collected data that predict the cost of house. All these
analysis of data collection emerging the area of prediction,
which is required for machine learning for create the
model of who predicts the cost of house by linear
regression, also apply multiple regression which is more
accurate predict the cost of house. Also available various
regression technique to predict the good price of the data
like polynomial regression, KN-regression, importing
modules, reading the dataset and defining an evaluation
table. This data frame includes root mean squared error
(RMSE), R-squared, adjusted R-squared and mean of the R-
square value obtained by the k-fold cross validation.
II. RELATED WORK
The biggest challenges faced by researchers is to optimum
number of feature that will help to predict the accurate
direction of house prices. It can mention that productivity
growth in various residential construction sectors does
impact the growth of the housing prices. [2]
In datasets various attributes utilizes such as ID, date,
price, bedroom, bathroom, sqft-living, sqft-lot, floors,
water tank, view, yr-renovated, Zip code, flat, long.
Another biggest challenge that is faced by the researchers
is to find out various machine learning technique that will
be more effective for accurate predicting the cost of house.
The current work is unique compared to other problem
from regression perspective. That tries to predict the cost
of house but in regression less feature take for finding
more accurate result apply different regression technique.
2.1 Install
This project requires anaconda python, because below
libraries already available.
 Numpy
 Matplotlib
 Seaborn
 Skit-learn
 Pandas
Also need to have software. Install, run and execute a
Jupyter notebook.
Anaconda python provide various features that will really
helpful for us such as ‘help‘one of the keyword that is used
to take library help and also learn sum extra features.
2.2 Data
The housing data consist of 21614, Dataset, and 21
feature(attributes). This dataset is a modified version of
the housing dataset found on kaggle.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1787
2.3 Features
Id, Date, price, bedroom, bathroom, sqft-living, sqft-lot,
floors, water tank, view, grade, sqft-above, sqft-base, Yr-
built.
There are number of features available some of important
features given below
 Price: Price of each room
 Bedroom: number of bedroom in each house
 Bathroom: Number of bathroom in each house
 Sqft-living: Square fit area for living
 Yr-built: Year of built
 Yr-renovated: Year of renovated
2.4 Data exploration
In this first section of this project you will make a cursory
investigation about the housing data and provide your
observations.
Familiarizing ourself with the data through discovery.
Process is a fundamental practice to help you for better
understanding and justify with results. [2]
The main aim of this project is a construct a regression
model which is predicting the cost of house.
As different research are done by author using various
machine learning algorithm, it is seen then predicting real
estate cost is a complex study.[1]
We see various papers and conclude that most of them do
not predict the future cost of house. Location is an
important factor for predicting the price of a house. This is
because the location determine the prevailing land price of
the required house .[4]
III. METHODOLOGY
The methodology section of a research gives two main
answer to the question how was the data collected and
how was it analyzed.
We have undertaken various Machine learning concepts
Fig.3.1. Steps of methodology
We are taking row data, preprocessing given dataset after
that we analyze the data by plotting different types of
graph. Then apply different regression for finding good
accuracy.
IV. CONCLUSION
In the present real estate world population is
continuously increasing. People are not able predict
accurate price of house. In today real world, it has become
different to staring. House amount of data. The making of
different model applied optimal use of the regression
algorithm. The model use such data in more efficient way,
the regression algorithm fulfill the customer requirement
and reducing the risk of the investment in an estate. A lot
of features added make model more widely acceptable.
after applying different regression technique we will see
which technique can well classifier fit to regression
problem.
V. FUTURE WORK
We suggest that working on large data sets, better
result can be found. We have taken only regression
algorithm for predicting the cost of house applying
different regression algorithm to improve the performance
of model. This model is useful for development of
application for various cities.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1788
VI. REFERENCES
[1].Nehal N Ghosalkar “Real Estate Value Prediction”
nehal.ghosalkar@spit.ac.in.
[2].B. Balakumar, P. Raviraj, S. EssakKiammal “Predicting
Housing prices using Machine Learning Techniques”.
[3].An Nguyen “Housing price Prediction” March2018.
[4].Adyan Alfiyatin, Hilman Taufiq, Ruth Febrita, Wayan
Mahmudy “Modeling House Price Prediction Using
Regression Analysis and Particle Swarm Optimization”
International Journal of Advanced Science and Application
2017.
[5]Shantanu Chakraborty, “Application of Incentive Based
Scoring Rule Deciding Pricing for Smart
Houses”shantanu.chakraborty@nitech.ac.jp.
[6]Danping Ren, Hui Li, Yuefeng Ji, “House energy
management system for the residential load control based
on the price prediction” 2011.
[7]Rushab Sawant, Saurabh Jain, Tushar Tiwari, Yashwant
Jangid, Anita Gupta, “A multi feature based Housing Price
Prediction For Indian Market Using Machine Learning”
International journal of computer & Mathematical Science
December 2017.
[8]Azme Bin Khamis, Nur Khalidah Khalilah Binti
Kamarudin, “Comparative Study On Estimate House Price
Using Statistical And Neural Network model” International
Journal of Science & Technology Research Volume
3.December 2014.

More Related Content

Similar to IRJET- House Cost Prediction using Data Science and Machine Learning

House Price Prediction for Ai & ml project .pptx
House Price Prediction for Ai & ml project .pptxHouse Price Prediction for Ai & ml project .pptx
House Price Prediction for Ai & ml project .pptx
aditichauhan2019404
 

Similar to IRJET- House Cost Prediction using Data Science and Machine Learning (20)

Review on House Price Prediction through Regression Techniques
Review on House Price Prediction through Regression TechniquesReview on House Price Prediction through Regression Techniques
Review on House Price Prediction through Regression Techniques
 
STOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUES
STOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUESSTOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUES
STOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUES
 
IRJET- Logistics Network Superintendence Based on Knowledge Engineering
IRJET- Logistics Network Superintendence Based on Knowledge EngineeringIRJET- Logistics Network Superintendence Based on Knowledge Engineering
IRJET- Logistics Network Superintendence Based on Knowledge Engineering
 
Web Application for House Price Prediction
Web Application for House Price PredictionWeb Application for House Price Prediction
Web Application for House Price Prediction
 
IRJET - House Price Prediction using Machine Learning and RPA
 IRJET - House Price Prediction using Machine Learning and RPA IRJET - House Price Prediction using Machine Learning and RPA
IRJET - House Price Prediction using Machine Learning and RPA
 
IRJET- House Rent Price Prediction
IRJET- House Rent Price PredictionIRJET- House Rent Price Prediction
IRJET- House Rent Price Prediction
 
House Price Prediction for Ai & ml project .pptx
House Price Prediction for Ai & ml project .pptxHouse Price Prediction for Ai & ml project .pptx
House Price Prediction for Ai & ml project .pptx
 
Template-03 - FYP-BCSM-F22-023.docx
Template-03 - FYP-BCSM-F22-023.docxTemplate-03 - FYP-BCSM-F22-023.docx
Template-03 - FYP-BCSM-F22-023.docx
 
IRJET - Company’s Stock Price Predictor using Machine Learning
IRJET -  	  Company’s Stock Price Predictor using Machine LearningIRJET -  	  Company’s Stock Price Predictor using Machine Learning
IRJET - Company’s Stock Price Predictor using Machine Learning
 
Cost Forecasting of Construction Materials: A review
Cost Forecasting of Construction Materials: A reviewCost Forecasting of Construction Materials: A review
Cost Forecasting of Construction Materials: A review
 
bhagat.pdf
bhagat.pdfbhagat.pdf
bhagat.pdf
 
IRJET- Share Market Prediction using Deep Learning Approach
IRJET- Share Market Prediction using Deep Learning ApproachIRJET- Share Market Prediction using Deep Learning Approach
IRJET- Share Market Prediction using Deep Learning Approach
 
Predicting house price
Predicting house pricePredicting house price
Predicting house price
 
IRJET- Rice QA using Deep Learning
IRJET- Rice QA using Deep LearningIRJET- Rice QA using Deep Learning
IRJET- Rice QA using Deep Learning
 
IRJET - Job Portal Analysis and Salary Prediction System
IRJET -  	  Job Portal Analysis and Salary Prediction SystemIRJET -  	  Job Portal Analysis and Salary Prediction System
IRJET - Job Portal Analysis and Salary Prediction System
 
AUGMENTED REALITY FOR REAL ESTATE
AUGMENTED REALITY FOR REAL ESTATEAUGMENTED REALITY FOR REAL ESTATE
AUGMENTED REALITY FOR REAL ESTATE
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
Business Utility Application
Business Utility ApplicationBusiness Utility Application
Business Utility Application
 
Bitcoin Price Prediction Using LSTM
Bitcoin Price Prediction Using LSTMBitcoin Price Prediction Using LSTM
Bitcoin Price Prediction Using LSTM
 

More from IRJET Journal

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
pritamlangde
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
 

Recently uploaded (20)

Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdf
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .ppt
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 

IRJET- House Cost Prediction using Data Science and Machine Learning

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1786 HOUSE COST PREDICTION USING DATA SCIENCE AND MACHINE LEARNING Anuj V. Kumar1, Anshuman Kumar2, Satish S. Tiwari3, Sneha G. Gobade4, Prof. Amita Suke5 1,2,3,4,5CSE, Wainganga College of Engineering and Management, Nagpur, India. -------------------------------------------------------------------------***---------------------------------------------------------------------- Abstract- This project is based on data science and machine learning to predict the cost of house by collected data for housing price to analyze data and predict the cost of new house. Predicting housing cost based on three important factor which include physical condition, Concept for building and location. First we explore the data for analyzing by plotting different graph and next, split the data into training set and testing set and using different regression method to predict the price. There are various method to such this and each method has pros and cons. I think regression is one of the most important method because it gives all more insight about the data by using different regression technique to improve the performance of the model. Keywords: data science, machine learning, linear regression, multiple linear regression, MSE, RMSE, polynomial regression, K-NN regression, R2 square, adjusted R2 square. I. INTRODUCTION In day to day life population is increasing, everyone wants house for living. So, this is very difficult to predict the cost of house, so there is a need of the system to predict the cost of price in future. The system help the developer to the determine the selling price of house and help customer to arrange the right time to purchase a house. Regression is best method of prediction to predict the cost of house. To predict the cost of house price person usually tries to locate similar properties his or her locate similar properties his or her neighborhood and based on collected data that predict the cost of house. All these analysis of data collection emerging the area of prediction, which is required for machine learning for create the model of who predicts the cost of house by linear regression, also apply multiple regression which is more accurate predict the cost of house. Also available various regression technique to predict the good price of the data like polynomial regression, KN-regression, importing modules, reading the dataset and defining an evaluation table. This data frame includes root mean squared error (RMSE), R-squared, adjusted R-squared and mean of the R- square value obtained by the k-fold cross validation. II. RELATED WORK The biggest challenges faced by researchers is to optimum number of feature that will help to predict the accurate direction of house prices. It can mention that productivity growth in various residential construction sectors does impact the growth of the housing prices. [2] In datasets various attributes utilizes such as ID, date, price, bedroom, bathroom, sqft-living, sqft-lot, floors, water tank, view, yr-renovated, Zip code, flat, long. Another biggest challenge that is faced by the researchers is to find out various machine learning technique that will be more effective for accurate predicting the cost of house. The current work is unique compared to other problem from regression perspective. That tries to predict the cost of house but in regression less feature take for finding more accurate result apply different regression technique. 2.1 Install This project requires anaconda python, because below libraries already available.  Numpy  Matplotlib  Seaborn  Skit-learn  Pandas Also need to have software. Install, run and execute a Jupyter notebook. Anaconda python provide various features that will really helpful for us such as ‘help‘one of the keyword that is used to take library help and also learn sum extra features. 2.2 Data The housing data consist of 21614, Dataset, and 21 feature(attributes). This dataset is a modified version of the housing dataset found on kaggle.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1787 2.3 Features Id, Date, price, bedroom, bathroom, sqft-living, sqft-lot, floors, water tank, view, grade, sqft-above, sqft-base, Yr- built. There are number of features available some of important features given below  Price: Price of each room  Bedroom: number of bedroom in each house  Bathroom: Number of bathroom in each house  Sqft-living: Square fit area for living  Yr-built: Year of built  Yr-renovated: Year of renovated 2.4 Data exploration In this first section of this project you will make a cursory investigation about the housing data and provide your observations. Familiarizing ourself with the data through discovery. Process is a fundamental practice to help you for better understanding and justify with results. [2] The main aim of this project is a construct a regression model which is predicting the cost of house. As different research are done by author using various machine learning algorithm, it is seen then predicting real estate cost is a complex study.[1] We see various papers and conclude that most of them do not predict the future cost of house. Location is an important factor for predicting the price of a house. This is because the location determine the prevailing land price of the required house .[4] III. METHODOLOGY The methodology section of a research gives two main answer to the question how was the data collected and how was it analyzed. We have undertaken various Machine learning concepts Fig.3.1. Steps of methodology We are taking row data, preprocessing given dataset after that we analyze the data by plotting different types of graph. Then apply different regression for finding good accuracy. IV. CONCLUSION In the present real estate world population is continuously increasing. People are not able predict accurate price of house. In today real world, it has become different to staring. House amount of data. The making of different model applied optimal use of the regression algorithm. The model use such data in more efficient way, the regression algorithm fulfill the customer requirement and reducing the risk of the investment in an estate. A lot of features added make model more widely acceptable. after applying different regression technique we will see which technique can well classifier fit to regression problem. V. FUTURE WORK We suggest that working on large data sets, better result can be found. We have taken only regression algorithm for predicting the cost of house applying different regression algorithm to improve the performance of model. This model is useful for development of application for various cities.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1788 VI. REFERENCES [1].Nehal N Ghosalkar “Real Estate Value Prediction” nehal.ghosalkar@spit.ac.in. [2].B. Balakumar, P. Raviraj, S. EssakKiammal “Predicting Housing prices using Machine Learning Techniques”. [3].An Nguyen “Housing price Prediction” March2018. [4].Adyan Alfiyatin, Hilman Taufiq, Ruth Febrita, Wayan Mahmudy “Modeling House Price Prediction Using Regression Analysis and Particle Swarm Optimization” International Journal of Advanced Science and Application 2017. [5]Shantanu Chakraborty, “Application of Incentive Based Scoring Rule Deciding Pricing for Smart Houses”shantanu.chakraborty@nitech.ac.jp. [6]Danping Ren, Hui Li, Yuefeng Ji, “House energy management system for the residential load control based on the price prediction” 2011. [7]Rushab Sawant, Saurabh Jain, Tushar Tiwari, Yashwant Jangid, Anita Gupta, “A multi feature based Housing Price Prediction For Indian Market Using Machine Learning” International journal of computer & Mathematical Science December 2017. [8]Azme Bin Khamis, Nur Khalidah Khalilah Binti Kamarudin, “Comparative Study On Estimate House Price Using Statistical And Neural Network model” International Journal of Science & Technology Research Volume 3.December 2014.