Oral Presentation

The document discusses predicting house prices in Bengaluru, India using linear and multiple regression techniques on a dataset of 1298 locations. It describes preprocessing the data, developing linear, lasso and decision tree models, and finding that multiple linear regression achieved 85% accuracy. A web app was created to provide price predictions based on the model.

Uploaded by

aryanrajesh6702

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

Oral Presentation

Uploaded by

aryanrajesh6702

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

✓ House Price Prediction

✓ Aryan Rajesh, Yatheendra Reddy

✓ School of Computer Science and

Engineering, VIT University, Vellore,
Tamil Nadu, India.

✓ [email protected] ,
[email protected]
✓ Highlights (what was firstly discovered and why is it abreakthrough)
Using housing data from kaggle to build prediction models. The data often includes attributes like square footage,
location, number of bedrooms etc. Location data requires special preprocessing as it has an outsized impact on
prices. Techniques like one-hot encoding for neighborhoods are common. Identifying and removing outliers also very
important. Predictive accuracy in the 70-80% range on unseen test data is considered quite good. Model
interpretability also important to understand which factors are most influential. Sometimes insights are extracted -
e.g. ranking locations by average price per square foot. These add business value over just predictions.
✓ Abstract (self-explanation of the main discovery)
This study addresses house price prediction in Bengaluru using linear and multiple regression techniques. Utilizing a
dataset of 1298 unique localities, the research focuses on forecasting land prices in the Bengaluru Metropolitan
Area (BMA) in Karnataka, India. Beyond the House Price Index (HPI), factors such as area type, availability, location,
society, and apartment size are considered. The goal is to predict the price per square foot for apartments. In
metropolitan cities like Bengaluru, determining accurate sales prices remains challenging, making predictive
modeling crucial for real estate decision-making. The models aim to capture the complex interplay of these factors
in influencing individual house prices in the dynamic real estate market of Bengaluru.
✓ Introduction (clarify the complexity of the topic and justify the urgency to investigate the
research hypothesis)
Housing is an essential human need and real estate markets impact economies. Accurately valuing properties is
important but challenging due to many influencing factors. Prices depend on attributes like size, rooms, location.
Regression techniques used to predict sales prices. Bengaluru, India sees rising housing demand. Buyers consider
amenities, area, facilities when purchasing. Study develops model to forecast Bengaluru house prices per square foot
using machine learning algorithms. Based on dataset of 1298 locations with details like number of rooms, baths,
location features. Compares linear regression, lasso regression and decision tree models. Tunes data by handling
outliers, missing values. Multiple linear regression provides 85% accuracy in final model. Location data requires
preprocessing as it strongly influences prices. Model helps determine fair valuations across many neighborhoods.
Enables real-world usage via web interface that provides price estimates. Demonstrates feasibility of applying
machine learning to complex real estate market. Overall a breakthrough in bringing efficiency, transparency and
analytical rigor to property pricing. Has implications for home buyers, investors, developers by accounting for many
parameters.
✓ Methods (reproducible instructions to confirm or disprove a research hypothesis)
Data Set:
Utilized the "Bangalore_House_data prediction" dataset with 13320 rows and 9 columns. The target variable is "price," which is to
be predicted.
Data Preprocessing and Integration:
Cleaning: Ensured data quality by addressing missing values through mean or median replacement.
Refinement: Improved model efficiency by removing irrelevant columns, focusing on essential data.
Outlier Detection:
Identification: Identified outliers using statistical measures such as interquartile range and visualizations like boxplots.
Handling: Employed the removeOutliers function to enhance data accuracy by eliminating outliers.
Data Visualization:
Techniques: Utilized box plots for effective visualization, specifically focusing on trends in location area vs. prices.
Purpose: Enhanced interpretability and understanding of data patterns, aiding subsequent modeling.
Test Train Split:
Procedure: Applied the train_test_split() method, allocating 75% of data for training and 25% for testing.
Objective: Facilitated robust model evaluation by segregating data appropriately.
Machine Learning Models:
Linear Regression:
Modeling Approach: Developed a supervised machine learning model capturing a linear relationship
between dependent and independent variables.
Representation: Y=a0+a1X+εY=a0+a1X+ε.
Focus: Predicted house prices based on individual factors.
Multiple Linear Regression:
Approach: Explored relationships between house prices and multiple independent variables.
Utility: Identified and utilized various factors contributing to house price prediction.
Random Forest:
Advantages: Handled missing values efficiently, maintained accuracy, and addressed overfitting
concerns.
Implementation: Developed decision trees based on random data and variable selection.
Application: Particularly effective for predicting house prices in large datasets.
✓ Results (present only the most environmentally and industrially important results, make
sure to provide corresponding units)
After the preprocessing and visualization of our dataset, we realized that for a certain number of
attributes we could use a few models such as Multiple linear regression, Lasso Regression, Decision tree
etc. Further evaluating through GridSearchcv, we observed that multiple linear regression was the best
suitable model giving the best scores. Hence we were able to evaluate our model successfully by using
MSE, R square, RMSE as our evaluation metrics and obtain an accuracy of 85% and therefore predict the
price of various houses in Bangalore by taking in the final parameters as location, area in sq ft, bathroom
and BHK.
We also compared the three models that we have used and found linear regression to be the best
among them and we visualized the result in the form of a bar chart. We further tried to establish a
website which took in all the parameters such as Area(in sq ft), BHK, No. of bathrooms and the locality
and in turn give in the price prediction for the house using the multiple regression model we used which
gave us the best accuracy among the 3 models we choose.
A pickle model is exported from the notebook. The model is integrated into a simple and userfriendly website by
using the flask server and API requests received from the user are given a suitable HTML server side response from
the model imported here.
The working looks like this -
✓ Discussion (critically compare your results to existing literature and reveal the
mechanism that might caused the differences, identify the weakneses of your methods,
do not ignore (economic) reality, identify promising directions for futureresearch)
Our project findings align with Wang et al. (2021) and Varma et al. (2018), employing machine learning algorithms like
Linear Regression for accurate house price prediction. Similar to Varma et al. (2018), our study utilized machine
learning techniques achieving consumer satisfaction with accurate outputs. However, in contrast to Phan (2018), who
employed Random Forest algorithms, we found Multiple Linear Regression to be the optimal model among
alternatives. A limitation of our approach is the exclusive reliance on machine learning algorithms, potentially
overlooking crucial economic factors influencing house prices, as noted in the literature by Phan (2018). Future
research directions should integrate economic and real estate indicators into machine learning models, addressing
these limitations for a more comprehensive prediction approach. Moreover, incorporating advanced data visualization
techniques like Augmented Reality, proposed by Varma et al. (2018), could enhance user experience and decision-
making in real estate, offering avenues for further research. While our findings align with existing literature, we
acknowledge the need for future research integrating economic reality, advanced visualization, and a broader set of
influencing variables for more accurate and comprehensive house price prediction models. Such research would bridge
the gap between machine learning and economic reality, providing enhanced tools for real estate stakeholders.
✓ Conclusions (make sure you are building synthesis above your discussion and not
repeating your results, clearly indicate whether the research hypothesis tends to be
confirmed or not and whether the concept seems to be industrially promising
(economically sustainable))
The main goal of this project is to determine the house price prediction which we have successfully done using
different machine learning algorithms like a Linear Regression, Lasso and Decision Tree. It is quite evident from our
evaluation that the Linear Regression model has more accuracy inwhen compared to the others. Moreover, our project
provides a way to find the attributes contribution in prediction. Hence we could conclude that this project would be
helpful to a variety of people. The above models of prediction are very efficient from the point of view of linearly
dependent data. Thus we use the linear regression techniques. The Exploratory Data Analysis helps us to visualize the
data better and decide which regression technique must be deployed. We use the scatter plot to compare the
dependent variables and the bar plot to compare individual model accuracy which helps us best decide which model
should be used. Different accuracies might be possible for the same model when we are using the train_test_split with
different values for the test_size attribute. Currently we have used 90% data for train and 10% for test. The
GridSearchCV should be further calibrated such that is it capable of not only handling more parameters for a given
model but also handling more models at a time.

NIT2202 Group Assignment Requirements 2024-H2B3 - 241017 - 155239
No ratings yet
NIT2202 Group Assignment Requirements 2024-H2B3 - 241017 - 155239
5 pages
KIIT Deemed To Be University: A Project Report
No ratings yet
KIIT Deemed To Be University: A Project Report
33 pages
SSRN Id3565512
No ratings yet
SSRN Id3565512
5 pages
Advanced Regression Techniques Based Housing Price Prediction Model
No ratings yet
Advanced Regression Techniques Based Housing Price Prediction Model
11 pages
House_Price_Prediction_using_AI[1]
No ratings yet
House_Price_Prediction_using_AI[1]
12 pages
A14 Abstract
No ratings yet
A14 Abstract
2 pages
Final Defence
No ratings yet
Final Defence
55 pages
Real Estate Price Prediction
No ratings yet
Real Estate Price Prediction
7 pages
House Price Prediction using AI
No ratings yet
House Price Prediction using AI
14 pages
Comparative Study of House Price Prediction Using Machine Learning Research Paper
No ratings yet
Comparative Study of House Price Prediction Using Machine Learning Research Paper
14 pages
Real-Estate Property
No ratings yet
Real-Estate Property
11 pages
Abstract Machine Learning Has Been Instrumental Across Diver
No ratings yet
Abstract Machine Learning Has Been Instrumental Across Diver
6 pages
Real Estate Price Prediction Model
No ratings yet
Real Estate Price Prediction Model
3 pages
CSIC 6132 排版870 878
No ratings yet
CSIC 6132 排版870 878
9 pages
House Price Prediction - Research Paper FINAL DRAFT
100% (1)
House Price Prediction - Research Paper FINAL DRAFT
10 pages
Utkarsh Gupta - House Price Prediction
No ratings yet
Utkarsh Gupta - House Price Prediction
6 pages
intership report
No ratings yet
intership report
20 pages
Machine Learning Based Predicting House Prices Using Regression Techniques
No ratings yet
Machine Learning Based Predicting House Prices Using Regression Techniques
7 pages
Bangalore House Price Prediction Using The Best Machine Learning Model Submitted by Rukzana Vadakkekudy Rassak P2682221
No ratings yet
Bangalore House Price Prediction Using The Best Machine Learning Model Submitted by Rukzana Vadakkekudy Rassak P2682221
9 pages
Data Science Assignment Chapter 1
No ratings yet
Data Science Assignment Chapter 1
5 pages
HOUSE PRICE PREDICTION
No ratings yet
HOUSE PRICE PREDICTION
17 pages
Khare 2021 IOP Conf. Ser. Mater. Sci. Eng. 1099 012053
No ratings yet
Khare 2021 IOP Conf. Ser. Mater. Sci. Eng. 1099 012053
15 pages
Bangalore House Price Prediction
No ratings yet
Bangalore House Price Prediction
4 pages
Sample Synopsis
No ratings yet
Sample Synopsis
4 pages
Minor Project Report
No ratings yet
Minor Project Report
23 pages
Survey Paper Updated
No ratings yet
Survey Paper Updated
12 pages
SSRN Id4413863
No ratings yet
SSRN Id4413863
5 pages
House Price Prediction Report
No ratings yet
House Price Prediction Report
2 pages
Utkarsh Gupta G (73) (House Price Prediction)
No ratings yet
Utkarsh Gupta G (73) (House Price Prediction)
6 pages
Comprehensive Project
No ratings yet
Comprehensive Project
10 pages
ml project clg (2)
No ratings yet
ml project clg (2)
62 pages
Fyp Proposal
No ratings yet
Fyp Proposal
3 pages
House Price Prediction Using Machine Learning: © MAY 2021 - IRE Journals - Volume 4 Issue 11 - ISSN: 2456-8880
No ratings yet
House Price Prediction Using Machine Learning: © MAY 2021 - IRE Journals - Volume 4 Issue 11 - ISSN: 2456-8880
5 pages
Phase 5
No ratings yet
Phase 5
5 pages
Housepriceprediction ML 221104055342 Fb5109ae
No ratings yet
Housepriceprediction ML 221104055342 Fb5109ae
17 pages
Bangalore House Price Prediction
No ratings yet
Bangalore House Price Prediction
5 pages
House Prices
No ratings yet
House Prices
5 pages
Project1 Report1
No ratings yet
Project1 Report1
3 pages
Bangalore House Price Prediction
No ratings yet
Bangalore House Price Prediction
5 pages
BDA_REPORT
No ratings yet
BDA_REPORT
27 pages
Updated_House_Price_Prediction_Report
No ratings yet
Updated_House_Price_Prediction_Report
5 pages
Synopsis Format1.PDF
No ratings yet
Synopsis Format1.PDF
6 pages
House Price Prediction 3 47
No ratings yet
House Price Prediction 3 47
45 pages
Topic - Mini Research Project (CIA 4)
No ratings yet
Topic - Mini Research Project (CIA 4)
4 pages
R D National College Mumbai University: On "House Price Prediction System"
No ratings yet
R D National College Mumbai University: On "House Price Prediction System"
14 pages
House price predictor ppt Project
No ratings yet
House price predictor ppt Project
13 pages
UtkarshGupta (House Price Prediction)
No ratings yet
UtkarshGupta (House Price Prediction)
14 pages
MBB JETIR2204579
No ratings yet
MBB JETIR2204579
5 pages
House Price Prediction Using Machine Learning: Bachelor of Technology
No ratings yet
House Price Prediction Using Machine Learning: Bachelor of Technology
20 pages
Visvesvaraya Technological University Belagavi: House Price Prediction Using Machine Learning
No ratings yet
Visvesvaraya Technological University Belagavi: House Price Prediction Using Machine Learning
9 pages
Real_Estate_Price_Prediction_Using_a_Logistic_Regression_Model (1)
No ratings yet
Real_Estate_Price_Prediction_Using_a_Logistic_Regression_Model (1)
8 pages
Artificial Intelligence Approach For Modeling House Price Prediction
No ratings yet
Artificial Intelligence Approach For Modeling House Price Prediction
5 pages
Housepricepdf 2
No ratings yet
Housepricepdf 2
3 pages
Bi El
No ratings yet
Bi El
26 pages
Comparing Linear Regression and Decision Trees For Housing Price Prediction
No ratings yet
Comparing Linear Regression and Decision Trees For Housing Price Prediction
8 pages
IJIRCT2203007
No ratings yet
IJIRCT2203007
4 pages
Ijcse Icter P113
No ratings yet
Ijcse Icter P113
5 pages
Faisal Nadeem (SAP# 30601)
No ratings yet
Faisal Nadeem (SAP# 30601)
7 pages
Title Predicting House Pricing Using AIML (KASHISH)
No ratings yet
Title Predicting House Pricing Using AIML (KASHISH)
2 pages
ES205 Researchpaper
No ratings yet
ES205 Researchpaper
17 pages
Process Performance Models: Statistical, Probabilistic & Simulation
From Everand
Process Performance Models: Statistical, Probabilistic & Simulation
Vishnuvarthanan Moorthy
No ratings yet
Data-centric Living: Algorithms, Digitization and Regulation 1st Edition V. Sridhar - The latest ebook is available, download it today
100% (2)
Data-centric Living: Algorithms, Digitization and Regulation 1st Edition V. Sridhar - The latest ebook is available, download it today
74 pages
Final Project Report Crime Data 2
No ratings yet
Final Project Report Crime Data 2
38 pages
UP Police SI Syllabus in English PDF 2024
No ratings yet
UP Police SI Syllabus in English PDF 2024
3 pages
Non Tech Data Analytics Roadmap 1689017100
No ratings yet
Non Tech Data Analytics Roadmap 1689017100
10 pages
1675586852614
No ratings yet
1675586852614
24 pages
Graphs: Histogram, Pie Chart, Cubic Graph, Response Surface Plot, Counter Plot Graph
100% (1)
Graphs: Histogram, Pie Chart, Cubic Graph, Response Surface Plot, Counter Plot Graph
32 pages
Lunsford 3rd Grade Bar Graph Lesson Plan With Reflection
No ratings yet
Lunsford 3rd Grade Bar Graph Lesson Plan With Reflection
8 pages
Basic Data Storytelling Design Checklist TEMPLATE
No ratings yet
Basic Data Storytelling Design Checklist TEMPLATE
3 pages
neumayer-rossi-2016-15-years-of-protest-and-media-technologies-scholarship-a-sociotechnical-timeline
No ratings yet
neumayer-rossi-2016-15-years-of-protest-and-media-technologies-scholarship-a-sociotechnical-timeline
13 pages
23 PGProceedings
100% (2)
23 PGProceedings
493 pages
3rd Unit - DA
No ratings yet
3rd Unit - DA
20 pages
MSinDataScience 1667222803741
No ratings yet
MSinDataScience 1667222803741
47 pages
YashJakhar_report.docx
No ratings yet
YashJakhar_report.docx
20 pages
Mil 2ND Quarter Reviewer
No ratings yet
Mil 2ND Quarter Reviewer
17 pages
24CS3019-DATA ANALYTICS AND VISUALIZATION
No ratings yet
24CS3019-DATA ANALYTICS AND VISUALIZATION
2 pages
Unit2 PDS
No ratings yet
Unit2 PDS
17 pages
Pwc Power Bi
No ratings yet
Pwc Power Bi
18 pages
Kamioun
No ratings yet
Kamioun
24 pages
Artificial Intelligence IX Full Notes. new (Repaired)
No ratings yet
Artificial Intelligence IX Full Notes. new (Repaired)
48 pages
Business Requirements Document Template Eeee
No ratings yet
Business Requirements Document Template Eeee
11 pages
AI Project Cycle
No ratings yet
AI Project Cycle
25 pages
Codsoft Report
No ratings yet
Codsoft Report
26 pages
Email Dataset Analysis in Excel
No ratings yet
Email Dataset Analysis in Excel
4 pages
Module-1 MCQ of Data Analytics and Visualization
No ratings yet
Module-1 MCQ of Data Analytics and Visualization
6 pages
Projectip
No ratings yet
Projectip
14 pages
Week 2 Notes
No ratings yet
Week 2 Notes
11 pages
Storytelling_Structures_in_Data_Journali
No ratings yet
Storytelling_Structures_in_Data_Journali
6 pages
AR - New EA Playbook - 2020 Update
No ratings yet
AR - New EA Playbook - 2020 Update
19 pages
A-Z-Python using Gen AI 2025 Brochure_241226_000709
No ratings yet
A-Z-Python using Gen AI 2025 Brochure_241226_000709
9 pages

Oral Presentation

Uploaded by

Oral Presentation

Uploaded by

✓ House Price Prediction

✓ Aryan Rajesh, Yatheendra Reddy

✓ School of Computer Science and

You might also like