0% found this document useful (0 votes)
15 views13 pages

ES205 Presentation

The document discusses building a machine learning model to predict house prices. It covers gathering and preprocessing data, training models like linear regression, decision trees and evaluating them. The best performing models are then deployed as an API for users.

Uploaded by

Bushra Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views13 pages

ES205 Presentation

The document discusses building a machine learning model to predict house prices. It covers gathering and preprocessing data, training models like linear regression, decision trees and evaluating them. The best performing models are then deployed as an API for users.

Uploaded by

Bushra Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 13

House Price Insight: Application of

Linear Algebra in Machine Learning


by

Umar Mushtaq 2022602

Bushra Khan 2022139

Zainab Bilal 2022635

Syed Hanan Shabir 2022565


Building a Machine
Learning Model to Predict
House Prices
Welcome to our presentation on how to build a machine learning model that
predicts house prices. Join us on this journey as we introduce you to the
important steps required in creating a highly effective and accurate model.
Data Gathering
• There are different types of data needed for this model such as:
location, square footage, number of bedrooms, bathrooms, year built,
and more.
Data Preprocessing

Handling Missing Data Feature Scaling and Normalization


There were missing values in some of the We scaled the features to the same range to
features, so we used different techniques such as prevent the model from being biased and
interpolation and imputation to handle them. normalized the features to make it easier for the
model to learn the data.
Model Training
1 Selection of Machine Learning Algorithms
We explored different machine learning algorithms such as linear regression,
decision trees, and random forests to select the best model based on performance,
accuracy, and interpretability.

2 Splitting the Dataset into Training and Testing Sets


We randomly split the dataset into training and testing sets to train and evaluate
the model respectively.

3 Validation of Model
We performed cross-validation to ensure the model learned the features and
performed consistently on new data. We monitored the model for overfitting and
underfitting and made adjustments as necessary.
Model Evaluation
• To evaluate the model's performance, we used the root mean square
error (RMSE) and the R-squared (R2) metrics.
• We analyzed the model's accuracy and precision by using K-fold cross-
validation and RMSE.
Model Deployment

Steps to Deploy the Model


We used joblib library to save the trained model and exported it as an API that can be used in the user's
application.
Linear Regression: The Linear Regression
model was trained on the dataset, yielding
the following results on the training set:

On Dataset 1: On Dataset 2:
Mean Squared Error (MSE): 2479644574.8373423 Mean Squared Error (MSE): 23.38

Root Mean Squared Error (RMSE): Root Mean Squared Error (RMSE): 4.835
49796.02970957968
R2 score: 0.7339
Cross-validation was performed to assess the
Cross-validation was performed to assess the
model's performance across different subsets of
model's performance across different subsets of
the data. The average cross validated RMSE is
the data. The average cross validated RMSE is
49799.52283937057.
68.80
2) Decision Tree Regressor: The Decision
Tree Regressor was trained and evaluated
on the training set with the following
outcomes:
On Dataset 1: On Dataset 2:
Mean Squared Error (MSE): 2479644574.8373423 Mean Squared Error (MSE): 2479644574.8373423

Root Mean Squared Error (RMSE): Root Mean Squared Error (RMSE):
49796.02970957968 49796.02970957968

Cross-validation was performed to assess the Cross-validation was performed to assess the
model's performance across different subsets of model's performance across different subsets of
the data. The average cross validated RMSE is the data. The average cross validated RMSE is
49799.52283937057. 49799.52283937057.
3) Gradient Boosting Regressor and
Random Forest Regressor: Briefly
summarizing the performance of these
models:
Gradient Boosting Regressor: 2452926498.283182, 49527.027957300066

Random Forest Regressor: 398118763.1372196, 19952.91365032234


Results:

• Linear Regression Model

• Decision Tree Regressor Model

• Gradient Boost Model

• Random Forest Regressor Model


Summary
• Building a machine learning model that predicts house prices is a complex process that requires
careful attention to detail.
• Our model is highly effective and accurate, providing users with reliable predictions that can help
them make informed real estate decisions.
• The model can be integrated into a web application and is highly accessible to anyone looking to use it.
• We look forward to exploring new possibilities and developments in this field that improve the
accuracy and speed of house price prediction.
THANK YOU!

You might also like