0% found this document useful (0 votes)
4 views

De Assignment 3

The objective is to develop a machine learning model to predict house prices using features such as location and size. The process includes data preparation, outlier removal, and feature engineering, followed by training and evaluating models like Linear Regression, Decision Tree, and Random Forest. The final goal is to identify the best-performing model based on prediction accuracy and upload the cleaned dataset and code to the drive.

Uploaded by

Vicky
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

De Assignment 3

The objective is to develop a machine learning model to predict house prices using features such as location and size. The process includes data preparation, outlier removal, and feature engineering, followed by training and evaluating models like Linear Regression, Decision Tree, and Random Forest. The final goal is to identify the best-performing model based on prediction accuracy and upload the cleaned dataset and code to the drive.

Uploaded by

Vicky
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Objective:

To build a machine learning model that can accurately predict house prices based on various
property features such as location, size, number of bathrooms, and total square footage. You
are expected to perform end-to-end data preprocessing, outlier removal, feature engineering,
and train multiple machine learning algorithms. The goal is to identify the best-performing model
through evaluation and comparison of prediction accuracy.

Tasks to be done:

1.Data Understanding & Preparation


●​ Load the dataset and explore its structure.
●​ Convert data types where needed.

2. Transform Data (Data Cleaning)


Handling outliers
●​ Detect, visualise and remove the outliers,
●​ Eg: having more bathrooms than bedrooms.
●​ Eg. Extreme high price and lower price using outlier formula etc.,

Cleaning
●​ Fill rows with missing or null values in key features (e.g., bath, location, size).
●​ Remove unwanted columns for better results
●​ In the case of range values, take the average of it.
●​ Use one hot encoding if applicable.

3. Building and Training the model & evaluating the results


●​ Split the dataset into training and testing sets
●​ Train these three algorithms - Linear Regression, Decision Tree Regressor, Random
Forest Regressor.
●​ Evaluate the model using R-square score, MAE and RMSE
●​ Compare the results of three models and find which is accurate.
●​ Upload the cleaned dataset and python workbook in the drive.

Data Source Link: ASSIGNMENT 3

You might also like