Data_Science_Project_Report
Data_Science_Project_Report
1. Introduction
This project focuses on predicting house prices using Machine Learning algorithms. The real estate
market has various factors that influence property prices. By leveraging historical data, we can build
a model that predicts prices accurately based on features like location, size, number of rooms, and
more.
2. Objective
The main objective of this project is to develop a predictive model for house prices using machine
learning techniques to assist buyers, sellers, and investors in making informed decisions.
- Python
- Pandas, NumPy
- Scikit-learn
- Matplotlib, Seaborn
Major Project Report
- Jupyter Notebook
4. Dataset Description
The dataset used is the 'House Prices: Advanced Regression Techniques' from Kaggle. It includes
79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa.
5. Methodology
6. Hyperparameter Tuning
After evaluating different models, XGBoost performed the best with an RMSE score of 0.12
(normalized). The model was able to predict house prices with high accuracy.
7. Conclusion
Major Project Report
Machine learning provides a powerful approach to predicting house prices. This project
demonstrated how various algorithms can be applied to real-world datasets to yield actionable
insights.
8. References
- Scikit-learn Documentation
- Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron