0% found this document useful (0 votes)

0 views

ML_assignment_lab_7

The document outlines a multi-task learning project for predicting real estate prices in urban Indian markets, focusing on regression and classification tasks. It details the dataset, key features, mathematical formulations, optimization strategies, and the rationale for multi-task learning. The submission guidelines include a comprehensive report and Python code implementation with hyperparameter tuning.

Uploaded by

2022mcb1318

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

0 views

ML_assignment_lab_7

Uploaded by

2022mcb1318

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Multi-Task Learning for Real Estate

Prediction
Course: AI211/ CS503 Machine Learning
Instructor: Dr. Santosh Kumar Vipparthi
Due Date: 01-04-2025

1 Problem Scenario
You are a data scientist at BharatHomes, a real estate analytics firm focus-
ing on urban Indian markets. The dataset includes 79 features describing
residential properties across Mumbai, Delhi, Bangalore, and Chennai.

1.1 Key Features in the Indian Context

Make sure when you analyze the values for the further regularization and
optimization of your own choices. All the feature values must be in the
euclidean space to obtain your choices for the considered regularization and
optimization.
Feature Indian Contextualization
SalePrice Price in INR (lakhs/crores)
Neighborhood Mumbai: Bandra, Andheri; Delhi: Gurgaon, Noida
Condition1 Proximity to metro stations (e.g., Delhi Metro)
Utilities Water supply consistency (24/7 vs. tanker-dependent)
LandContour Flood-prone zones (e.g., Chennai’s coastal areas)
OverallQual Builder reputation (e.g., Tata, DLF)
GarageQual Security features (gated community vs. standalone)
PoolQC Presence of clubhouse amenities
Objective:
1. Regression: Predict SalePrice (INR).
2. Classification: Classify properties as ”Premium” (top 20% prices)
or ”Standard”.

1
2 Mathematical Formulation
Let the dataset be:
• Features: X ∈ Rn×79 (e.g., GrLivArea, OverallQual, Neighborhood)

• Targets:

– Regression: yr ∈ Rn (SalePrice)
– Classification: yc ∈ {0, 1}n (Premium and Standard)

2.1 Loss Functions

1. Regression: Mean Squared Error (MSE)
n
1 X (i) 2
Lr = yr − f (x(i) ) (1)
n i=1

2. Classification: Cross-Entropy
n
1 X (i)
Lc = − y log g(x(i) ) (2)
n i=1 c

3. Joint Loss
L(θ) = αLr + βLc + λ∥θ∥22 (3)

3 Optimization
3.1 Key Analysis
1. Convexity:
• Prove whether L(θ) is convex under linear models for both tasks.

• Discuss the implications of non-convexity in deep neural networks.

2. Hessian Conditioning:
• Let Hr = ∇2 Lr and Hc = ∇2 Lc . Derive the condition number of the
joint Hessian H = αHr + βHc .

• Show how κ(H) affects optimization convergence.

3. Regularization:

2
• Compare L1 (sparse feature selection) vs. L2 (smooth feature weight-
ing).

• Justify the choice of regularization strength λ using the bias-variance

trade-off.

4 Multi-Task Learning Model

1. Problem Formulation:

• Why is multi-task learning suitable for this problem?

• How does feature sharing between tasks improve model perfor-
mance?

2. Data Understanding:

• Compute the correlation between OverallQual and SalePrice in

Delhi.
• Handle missing values in YearBuilt using imputation techniques.

3. Model Selection:

• Justify using linear regression + logistic regression vs. decision

trees.
• Discuss the role of feature engineering in improving model accu-
racy.

5 Data and Statistical Understanding

• Predict the 25th, 50th, and 75th quantiles of SalePrice in Mumbai.

• Analyze the distribution of SalePrice (e.g., normality, skewness).

• Define the hypothesis space for both regression and classification tasks.

• Explain how the choice of hypothesis space affects model performance.

• Modify hyperparameters (α, β, λ) based on quantile analysis.

3
6 Submission Guidelines
1. Report (12 pages) should include:

• Proofs of convexity and Hessian bounds.

• Feature importance plots for OverallQual and YearBuilt.

• Regularization path analysis for λ.

• Quantile predictions and data distribution analysis.

• Hypothesis space definition and its impact on performance.

• Hyperparameter tuning results and justification.

2. Code: Python implementation with adaptive hyperparameter tuning

as mentioned with stepwise instructions in the sample report.

Predicting House Prices Using Machine Learning
No ratings yet
Predicting House Prices Using Machine Learning
6 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
Housepriceprediction ML 221104055342 Fb5109ae
No ratings yet
Housepriceprediction ML 221104055342 Fb5109ae
17 pages
Real Estate Price Prediction Model
No ratings yet
Real Estate Price Prediction Model
3 pages
Lec3 4 ML Project
No ratings yet
Lec3 4 ML Project
26 pages
Lab 1. Boston House
No ratings yet
Lab 1. Boston House
7 pages
1_Lab Manual (ML)
No ratings yet
1_Lab Manual (ML)
42 pages
ML Lecture # 04 Multiple Regression
No ratings yet
ML Lecture # 04 Multiple Regression
29 pages
6579871e0b6cbsemester Project
No ratings yet
6579871e0b6cbsemester Project
2 pages
ML-ASSN-1
No ratings yet
ML-ASSN-1
4 pages
Main
No ratings yet
Main
35 pages
House Price Prediction Using Linear Regression in ML
No ratings yet
House Price Prediction Using Linear Regression in ML
9 pages
Assignment 1
100% (1)
Assignment 1
3 pages
New Opendocument Text
No ratings yet
New Opendocument Text
7 pages
dl lab prog 2
No ratings yet
dl lab prog 2
2 pages
ml project clg (2)
No ratings yet
ml project clg (2)
62 pages
Assignment 1 - Regression
No ratings yet
Assignment 1 - Regression
1 page
MY PRO DAY 9 Copy
No ratings yet
MY PRO DAY 9 Copy
59 pages
Project
No ratings yet
Project
10 pages
ml project part a 1
No ratings yet
ml project part a 1
6 pages
day 5
No ratings yet
day 5
2 pages
Oral Presentation
No ratings yet
Oral Presentation
9 pages
Final Defence
No ratings yet
Final Defence
55 pages
Bi El
No ratings yet
Bi El
26 pages
A14 Abstract
No ratings yet
A14 Abstract
2 pages
Vertopal.com C1 W2 Lab02 Multiple Variable Soln
No ratings yet
Vertopal.com C1 W2 Lab02 Multiple Variable Soln
11 pages
House Price Prediction 1
No ratings yet
House Price Prediction 1
27 pages
Machine Learning For Data Science
No ratings yet
Machine Learning For Data Science
2 pages
Main
No ratings yet
Main
35 pages
Project - Synopsis - Format (1) (1) (1) Copy 2
No ratings yet
Project - Synopsis - Format (1) (1) (1) Copy 2
33 pages
UtkarshGupta (House Price Prediction)
No ratings yet
UtkarshGupta (House Price Prediction)
14 pages
FinalProject STAT4444
No ratings yet
FinalProject STAT4444
11 pages
Data Science Assignment Chapter 1
No ratings yet
Data Science Assignment Chapter 1
5 pages
AIML
No ratings yet
AIML
5 pages
Project Immo en
No ratings yet
Project Immo en
11 pages
Price Prediction
No ratings yet
Price Prediction
16 pages
NN - CCP
No ratings yet
NN - CCP
10 pages
Assignment 1 (1)
No ratings yet
Assignment 1 (1)
4 pages
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet
Regression: Introduction: Basic Idea: Use Data To Identify Among Variables and Use These Relationships To Make
No ratings yet
Regression: Introduction: Basic Idea: Use Data To Identify Among Variables and Use These Relationships To Make
23 pages
Shub Neet Dt
No ratings yet
Shub Neet Dt
12 pages
Prediction of House Rent Using Multiple Linear Regression
No ratings yet
Prediction of House Rent Using Multiple Linear Regression
20 pages
Real Estate Price Prediction
No ratings yet
Real Estate Price Prediction
7 pages
3
No ratings yet
3
14 pages
AIML Lab Assignment-1 Set 2
No ratings yet
AIML Lab Assignment-1 Set 2
2 pages
Vasanth Sample 2
No ratings yet
Vasanth Sample 2
30 pages
LR 1
No ratings yet
LR 1
35 pages
Real Estate Price Prediction With Regression and Classification
No ratings yet
Real Estate Price Prediction With Regression and Classification
5 pages
Final Lab Manual
No ratings yet
Final Lab Manual
34 pages
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
No ratings yet
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
7 pages
House
No ratings yet
House
58 pages
Final Page Setup 29 Gatya Done All
No ratings yet
Final Page Setup 29 Gatya Done All
45 pages
Machine Learning Project Presentation
No ratings yet
Machine Learning Project Presentation
14 pages
Yug Removed
No ratings yet
Yug Removed
29 pages
House Pricing
No ratings yet
House Pricing
15 pages
House Price Prediction Using Machine Learning: © MAY 2021 - IRE Journals - Volume 4 Issue 11 - ISSN: 2456-8880
No ratings yet
House Price Prediction Using Machine Learning: © MAY 2021 - IRE Journals - Volume 4 Issue 11 - ISSN: 2456-8880
5 pages
House price predictor ppt Project
No ratings yet
House price predictor ppt Project
13 pages
Phase 5
No ratings yet
Phase 5
5 pages
Linear and Nonlinear Programming Essentials
From Everand
Linear and Nonlinear Programming Essentials
Tanushri Kaniyar
No ratings yet
Complex Analysis: Advanced Concepts
From Everand
Complex Analysis: Advanced Concepts
Shashank Tiwari
No ratings yet
DTFT Properties
No ratings yet
DTFT Properties
39 pages
EDUREKHA Data Science and ML Internship Program V2 - Program Brochure
No ratings yet
EDUREKHA Data Science and ML Internship Program V2 - Program Brochure
60 pages
Mcgraw Hill/Irwin
No ratings yet
Mcgraw Hill/Irwin
12 pages
A Prlmal Algorithm For Interval Linear-Programming Problems
No ratings yet
A Prlmal Algorithm For Interval Linear-Programming Problems
14 pages
DDA & BRESENHAMS, Circle (Mid Point Approch)
No ratings yet
DDA & BRESENHAMS, Circle (Mid Point Approch)
36 pages
Novel Soliton Solutions of Two-Mode Sawada-Kotera Equation and Its Applications
No ratings yet
Novel Soliton Solutions of Two-Mode Sawada-Kotera Equation and Its Applications
14 pages
Path of Steepest Ascent, Descent
No ratings yet
Path of Steepest Ascent, Descent
7 pages
Binomial Trees: Practice Questions
100% (1)
Binomial Trees: Practice Questions
3 pages
Control Systems Two Marks Question
No ratings yet
Control Systems Two Marks Question
41 pages
Machine Learning Notes 1
No ratings yet
Machine Learning Notes 1
120 pages
Syllabus Sem-V
No ratings yet
Syllabus Sem-V
30 pages
Intro To Machine Learning Nanodegree Program Syllabus
No ratings yet
Intro To Machine Learning Nanodegree Program Syllabus
14 pages
Bit Cipher 1 Example of Bit Cipher 2 Practical Stream Cipher 3
No ratings yet
Bit Cipher 1 Example of Bit Cipher 2 Practical Stream Cipher 3
13 pages
NAME:K.Harshavardhan Reg no:11BEC1074
No ratings yet
NAME:K.Harshavardhan Reg no:11BEC1074
13 pages
YZV231E Hw2
No ratings yet
YZV231E Hw2
4 pages
Snake and Ladder Program
No ratings yet
Snake and Ladder Program
2 pages
Assosa University Adc Assignment
No ratings yet
Assosa University Adc Assignment
9 pages
3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training
No ratings yet
3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training
13 pages
Dipmaths 2018 Scheme 4 Sem Model Paper 2
No ratings yet
Dipmaths 2018 Scheme 4 Sem Model Paper 2
3 pages
CNS Unit 4
No ratings yet
CNS Unit 4
11 pages
Course Outline
No ratings yet
Course Outline
3 pages
Classical and Quantum Dynamics From Classical Paths to Path Integrals Fourth Edition Dittrich all chapter instant download
No ratings yet
Classical and Quantum Dynamics From Classical Paths to Path Integrals Fourth Edition Dittrich all chapter instant download
55 pages
Year Return On The Stock Apple Computers (%) Return Market Portfolio (%)
No ratings yet
Year Return On The Stock Apple Computers (%) Return Market Portfolio (%)
9 pages
(Partially Observable) Markov Decision Processes: Frederike Petzschner & Lionel Rigoux
No ratings yet
(Partially Observable) Markov Decision Processes: Frederike Petzschner & Lionel Rigoux
19 pages
Minimum Weight Triangulation
No ratings yet
Minimum Weight Triangulation
3 pages
Session 2
No ratings yet
Session 2
39 pages
VTU Provisional Results Sheet-3.pdf - 65
No ratings yet
VTU Provisional Results Sheet-3.pdf - 65
1 page
Chap 05 LP Models Graphical and Computer Methods Soan
No ratings yet
Chap 05 LP Models Graphical and Computer Methods Soan
50 pages
Application of Pythagorean Fuzzy Set in MCDM: Presented by Under The Guidance
No ratings yet
Application of Pythagorean Fuzzy Set in MCDM: Presented by Under The Guidance
20 pages
Solving The N-Queens Problem Using A Tuned Hybrid Imperialist Competitive Algorithm
No ratings yet
Solving The N-Queens Problem Using A Tuned Hybrid Imperialist Competitive Algorithm
11 pages

ML_assignment_lab_7

Uploaded by

ML_assignment_lab_7

Uploaded by

Multi-Task Learning for Real Estate

1.1 Key Features in the Indian Context

2.1 Loss Functions

• Discuss the implications of non-convexity in deep neural networks.

• Show how κ(H) affects optimization convergence.

• Justify the choice of regularization strength λ using the bias-variance

4 Multi-Task Learning Model

• Why is multi-task learning suitable for this problem?

• Compute the correlation between OverallQual and SalePrice in

• Justify using linear regression + logistic regression vs. decision

5 Data and Statistical Understanding

• Analyze the distribution of SalePrice (e.g., normality, skewness).

• Explain how the choice of hypothesis space affects model performance.

• Modify hyperparameters (α, β, λ) based on quantile analysis.

• Proofs of convexity and Hessian bounds.

• Feature importance plots for OverallQual and YearBuilt.

• Regularization path analysis for λ.

• Quantile predictions and data distribution analysis.

• Hypothesis space definition and its impact on performance.

• Hyperparameter tuning results and justification.

2. Code: Python implementation with adaptive hyperparameter tuning

You might also like