ML_assignment_lab_7
Prediction
Course: AI211/CS503 Machine Learning
Instructor: Dr. Santosh Kumar Vipparthi
Due Date: 01-04-2025
1 Problem Scenario
You are a data scientist at BharatHomes, a real estate analytics firm focusing
on urban Indian markets. The dataset includes 79 features describing
residential properties across Mumbai, Delhi, Bangalore, and Chennai.
2 Mathematical Formulation
Let the dataset be:
• Features: X ∈ R^{n×79} (e.g., GrLivArea, OverallQual, Neighborhood)
• Targets:
– Regression: y_r ∈ R^n (SalePrice)
– Classification: y_c ∈ {0, 1}^n (Premium and Standard)
2. Classification: Cross-Entropy
L_c = -\frac{1}{n} \sum_{i=1}^{n} y_c^{(i)} \log g(x^{(i)})    (2)
3. Joint Loss
L(\theta) = \alpha L_r + \beta L_c + \lambda \lVert \theta \rVert_2^2    (3)
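As a minimal sketch of how Eq. (2) and Eq. (3) could be evaluated numerically, the Python below computes the classification loss exactly as written in Eq. (2) (only the y log g term; the usual binary cross-entropy also adds a (1 − y) log(1 − g) term) and combines it with a regression loss and the squared L2 penalty. The form of the regression loss is not shown in this section, so mean squared error is assumed here. The function names, the eps clipping constant, and the sample numbers are illustrative, not part of the assignment.

```python
import math


def mse(y_true, y_pred):
    """ASSUMED form of L_r: mean squared error (not given in this section)."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def cross_entropy(y_c, probs, eps=1e-12):
    """L_c as written in Eq. (2): -(1/n) * sum_i y_c[i] * log(g(x[i])).

    probs[i] stands for g(x[i]); eps (an assumption) keeps log finite.
    """
    total = sum(y * math.log(max(p, eps)) for y, p in zip(y_c, probs))
    return -total / len(y_c)


def joint_loss(theta, l_r, l_c, alpha=1.0, beta=1.0, lam=0.01):
    """Eq. (3): alpha * L_r + beta * L_c + lam * ||theta||_2^2."""
    l2_penalty = sum(t * t for t in theta)  # squared L2 norm of theta
    return alpha * l_r + beta * l_c + lam * l2_penalty


# Toy numbers, purely for illustration.
theta = [0.5, -1.0, 2.0]
l_r = mse([3.0, 5.0], [2.5, 5.5])
l_c = cross_entropy([1, 0], [0.8, 0.3])
total = joint_loss(theta, l_r, l_c, alpha=1.0, beta=2.0, lam=0.1)
print(l_r, l_c, total)
```

Keeping the three terms separate makes it straightforward to vary α, β, and λ and see how each one shifts the balance between the two tasks and the penalty.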
3 Optimization
3.1 Key Analysis
1. Convexity:
• Prove whether L(θ) is convex under linear models for both tasks.
• Compare L1 (sparse feature selection) vs. L2 (smooth feature weighting).
2. Data Understanding:
3. Model Selection:
• Define the hypothesis space for both regression and classification tasks.
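To make the L1 vs. L2 comparison under Convexity concrete, the following sketch uses the one-dimensional closed-form minimizers of a quadratic loss with each penalty. It assumes every weight can be optimized independently (as with orthonormal features), which is a simplification rather than the assignment's actual setting; the function names and numbers are hypothetical.

```python
def l1_shrink(z, lam):
    """Minimizer of 0.5*(w - z)**2 + lam*abs(w): soft-thresholding."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0  # small coefficients are set exactly to zero


def l2_shrink(z, lam):
    """Minimizer of 0.5*(w - z)**2 + lam*w**2: proportional shrinkage."""
    return z / (1.0 + 2.0 * lam)


# Hypothetical unregularized weights for four features.
unregularized = [3.0, -0.4, 0.2, -2.5]
lam = 0.5

l1_weights = [l1_shrink(z, lam) for z in unregularized]
l2_weights = [l2_shrink(z, lam) for z in unregularized]
print("L1:", l1_weights)
print("L2:", l2_weights)
```

Under these assumptions, L1 zeroes out the two small coefficients while L2 scales every coefficient by the same factor and leaves none at exactly zero.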
6 Submission Guidelines
1. Report (12 pages) should include: