Supervised_Learning_Regression_Analysis (1)
Supervised_Learning_Regression_Analysis (1)
Analysis
Structured Explanation and
Presentation Pattern (SEPP)
Introduction to Supervised
Learning
• Supervised learning uses labeled data for
training.
• Regression is a supervised learning technique
for continuous variable prediction.
Linear Regression
• Predicting continuous values with a single
independent variable.
Linear Regression Formula
• Formula: Y = β0 + β1X + ε
• Y: Dependent variable (target).
• X: Independent variable (predictor).
• β0: Intercept, β1: Coefficient, ε: Error term.
Python Example: Linear Regression
• X = data[['Feature1', 'Feature2']]
y = data['Target']
model = LinearRegression()
model.fit(X, y)
Understanding Multicollinearity
• Multicollinearity occurs when predictors are
highly correlated.
Detecting Multicollinearity
• from statsmodels.stats.outliers_influence
import variance_inflation_factor
vif_data = pd.DataFrame()
vif_data['VIF'] =
[variance_inflation_factor(X.values, i) for i in
range(X.shape[1])]
vif_data['Feature'] = X.columns
Evaluating Regression Models
• Metrics to evaluate regression models include
R-squared, MSE, MAE, and RMSE.
Evaluation Metrics Formulas