Bias and Variance
Minati Rath
Example: Linear regression (housing prices)
[Figure: price vs. size of house, fitted with a linear function, a quadratic function, and a higher-order function]
Bias vs. variance in linear regression
[Figure: price vs. size under three fits: high bias (underfitting), "just right", and high variance (overfitting)]
Overfitting
If we have too many features, the learned hypothesis may fit the
training set very well
but fail to generalize to new examples.
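As a quick illustration (a minimal sketch on synthetic data; all values here are hypothetical), a high-order polynomial can drive the training error to nearly zero while generalizing poorly:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (hypothetical): a quadratic target plus noise
x_train = np.linspace(-1, 1, 10)
y_train = x_train**2 + rng.normal(0, 0.05, size=10)
x_test = np.linspace(-0.9, 0.9, 10)
y_test = x_test**2 + rng.normal(0, 0.05, size=10)

def errors(degree):
    # Fit a polynomial of the given degree on the training set
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    return train_mse, test_mse

tr2, te2 = errors(2)  # quadratic: matches the target's complexity
tr9, te9 = errors(9)  # degree 9: interpolates all 10 training points
# tr9 is (near) zero, yet te9 is typically much worse than te2
```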
Bias vs. variance in logistic regression
Example: Logistic regression
Sources of noise and error
While learning a target function using a training set
Two sources of noise
Some training points may not come exactly from the target
function: stochastic noise
The target function may be too complex to capture using the
chosen hypothesis set: deterministic noise
Generalization error: Model tries to fit the noise in the training data,
which gets extrapolated to the test set
Ways to handle noise
Validation
Check performance on data other than training data, and tune model
accordingly
Regularization
Constrain the model so that the noise cannot be learned too well
Validation
Divide given data into train set and test set
E.g., 80% train and 20% test
Better to select randomly
Learn parameters using training set
Check performance (validate the model) on test set, using
measures such as accuracy, misclassification rate, etc.
Trade-off: more data for training vs. validation
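The split itself can be sketched as follows (synthetic data; the 80/20 proportions follow the example above, everything else is hypothetical):

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical dataset: 100 examples with 3 features, binary labels
X = rng.normal(size=(100, 3))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# Select the split randomly: shuffle indices, take 80% train / 20% test
idx = rng.permutation(len(X))
n_train = int(0.8 * len(X))
train_idx, test_idx = idx[:n_train], idx[n_train:]

X_train, y_train = X[train_idx], y[train_idx]   # learn parameters here
X_test, y_test = X[test_idx], y[test_idx]       # validate the model here
```

Accuracy or misclassification rate would then be measured on `(X_test, y_test)` only.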
An example: model selection
• Which order of polynomial will best fit the given data? Polynomials
available: h1, h2, …, h10
• It is as if an extra parameter, the degree of the polynomial, is to be
learned
• Approach 1
– Divide into train and test set
– Train each hypothesis on train set, measure error on test set
– Select the hypothesis with minimum test set error
• Problem with the previous approach
– The test set error we computed is not a true estimate of
generalization error
– Since our extra parameter (order of polynomial) is fit to the test
set
Approach 2
– Divide data into train set (60%), validation set
(20%) and test set (20%)
– Select that hypothesis which gives lowest error on
validation set
– Use test set to estimate generalization error
Note: Test set not at all seen during training
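A sketch of Approach 2 on synthetic data (the 60/20/20 split and the candidate degrees h1, …, h10 come from the text; the data itself is hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical data: a cubic target with noise
x = rng.uniform(-1, 1, 200)
y = x**3 - x + rng.normal(0, 0.1, 200)

# 60% train / 20% validation / 20% test
idx = rng.permutation(200)
tr, va, te = idx[:120], idx[120:160], idx[160:]

def validation_mse(degree):
    # Each candidate hypothesis is trained on the train set only
    coeffs = np.polyfit(x[tr], y[tr], degree)
    return np.mean((np.polyval(coeffs, x[va]) - y[va]) ** 2)

# Select the hypothesis (degree) with the lowest validation error
best_degree = min(range(1, 11), key=validation_mse)

# The untouched test set gives the estimate of generalization error
coeffs = np.polyfit(x[tr], y[tr], best_degree)
test_mse = np.mean((np.polyval(coeffs, x[te]) - y[te]) ** 2)
```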
Popular methods of evaluating a classifier
• Holdout method
– Split data into train and test set (usually 2/3 for train and 1/3 for
test). Learn model using train set and measure performance
over test set
– Usually used when the data is sufficiently large, since both the
train and the test set must each get a sizeable share of it
• Repeated Holdout method
– Repeat the Holdout method multiple times with different
subsets used for train/test
– In each iteration, a certain portion of data is randomly selected
for training, rest for testing
– The error rates on the different iterations are averaged to yield
an overall error rate
– More reliable than simple Holdout
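A minimal sketch of repeated holdout (synthetic data; the "classifier" here is a deliberately trivial majority-class predictor, used only as a placeholder):

```python
import numpy as np

rng = np.random.default_rng(7)

# Hypothetical dataset
X = rng.normal(size=(150, 2))
y = (X[:, 0] > 0).astype(int)

def one_holdout(rng):
    # One iteration: randomly select 2/3 for training, the rest for testing
    idx = rng.permutation(len(X))
    n_train = len(X) * 2 // 3
    train, test = idx[:n_train], idx[n_train:]
    # Placeholder "classifier": always predict the training majority class
    majority = int(y[train].mean() >= 0.5)
    return float(np.mean(y[test] != majority))

# Average the error rates over the iterations for an overall error rate
error_rates = [one_holdout(rng) for _ in range(10)]
overall_error = float(np.mean(error_rates))
```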
• k-fold cross-validation
– First step: data is split into k subsets of equal size;
– Second step: each subset in turn is used for testing and the
remainder for training
– Performance measures averaged over all folds
Popular choices for k: 10 or 5
Advantage: every available data point is used both to train and to
test the model
k-fold cross-validation (shown for k = 3): the data is split into three
parts, and the classifier is trained and tested three times:
Fold 1: train | train | test
Fold 2: train | test | train
Fold 3: test | train | train
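The two steps above can be sketched as follows (synthetic data; the nearest-centroid "classifier" is a hypothetical stand-in):

```python
import numpy as np

rng = np.random.default_rng(3)

# Hypothetical dataset: 90 examples, 2 features
X = rng.normal(size=(90, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# First step: split the data into k subsets of equal size
k = 3
folds = np.array_split(rng.permutation(len(X)), k)

# Second step: each subset in turn is used for testing
error_rates = []
for i in range(k):
    test_idx = folds[i]
    train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
    # Stand-in classifier: assign each test point to the nearest class centroid
    c0 = X[train_idx][y[train_idx] == 0].mean(axis=0)
    c1 = X[train_idx][y[train_idx] == 1].mean(axis=0)
    pred = (np.linalg.norm(X[test_idx] - c1, axis=1)
            < np.linalg.norm(X[test_idx] - c0, axis=1)).astype(int)
    error_rates.append(float(np.mean(pred != y[test_idx])))

# Performance averaged over all folds
cv_error = float(np.mean(error_rates))
```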
Regularization
Addressing overfitting: Two ways
1. Reduce number of features
— Manually select which features to keep
— Problem: loss of some information (discarded features)
2. Regularization
— Keep all the features, but reduce magnitude/values of parameters
— Works well when we have a lot of features, each of which contributes a
little to the prediction
Intuition of regularization
[Figure: price vs. size of house, fitted with a quadratic function and
with a higher-order function]
Suppose we penalize the higher-order parameters (e.g., θ3 and θ4) and
make them really small: the higher-order fit then behaves almost like
the quadratic one
Combatting Overfitting
➢ Problem of overfitting can be overcome by increasing the input
training data points
➢ A rule of thumb: the number of training data points should be at
least 10 times the number of parameters or features
➢ But what if we have fewer data points?
➢ Put a bound on the regression coefficients by using regularization
Regularization for linear regression
In regularized linear regression, we choose θ to minimize

J(θ) = (1/2m) [ Σ_{i=1..m} (h_θ(x^(i)) − y^(i))² + λ Σ_{j=1..n} θ_j² ]

λ: Regularization parameter
By convention, regularization is not applied to θ0 (this makes little
difference to the solution)
Smaller values of the parameters lead to more generalizable models and
less overfitting
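A sketch of the closed-form ridge solution on synthetic data (the data and λ values are hypothetical; note the identity-matrix entry for θ0 is zeroed so the intercept is not regularized, per the convention above):

```python
import numpy as np

rng = np.random.default_rng(5)

# Hypothetical data: 30 examples, 8 features, only 2 informative
X = rng.normal(size=(30, 8))
y = 2 * X[:, 0] - X[:, 1] + rng.normal(0, 0.1, 30)

def ridge_fit(X, y, lam):
    # Closed form: theta = (X'X + lam*I)^(-1) X'y,
    # with I[0, 0] = 0 so theta_0 (the intercept) is not penalized
    Xb = np.hstack([np.ones((len(X), 1)), X])  # prepend intercept column
    I = np.eye(Xb.shape[1])
    I[0, 0] = 0.0
    return np.linalg.solve(Xb.T @ Xb + lam * I, Xb.T @ y)

theta_ols = ridge_fit(X, y, 0.0)     # lam = 0 recovers ordinary least squares
theta_ridge = ridge_fit(X, y, 10.0)  # larger lam shrinks the parameters
# The regularized parameters have a smaller norm than the OLS ones
```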
L1, L2 and Elastic net Regularization
What we have been discussing is called L2 or "ridge" regularization – it
adds the squared magnitude of the parameters as the penalty term
Look up L1 or "Lasso" regularization
– it adds the absolute value of the parameters as the penalty term
Elastic Net (Combination of L1 and L2 Regularization)
Effect: Combines the benefits of both Ridge and Lasso. It allows
for some coefficients to be set to zero (like Lasso) while shrinking
others (like Ridge). It is useful when there is multicollinearity, and
some feature selection is needed
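To see Lasso's feature-selection effect concretely, here is a minimal coordinate-descent sketch of L1-regularized least squares on synthetic data (the data, λ, and iteration count are all hypothetical choices):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: 10 features, but only feature 0 is informative
X = rng.normal(size=(100, 10))
y = 3 * X[:, 0] + rng.normal(0, 0.1, 100)

def lasso_cd(X, y, lam, n_iter=200):
    # Coordinate descent for: minimize (1/2)||y - Xw||^2 + lam * ||w||_1
    n, d = X.shape
    w = np.zeros(d)
    col_sq = (X ** 2).sum(axis=0)
    for _ in range(n_iter):
        for j in range(d):
            # Residual with feature j's contribution removed
            r = y - X @ w + X[:, j] * w[j]
            rho = X[:, j] @ r
            # Soft-thresholding: weak correlations become exactly zero
            w[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
    return w

w = lasso_cd(X, y, lam=30.0)
n_zero = int(np.sum(w == 0))  # most irrelevant coefficients are zeroed
```

Ridge, by contrast, shrinks these coefficients but almost never makes them exactly zero; Elastic Net adds an extra squared penalty on top of the L1 term, combining both behaviors.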