Lecture 3-4
Gradient descent in
practice I: Feature Scaling
Machine Learning
Feature Scaling
Idea: Make sure features are on a similar scale.
E.g. x_1 = size (0–2000 feet²)
x_2 = number of bedrooms (1–5)
Andrew Ng
Feature Scaling
Get every feature into approximately a −1 ≤ x_i ≤ 1 range.
Mean normalization
Replace x_i with x_i − μ_i to make features have approximately zero mean
(Do not apply to x_0 = 1).
E.g. x_1 = (size − 1000) / 2000, x_2 = (#bedrooms − 2) / 5.
More generally: x_i := (x_i − μ_i) / s_i, where μ_i is the average value of x_i in the training set and s_i is the range (max − min) or the standard deviation.
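Mean normalization can be sketched in numpy as follows; the dataset (sizes and bedroom counts) is illustrative and the range is used for s_i:

```python
import numpy as np

def mean_normalize(X):
    """Mean normalization: replace each feature x_i with (x_i - mu_i) / s_i,
    where mu_i is the feature's mean and s_i its range (max - min)."""
    mu = X.mean(axis=0)
    s = X.max(axis=0) - X.min(axis=0)   # the std. deviation also works
    return (X - mu) / s, mu, s

# Illustrative data: size in feet^2 and number of bedrooms
X = np.array([[2104.0, 5.0],
              [1416.0, 3.0],
              [1534.0, 3.0],
              [ 852.0, 2.0]])
X_norm, mu, s = mean_normalize(X)
# Each column of X_norm now has zero mean and a range of 1
```

Keeping mu and s around matters: the same shift and scale must be applied to any new example before predicting.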
Gradient descent in
practice II: Learning rate
Gradient descent update rule: θ_j := θ_j − α · (∂/∂θ_j) J(θ), with learning rate α.
Making sure gradient descent is working correctly.
[Plot: J(θ) vs. number of iterations (0–400); J(θ) should decrease after every iteration.]
Example automatic convergence test:
Declare convergence if J(θ) decreases by less than 10⁻³ in one iteration.
Making sure gradient descent is working correctly.
[Plot: J(θ) increasing with the number of iterations — gradient descent is not working; use a smaller α.]
• If α is too small: convergence is slow.
• If α is too large: J(θ) may not decrease on every iteration; it may not converge.
To choose α, try values roughly 3× apart: …, 0.001, 0.003, 0.01, 0.03, 0.1, 0.3, 1, …
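The convergence test above can be sketched as follows; the tiny dataset (y = 1 + 2x exactly) and the α sweep values are illustrative:

```python
import numpy as np

def gradient_descent(X, y, alpha, epsilon=1e-3, max_iters=10000):
    """Gradient descent that records J(theta) every iteration and declares
    convergence when J decreases by less than epsilon in one iteration."""
    m = len(y)
    theta = np.zeros(X.shape[1])
    J_history = []
    for _ in range(max_iters):
        err = X @ theta - y
        J_history.append((err @ err) / (2 * m))          # J(theta)
        if len(J_history) > 1 and J_history[-2] - J_history[-1] < epsilon:
            break                                         # converged
        theta -= (alpha / m) * (X.T @ err)                # simultaneous update
    return theta, J_history

# Illustrative data: x0 = 1 column included, y = 1 + 2x exactly
X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]])
y = np.array([1.0, 3.0, 5.0])

# Try alpha values roughly 3x apart and inspect J_history for each
for alpha in (0.01, 0.03, 0.1, 0.3):
    theta, J_hist = gradient_descent(X, y, alpha)
```

Plotting `J_hist` against the iteration index reproduces the diagnostic plot described above: a healthy run decreases on every iteration.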
Linear Regression with
multiple variables
Features and
polynomial regression
Housing prices prediction
Polynomial regression
[Plot: Price (y) vs. Size (x).]
Fit, e.g., a cubic h_θ(x) = θ_0 + θ_1·(size) + θ_2·(size)² + θ_3·(size)³ by treating x_1 = (size), x_2 = (size)², x_3 = (size)³ as features of an ordinary linear regression.
Choice of features
[Plot: Price (y) vs. Size (x).]
Instead of polynomial terms, other features can be chosen, e.g. h_θ(x) = θ_0 + θ_1·(size) + θ_2·√(size).
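Building polynomial features can be sketched as below; the size values are illustrative. Note how feature scaling becomes essential once powers are involved:

```python
import numpy as np

# Turn a single "size" feature into x, x^2, x^3, so polynomial regression
# reduces to linear regression in the expanded feature space.
sizes = np.array([100.0, 250.0, 500.0, 750.0, 1000.0])   # illustrative
X_poly = np.column_stack([sizes, sizes**2, sizes**3])

# Scaling matters here: the columns span wildly different ranges
# (size up to 1e3, size^2 up to 1e6, size^3 up to 1e9).
mu = X_poly.mean(axis=0)
s = X_poly.max(axis=0) - X_poly.min(axis=0)
X_scaled = (X_poly - mu) / s
```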
An alternative, efficient approach
for problems with multiple features:
NORMAL EQUATION
Multiple features (variables).

Size (feet²) | Price ($1000)
2104 | 460
1416 | 232
1534 | 315
852 | 178
… | …
Multiple features
Approaches: 1) Gradient Descent
2) Normal equation
Multiple features (variables).

Size (feet²) | Number of bedrooms | Number of floors | Age of home (years) | Price ($1000)
2104 | 5 | 1 | 45 | 460
1416 | 3 | 2 | 40 | 232
1534 | 3 | 2 | 30 | 315
852 | 2 | 1 | 36 | 178
… | … | … | … | …

Notation:
n = number of features
x^(i) = input (features) of the i-th training example
x_j^(i) = value of feature j in the i-th training example
Hypothesis:
Previously (one feature): h_θ(x) = θ_0 + θ_1·x
Now (n features): h_θ(x) = θ_0 + θ_1·x_1 + θ_2·x_2 + … + θ_n·x_n
For convenience of notation, define x_0 = 1, so that h_θ(x) = θᵀx.
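With x_0 = 1 prepended, the hypothesis is just an inner product; a minimal sketch with illustrative parameter and feature values:

```python
import numpy as np

# h(x) = theta_0 + theta_1*x_1 + ... + theta_n*x_n = theta^T x once x_0 = 1
theta = np.array([80.0, 0.1, 20.0])     # [theta_0, theta_1, theta_2], illustrative
x = np.array([1.0, 2104.0, 3.0])        # [x_0, size, bedrooms]

h = theta @ x                                      # vectorized theta^T x
h_loop = sum(t * xi for t, xi in zip(theta, x))    # same sum, element-wise
```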
Identity Matrix
Denoted I (or I_{n×n}).
Examples of identity matrices:
2×2: [[1, 0], [0, 1]]
3×3: [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
4×4: [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
For any matrix A: A·I = I·A = A.
Linear Algebra
review (optional)
Inverse and
transpose
Not all numbers have an inverse (e.g. 0 does not).
Matrix inverse:
If A is an m×m (square) matrix and it has an inverse, then A·A⁻¹ = A⁻¹·A = I.
Let A and B be matrices. Then in general, A·B ≠ B·A (matrix multiplication is not commutative).
E.g. A = [[1, 1], [0, 0]] and B = [[0, 0], [2, 0]]: A·B = [[2, 0], [0, 0]] but B·A = [[0, 0], [2, 2]].
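Inverse, transpose, identity, and non-commutativity can all be checked in numpy; the matrices below are illustrative:

```python
import numpy as np

A = np.array([[1.0, 1.0], [0.0, 2.0]])
B = np.array([[0.0, 1.0], [1.0, 0.0]])

# Inverse: A has one because det(A) = 2 != 0
A_inv = np.linalg.inv(A)
I = np.eye(2)                  # 2x2 identity matrix
inv_ok = np.allclose(A @ A_inv, I) and np.allclose(A_inv @ A, I)

# Transpose flips rows and columns
A_T = A.T

# Non-commutativity: in general A @ B != B @ A
AB, BA = A @ B, B @ A
```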
Linear Regression with
multiple variables
Gradient descent:
Repeat {
  θ_j := θ_j − α · (1/m) · Σ_{i=1}^{m} (h_θ(x^(i)) − y^(i)) · x_j^(i)
}
(simultaneously update θ_j for j = 0, …, n)
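A single simultaneous update can be sketched in vectorized form, which computes every θ_j from the same old θ for free; the three-example dataset is illustrative:

```python
import numpy as np

def gd_step(theta, X, y, alpha):
    """One simultaneous gradient-descent update:
    theta := theta - (alpha/m) * X^T (X theta - y)."""
    m = len(y)
    return theta - (alpha / m) * (X.T @ (X @ theta - y))

# Illustrative data (x0 = 1 column already included)
X = np.array([[1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([2.0, 3.0, 4.0])

theta = gd_step(np.zeros(2), X, y, alpha=0.1)
```

Updating θ_j one at a time with a loop risks using an already-updated θ_0 when computing θ_1; the vectorized form avoids that bug by construction.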
Linear Regression with
multiple variables
Normal equation
Gradient descent finds θ iteratively; the normal equation instead solves for θ analytically, in one step.
Intuition: If θ is a scalar (1D), J(θ) is a quadratic function; set the derivative dJ/dθ to zero and solve for θ.
For θ ∈ ℝ^(n+1): set ∂J/∂θ_j = 0 (for every j) and solve for θ_0, θ_1, …, θ_n.
Examples: m = 5 training examples, n = 4 features.

x_0 | Size (feet²) | Number of bedrooms | Number of floors | Age of home (years) | Price ($1000)
1 | 2104 | 5 | 1 | 45 | 460
1 | 1416 | 3 | 2 | 40 | 232
1 | 1534 | 3 | 2 | 30 | 315
1 | 852 | 2 | 1 | 36 | 178
1 | 3000 | 4 | 1 | 38 | 540

With X the m×(n+1) matrix of the feature columns (including x_0) and y the m-vector of prices:
θ = (XᵀX)⁻¹ Xᵀy
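The normal equation applied to the table above can be sketched as follows (solving the linear system is used instead of forming the inverse explicitly, which is numerically preferable):

```python
import numpy as np

# Design matrix from the table above, x0 = 1 column included
X = np.array([[1.0, 2104.0, 5.0, 1.0, 45.0],
              [1.0, 1416.0, 3.0, 2.0, 40.0],
              [1.0, 1534.0, 3.0, 2.0, 30.0],
              [1.0,  852.0, 2.0, 1.0, 36.0],
              [1.0, 3000.0, 4.0, 1.0, 38.0]])
y = np.array([460.0, 232.0, 315.0, 178.0, 540.0])

# theta = (X^T X)^{-1} X^T y, computed as a linear solve
theta = np.linalg.solve(X.T @ X, X.T @ y)
predictions = X @ theta
```

No feature scaling and no learning rate are needed here, which is exactly the trade-off the next comparison slide summarizes.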
m training examples, n features.

Gradient Descent:
• Need to choose α.
• Needs many iterations.
• Works well even when n is large.

Normal Equation:
• No need to choose α.
• Don't need to iterate.
• Need to compute (XᵀX)⁻¹.
• Slow if n is very large.
Linear Regression with
multiple variables
Normal equation
and non-invertibility
θ = (XᵀX)⁻¹ Xᵀy, where (XᵀX)⁻¹ is the inverse of the matrix XᵀX.
Octave: pinv(X' * X) * X' * y
What if XᵀX is non-invertible (singular / degenerate)?
• Redundant features (linearly dependent).
E.g. x_1 = size in feet²
x_2 = size in m²
Then x_1 = (3.28)² · x_2, so the columns of X are linearly dependent and XᵀX has no inverse.
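The redundant-feature case can be reproduced numerically; the pseudoinverse (numpy's `pinv`, the counterpart of Octave's `pinv`) still returns a sensible minimum-norm solution where a plain inverse would fail. The sizes and prices are illustrative:

```python
import numpy as np

# x2 (size in m^2) is a fixed multiple of x1 (size in feet^2), since
# 1 m^2 = 3.28^2 feet^2, so the columns of X are linearly dependent
# and X^T X is singular.
feet2 = np.array([2104.0, 1416.0, 1534.0, 852.0])
m2 = feet2 / (3.28 ** 2)                     # redundant feature
X = np.column_stack([np.ones(4), feet2, m2])
y = np.array([460.0, 232.0, 315.0, 178.0])

# Pseudoinverse instead of inverse: well-defined even for singular X^T X
theta = np.linalg.pinv(X.T @ X) @ (X.T @ y)
residual = np.linalg.norm(X @ theta - y)
```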
Example of Normal Equation: Ordinary Least Squares (OLS)
Ordinary Least Squares (OLS) is a method used to estimate the parameters of a linear regression model. It solves for θ analytically, ending in the closed-form solution θ = (XᵀX)⁻¹ Xᵀy.
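A small worked OLS example; the four data points are illustrative, and the closed-form result is cross-checked against numpy's least-squares solver:

```python
import numpy as np

# Fit y = theta0 + theta1 * x to four points
X = np.array([[1.0, 1.0], [1.0, 2.0], [1.0, 3.0], [1.0, 4.0]])
y = np.array([6.0, 5.0, 7.0, 10.0])

theta_ne = np.linalg.inv(X.T @ X) @ X.T @ y          # normal equation
theta_ls, *_ = np.linalg.lstsq(X, y, rcond=None)     # SVD-based solver
# Both give the same OLS line: y = 3.5 + 1.4 x
```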