Unit 5 (The Method of Least Squares)
Unit 5 (The Method of Least Squares)
Unit-5
(The Method of Least Squares)
Dr. Muhammad Majid Gulzar (CIE-KFUPM)
Contents (Unit-5):
1) Linear Regression (Sec 17.1)
Coefficients of the model are determined such that the error between model values and
the given date is minimum.
Value of the 𝑘𝑘 𝑡𝑡𝑡 point can be obtained from → 𝑓𝑓(𝑥𝑥𝑘𝑘 ) = 𝑦𝑦𝑘𝑘 + 𝑒𝑒𝑘𝑘
∑𝑛𝑛
𝑘𝑘=1 𝑓𝑓 𝑥𝑥𝑘𝑘 −𝑦𝑦𝑘𝑘
Average error → 𝐸𝐸𝑎𝑎 =
𝑛𝑛
x y 8
0 2
6
2 3.2
4
y
4 3.8
5 4.6 2
8 6.2
0
0 2 4 6 8 10
x
9 6.8
Let the no. of independent variables be two, then → y = 𝑎𝑎0 + 𝑎𝑎1 𝑥𝑥 + 𝑎𝑎2 𝑥𝑥 2
Let the no. of independent variables be two, then → y = 𝑎𝑎0 + 𝑎𝑎1 𝑥𝑥1 + 𝑎𝑎2 𝑥𝑥2
For 𝑚𝑚 independent variables → y = 𝑎𝑎0 + 𝑎𝑎1 𝑥𝑥1 + 𝑎𝑎2 𝑥𝑥2 + ⋯ + 𝑎𝑎𝑚𝑚 𝑥𝑥𝑚𝑚
Dr. Muhammad Majid Gulzar (CIE-KFUPM)
Quantification of Error in Regression:
y
x Main Objective
(Measured)
8
1 0.5
2 2.5 6
3 2
4
y
4 4
2
5 3.5
6 6 0
0 2 4 6 8 10
x
7 5.5
𝟐𝟐
Not 𝒓𝒓, go for 𝒓𝒓
Modeled
6
y
2
0
0 1 2 3 4 5 6 7 8 9
x
Residuals
1
0.5
-0.5
-1
0 1 2 3 4 5 6 7 8 9
𝜕𝜕𝑆𝑆𝑟𝑟 𝜕𝜕𝑆𝑆𝑟𝑟
= −2 �(𝑦𝑦𝑖𝑖 −𝑎𝑎0 − 𝑎𝑎1 𝑥𝑥𝑖𝑖 ) = 0 = −2 � 𝑥𝑥𝑖𝑖 (𝑦𝑦𝑖𝑖 −𝑎𝑎0 − 𝑎𝑎1 𝑥𝑥𝑖𝑖 ) = 0
𝜕𝜕𝑎𝑎𝑜𝑜 𝜕𝜕𝑎𝑎1
− � 𝑦𝑦𝑖𝑖 + � 𝑎𝑎0 + � 𝑎𝑎1 𝑥𝑥𝑖𝑖 = 0 − � 𝑥𝑥𝑖𝑖 𝑦𝑦𝑖𝑖 + � 𝑥𝑥𝑖𝑖 𝑎𝑎0 + � 𝑎𝑎1 𝑥𝑥𝑖𝑖 2 = 0
� 𝑎𝑎0 + � 𝑎𝑎1 𝑥𝑥𝑖𝑖 = � 𝑦𝑦𝑖𝑖 � 𝑥𝑥𝑖𝑖 𝑎𝑎0 + � 𝑎𝑎1 𝑥𝑥𝑖𝑖 2 = � 𝑥𝑥𝑖𝑖 𝑦𝑦𝑖𝑖
𝑛𝑛𝑎𝑎0 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 = � 𝑦𝑦𝑖𝑖 𝑎𝑎0 � 𝑥𝑥𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 2 = � 𝑥𝑥𝑖𝑖 𝑦𝑦𝑖𝑖
8
13 6 8.24 0.79 5.02
y
6
16 9 9.35 4.45 0.12
4
18 12 10.09 26.11 3.65
2
�=6.89
𝒚𝒚 𝑺𝑺𝒕𝒕 =48.87 𝑺𝑺𝒓𝒓 =12.12
0
5 10 15 20
Dr. Muhammad Majid Gulzar (CIE-KFUPM) x
Linear Regression (Class Activity):
No. x y x2 xy 𝑛𝑛 � 𝑥𝑥𝑖𝑖 � 𝑦𝑦𝑖𝑖
𝑎𝑎0
𝑎𝑎1 =
1 1 0.5 1 0.5 � 𝑥𝑥𝑖𝑖 � 𝑥𝑥𝑖𝑖 2 � 𝑥𝑥𝑖𝑖 𝑦𝑦𝑖𝑖
2 2 2.5 4 5.0
7 28 𝑎𝑎0 24
=
3 3 2.0 9 6.0 28 140 𝑎𝑎1 119.5
4 4 4.0 16 16.0 7 28
∆= = 980 − 784 = 196
28 140
5 5 3.5 25 17.5
24 28
6 6 6.0 36 36.0 3360 − 3346 14
𝑎𝑎0 = 119.5 140 = = = 0.071
∆ 196 196
7 7 5.5 49 38.5
7 28 24 140 119.5 7 24
𝑎𝑎1 = 28 119.5 = 836.5 − 672 = 164.5 = 0.839
∆ 196 196
𝑀𝑀𝑀𝑀𝑀𝑀 𝑆𝑆𝑟𝑟 = � 𝑒𝑒𝑖𝑖2 = �(𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 −𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 )2 = �(𝑦𝑦𝑖𝑖 −𝑎𝑎0 − 𝑎𝑎1 𝑥𝑥𝑖𝑖 − 𝑎𝑎2 𝑥𝑥𝑖𝑖 2 )2
𝑖𝑖=1 𝑖𝑖=1 𝑖𝑖=1
− � 𝑦𝑦𝑖𝑖 + � 𝑎𝑎0 + � 𝑎𝑎1 𝑥𝑥𝑖𝑖 + � 𝑎𝑎2 𝑥𝑥𝑖𝑖 2 = 0 − � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 + � 𝑎𝑎0 𝑥𝑥𝑖𝑖 + � 𝑎𝑎1 𝑥𝑥𝑖𝑖 2 + � 𝑎𝑎2 𝑥𝑥𝑖𝑖 3 = 0 − � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 2 + � 𝑎𝑎0 𝑥𝑥𝑖𝑖 2 + � 𝑎𝑎1 𝑥𝑥𝑖𝑖 3 + � 𝑎𝑎2 𝑥𝑥𝑖𝑖 4 = 0
𝑛𝑛𝑎𝑎0 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 + 𝑎𝑎2 � 𝑥𝑥𝑖𝑖 2 = � 𝑦𝑦𝑖𝑖 𝑎𝑎0 � 𝑥𝑥𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 2 + 𝑎𝑎2 � 𝑥𝑥𝑖𝑖 3 = � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 𝑎𝑎0 � 𝑥𝑥𝑖𝑖 2 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 3 + 𝑎𝑎2 � 𝑥𝑥𝑖𝑖 4 = � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 2
𝑎𝑎0 � 𝑥𝑥𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 2 + 𝑎𝑎2 � 𝑥𝑥𝑖𝑖 3 + ⋯ + 𝑎𝑎𝑚𝑚 � 𝑥𝑥𝑖𝑖 𝑚𝑚+1 = � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 � 𝑥𝑥𝑖𝑖 � 𝑥𝑥𝑖𝑖 2 � 𝑥𝑥𝑖𝑖 3 … � 𝑥𝑥𝑖𝑖 𝑚𝑚+1 𝑎𝑎0 � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖
𝑎𝑎1
→ 𝑎𝑎2 =
𝑎𝑎0 � 𝑥𝑥𝑖𝑖 2 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 3 + 𝑎𝑎2 � 𝑥𝑥𝑖𝑖 4 + ⋯ + 𝑎𝑎𝑚𝑚 � 𝑥𝑥𝑖𝑖 𝑚𝑚+2 = � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 2 � 𝑥𝑥𝑖𝑖 2 � 𝑥𝑥𝑖𝑖 3 � 𝑥𝑥𝑖𝑖 4 … � 𝑥𝑥𝑖𝑖 𝑚𝑚+2 ⋮ � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 2
𝑎𝑎𝑚𝑚
⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮
𝑎𝑎0 � 𝑥𝑥𝑖𝑖 𝑚𝑚 + 𝑎𝑎1 � 𝑥𝑥𝑖𝑖 𝑚𝑚+1 + 𝑎𝑎2 � 𝑥𝑥𝑖𝑖 𝑚𝑚+2 + ⋯ + 𝑎𝑎𝑚𝑚 � 𝑥𝑥𝑖𝑖 2𝑚𝑚 = � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 𝑚𝑚 � 𝑥𝑥𝑖𝑖 𝑚𝑚 � 𝑥𝑥𝑖𝑖 𝑚𝑚+1 � 𝑥𝑥𝑖𝑖 𝑚𝑚+2 … � 𝑥𝑥𝑖𝑖 2𝑚𝑚 � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑖𝑖 𝑚𝑚
Dr. Muhammad Majid Gulzar (CIE-KFUPM) y = 𝑎𝑎0 + 𝑎𝑎1 𝑥𝑥 + 𝑎𝑎2 𝑥𝑥 2 = 2.48 + 2.36𝑥𝑥 + 1.86𝑥𝑥 2
Polynomial Regression (Example):
𝑛𝑛 𝑛𝑛
x1 y ymodel (y-ymean)2 (y-ymodel)2 � 2 = 2513.39,
𝑆𝑆𝑡𝑡 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦) 𝑆𝑆𝑟𝑟 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 )2 = 3.74
𝑖𝑖=1 𝑖𝑖=1
0 2.1 2.48 544.29 0.14
𝑆𝑆𝑡𝑡 − 𝑆𝑆𝑟𝑟 2513.39 − 3.74
𝑟𝑟 2 = = = 0.99
1 7.7 6.7 314.35 1 𝑆𝑆𝑡𝑡 2513.39
y
30
�=25.43
𝒚𝒚 𝑺𝑺𝒕𝒕 =2513.39 𝑺𝑺𝒓𝒓 =3.74
20
10
Measured
Modeled
0
0 1 2 3 4 5 6
Dr. Muhammad Majid Gulzar (CIE-KFUPM) x
Polynomial Regression (Example):
𝑛𝑛 𝑛𝑛
x1 y ymodel (y-ymean)2 (y-ymodel)2 � 2 = 2513.39,
𝑆𝑆𝑡𝑡 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦) 𝑆𝑆𝑟𝑟 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 )2 = 3.74
𝑖𝑖=1 𝑖𝑖=1
0 2.1 2.48 544.29 0.14
Standard error of the Estimate: (𝑚𝑚 = 2)
1 7.7 6.7 314.35 1
2 13.6 14.64 139.95 1.08 𝑆𝑆𝑟𝑟 3.74
𝑆𝑆𝑦𝑦⁄𝑥𝑥 = = = 1.12
𝑛𝑛 − (𝑚𝑚 + 1) 6−3
3 27.2 26.30 3.13 0.81
4 40.9 41.68 239.32 0.61 Correlation Coefficient:
𝑀𝑀𝑀𝑀𝑀𝑀 𝑆𝑆𝑟𝑟 = � 𝑒𝑒𝑖𝑖2 = �(𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 −𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 )2 = �(𝑦𝑦𝑖𝑖 −𝑎𝑎0 − 𝑎𝑎1 𝑥𝑥1𝑖𝑖 − 𝑎𝑎2 𝑥𝑥2𝑖𝑖 )2
𝑖𝑖=1 𝑖𝑖=1 𝑖𝑖=1
− � 𝑦𝑦𝑖𝑖 + � 𝑎𝑎0 + � 𝑎𝑎1 𝑥𝑥1𝑖𝑖 + � 𝑎𝑎2 𝑥𝑥2𝑖𝑖 = 0 − � 𝑦𝑦𝑖𝑖 𝑥𝑥1𝑖𝑖 + � 𝑎𝑎0 𝑥𝑥1𝑖𝑖 + � 𝑎𝑎1 𝑥𝑥1𝑖𝑖 2 + � 𝑎𝑎2 𝑥𝑥1𝑖𝑖 𝑥𝑥2𝑖𝑖 = 0 − � 𝑦𝑦𝑖𝑖 𝑥𝑥2𝑖𝑖 + � 𝑎𝑎0 𝑥𝑥2𝑖𝑖 + � 𝑎𝑎1 𝑥𝑥2𝑖𝑖 𝑥𝑥1𝑖𝑖 + � 𝑎𝑎2 𝑥𝑥2𝑖𝑖 2 = 0
𝑛𝑛𝑎𝑎0 + 𝑎𝑎1 � 𝑥𝑥1𝑖𝑖 + 𝑎𝑎2 � 𝑥𝑥2𝑖𝑖 = � 𝑦𝑦𝑖𝑖 𝑎𝑎0 � 𝑥𝑥1𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥1𝑖𝑖 2 + 𝑎𝑎2 � 𝑥𝑥1𝑖𝑖 𝑥𝑥2𝑖𝑖 = � 𝑦𝑦𝑖𝑖 𝑥𝑥1𝑖𝑖 𝑎𝑎0 � 𝑥𝑥2𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥2𝑖𝑖 𝑥𝑥1𝑖𝑖 + 𝑎𝑎2 � 𝑥𝑥2𝑖𝑖 2 = � 𝑦𝑦𝑖𝑖 𝑥𝑥2𝑖𝑖
𝑎𝑎0 � 𝑥𝑥1𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥1𝑖𝑖 2 + 𝑎𝑎2 � 𝑥𝑥1𝑖𝑖 𝑥𝑥2𝑖𝑖 + ⋯ + 𝑎𝑎𝑚𝑚 � 𝑥𝑥1𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 = � 𝑦𝑦𝑖𝑖 𝑥𝑥1𝑖𝑖 � 𝑥𝑥1𝑖𝑖 � 𝑥𝑥1𝑖𝑖 2 � 𝑥𝑥1𝑖𝑖 𝑥𝑥2𝑖𝑖 … � 𝑥𝑥1𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 𝑎𝑎0 � 𝑦𝑦𝑖𝑖 𝑥𝑥1𝑖𝑖
𝑎𝑎1
→ 𝑎𝑎2 =
𝑎𝑎0 � 𝑥𝑥2𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥2𝑖𝑖 𝑥𝑥1𝑖𝑖 + 𝑎𝑎2 � 𝑥𝑥2𝑖𝑖 2 + ⋯ + 𝑎𝑎𝑚𝑚 � 𝑥𝑥2𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 = � 𝑦𝑦𝑖𝑖 𝑥𝑥2𝑖𝑖 � 𝑥𝑥2𝑖𝑖 � 𝑥𝑥2𝑖𝑖 𝑥𝑥1𝑖𝑖 � 𝑥𝑥2𝑖𝑖 2 … � 𝑥𝑥2𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 ⋮ � 𝑦𝑦𝑖𝑖 𝑥𝑥2𝑖𝑖
𝑎𝑎𝑚𝑚
⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮
𝑎𝑎0 � 𝑥𝑥𝑚𝑚𝑖𝑖 + 𝑎𝑎1 � 𝑥𝑥1𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 + 𝑎𝑎2 � 𝑥𝑥2𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 + ⋯ + 𝑎𝑎𝑚𝑚 � 𝑥𝑥𝑚𝑚𝑚𝑚 2 = � 𝑦𝑦𝑖𝑖 𝑥𝑥2𝑖𝑖 � 𝑥𝑥𝑚𝑚𝑚𝑚 � 𝑥𝑥1𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 � 𝑥𝑥2𝑖𝑖 𝑥𝑥𝑚𝑚𝑚𝑚 … � 𝑥𝑥𝑚𝑚𝑚𝑚 2 � 𝑦𝑦𝑖𝑖 𝑥𝑥𝑚𝑚𝑖𝑖
Dr. Muhammad Majid Gulzar (CIE-KFUPM) y = 𝑎𝑎0 + 𝑎𝑎1 𝑥𝑥1 + 𝑎𝑎2 𝑥𝑥2 = 14.02 − 6.44𝑥𝑥1 + 2.53𝑥𝑥2
Multiple Linear Regression (Example):
𝑛𝑛 𝑛𝑛
x1 x2 y ymodel (y-ymean)2 (y-ymodel)2 � 2 = 157.34,
𝑆𝑆𝑡𝑡 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦) 𝑆𝑆𝑟𝑟 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 )2 = 8.29
𝑖𝑖=1 𝑖𝑖=1
0 0 14 14.02 11.09 0.00
𝑆𝑆𝑡𝑡 − 𝑆𝑆𝑟𝑟 157.34 − 8.29
𝑟𝑟 2 = = = 0.95
0 2 21 19.08 13.47 3.69 𝑆𝑆𝑡𝑡 157.34
Dr. Muhammad Majid Gulzar (CIE-KFUPM) y = 𝑎𝑎0 + 𝑎𝑎1 𝑥𝑥1 + 𝑎𝑎2 𝑥𝑥2 = 5 + 4𝑥𝑥1 − 3𝑥𝑥2
Multiple Linear Regression (Class Activity):
𝑛𝑛 𝑛𝑛
x1 x2 y ymodel (y-ymean)2 (y-ymodel)2 � 2 = 458,
𝑆𝑆𝑡𝑡 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦) 𝑆𝑆𝑟𝑟 = �( 𝑦𝑦𝑖𝑖 − 𝑦𝑦𝑖𝑖,𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 )2 = 0
𝑖𝑖=1 𝑖𝑖=1
0 0 5 5 16 0
Standard error of the Estimate: (𝑚𝑚 = 2)
2 1 10 10 1 0
2.5 2 9 9 0 0 𝑆𝑆𝑟𝑟 0
𝑆𝑆𝑦𝑦⁄𝑥𝑥 = = =0
𝑛𝑛 − (𝑚𝑚 + 1) 6−3
1 3 0 0 81 0
4 6 3 3 36 0 Correlation Coefficient: