
Big Data Statistics, meeting 5: When d is bigger than n, part 2
21 February 2024
LARS & friends
■ The LARS algorithm is very closely related to a variable selection technique called
  least angle regression, sometimes abbreviated simply as LAR; if you are interested in
  this technique, it was introduced in Efron et al. (2004), see the references.
■ Recall that for the linear model we had the following characterization of the
  LASSO estimator: A necessary and sufficient condition for β̂ to be a solution of
  ((3), lecture 4, slide 20) is

      G_j(β̂) = −sign(β̂_j) λ    if β̂_j ≠ 0;
      |G_j(β̂)| ≤ λ             if β̂_j = 0;

  where G_j(β̂) is the jth component of G(β̂). Furthermore, if for a solution β̂ we
  have |G_j(β̂)| < λ and hence β̂_j = 0, then for any other solution β̃ of (3) in
  lecture 4, we have β̃_j = 0.
■ Note that this characterizes the LASSO estimator for a fixed λ; a numerical check of
  the two conditions is sketched below.
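The following is a minimal sketch (not part of the lecture material) that verifies the two
conditions numerically, assuming Python with numpy and scikit-learn and assuming the
least-squares part of (3) is scaled as in scikit-learn's Lasso, i.e. (1/(2n))‖Y − Xβ‖²; if the
lecture's criterion uses a different scaling, λ has to be rescaled accordingly.

    # Numerical check of the LASSO characterization on simulated data.
    import numpy as np
    from sklearn.linear_model import Lasso

    rng = np.random.default_rng(0)
    n, d, lam = 200, 8, 0.1
    X = rng.standard_normal((n, d))
    Y = X[:, 0] - 2 * X[:, 1] + rng.standard_normal(n)

    # LASSO solution for the fixed penalty level lam (scikit-learn's alpha).
    beta_hat = Lasso(alpha=lam, fit_intercept=False, tol=1e-12, max_iter=100_000).fit(X, Y).coef_

    # Gradient of the least-squares part (1/(2n))||Y - X beta||^2 at beta_hat.
    G = -X.T @ (Y - X @ beta_hat) / n

    active = beta_hat != 0
    print(np.allclose(G[active], -np.sign(beta_hat[active]) * lam, atol=1e-6))  # G_j = -sign(beta_j) * lam
    print(np.all(np.abs(G[~active]) <= lam + 1e-8))                             # |G_j| <= lam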

7
LARS & friends (cont’d)
Based on this characterization one can prove the following result.
Theorem (LASSO is piecewise linear for the linear model) As a function of λ we have for
the solution β̂(λ) of the minimization problem labeled (3) on slide 20 of lecture 4 the
following:
(i) There exists a real number λ_max such that β̂(λ) = 0 for λ ≥ λ_max (cf. Exercise 13).
(ii) There are real numbers 0 = λ_0 < λ_1 < . . . < λ_m = λ_max and vectors γ^k ∈ R^d,
     k = 0, . . . , m − 1, such that

         β̂(λ) = β̂(λ_k) + (λ − λ_k) γ^k,   λ ∈ [λ_k, λ_{k+1}),   k = 0, . . . , m − 1.

Some intuition: The characterization result on the previous slide gives conditions
on the gradient of the criterion function. For the squared loss the gradient is linear
in β for any λ. Thus, it is not entirely unexpected that the solution as a function of
λ is piecewise linear.
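A minimal sketch (assuming Python with numpy and scikit-learn, which are not part of the
course material) that computes this piecewise-linear path on simulated data; the returned
alphas play the role of the kink points λ_m > . . . > λ_0 = 0 of the theorem, up to
scikit-learn's scaling of the penalty, and the coefficients move linearly in λ between
consecutive kinks.

    # Compute the LASSO solution path (kink points and coefficients at the kinks).
    import numpy as np
    from sklearn.linear_model import lars_path

    rng = np.random.default_rng(0)
    n, d = 100, 10
    X = rng.standard_normal((n, d))
    beta_true = np.concatenate([[3.0, -2.0, 1.5], np.zeros(d - 3)])  # three active coefficients
    y = X @ beta_true + rng.standard_normal(n)

    # method="lasso" gives the LASSO path (method="lar" would give plain least angle regression).
    alphas, active, coefs = lars_path(X, y, method="lasso")
    print(alphas)        # decreasing sequence of kink points, ending at 0
    print(coefs.shape)   # (d, number of kinks): the solution at every kink point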

8
Density estimation (cont’d)
■ One can argue that (??) is

      E[ ∫_{−∞}^{∞} (f̂_h(x))² dx ] − 2 E[ ∫_{−∞}^{∞} f̂_h(x) f(x) dx ]            (3)

  plus a constant not depending on f̂_h.


■ Thanks to our leave-one-out procedure we have not just one density estimator but n
  of them.
■ Those can be used to estimate (??) by

      CV_h := (1/n) ∑_{j=1}^{n} ∫_{−∞}^{∞} (f̂_{h,j}(x))² dx − (2/n) ∑_{j=1}^{n} f̂_{h,j}(X_j),

  where f̂_{h,j} denotes the estimator computed without observation X_j.

■ Note that CV_h depends only on h and the data; a computational sketch follows below.
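A minimal sketch (assuming Python with numpy, a Gaussian kernel and simulated data, none
of which are prescribed by the lecture) of CV_h; for the Gaussian kernel the integral
∫ (f̂_{h,j}(x))² dx has a closed form, which the code uses.

    # Leave-one-out cross-validation criterion CV_h for a kernel density estimator.
    import numpy as np

    def cv_criterion(x, h):
        """CV_h = (1/n) sum_j int (f_hat_{h,j}(t))^2 dt - (2/n) sum_j f_hat_{h,j}(x_j)."""
        n = len(x)
        diff = (x[:, None] - x[None, :]) / h
        K = np.exp(-0.5 * diff**2) / np.sqrt(2 * np.pi)       # Gaussian kernel K((x_i - x_k)/h)
        # int K((t-x_i)/h) K((t-x_k)/h) dt = h * exp(-((x_i-x_k)/h)^2 / 4) / (2*sqrt(pi))
        conv = np.exp(-0.25 * diff**2) / np.sqrt(4 * np.pi)
        sq_term, fit_term = 0.0, 0.0
        for j in range(n):
            keep = np.arange(n) != j                          # leave observation j out
            sq_term += conv[np.ix_(keep, keep)].sum() / ((n - 1) ** 2 * h)
            fit_term += K[j, keep].sum() / ((n - 1) * h)      # f_hat_{h,j}(x_j)
        return sq_term / n - 2.0 * fit_term / n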

22
Density estimation (cont’d)
■ Because CV_h is a proxy for the quantity we want to minimize as a function of h, we
  simply choose h as the value for which CV_h is minimal; mathematically we can
  denote this by

      ĥ_optimal = arg min_h CV_h,

  where h ranges over [c_1 n^{−1/5}, c_2 n^{−1/5}] with 0 < c_1 < c_2, and the exponent
  comes from the fact that we already concluded above that the optimal h ∼ n^{−1/5}.
■ Important message of ĥ_optimal = arg min_h CV_h: we turned the somewhat arbitrary h
  (within the above range) into a data-dependent, or data-based, choice; see the sketch
  below.
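A minimal sketch of this grid search, reusing the cv_criterion helper from the sketch
above; the simulated data, the grid and the constants c_1 = 0.5, c_2 = 2.0 are arbitrary
illustrative choices.

    # Choose h by minimizing CV_h over a grid inside [c1 n^(-1/5), c2 n^(-1/5)].
    import numpy as np

    rng = np.random.default_rng(1)
    x = rng.standard_normal(200)                         # simulated sample, n = 200
    n = len(x)
    grid = np.linspace(0.5, 2.0, 30) * n ** (-1 / 5)     # candidate bandwidths
    h_optimal = grid[int(np.argmin([cv_criterion(x, h) for h in grid]))]
    print(h_optimal)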

23
Cross validation LASSO (cont’d)
■ From the last bullet on the previous slide it is clear that the prediction error is also
  a function of β̂(λ). We write this as

      λ ↦ Y^{future} − ∑_{j=1}^{d} β̂_j(λ) x_j^{future}.

■ Typically, we want the squared prediction error to be small on average, i.e. we
  would like to find λ such that

      E[ ( Y^{future} − ∑_{j=1}^{d} β̂_j(λ) x_j^{future} )² ]

  is small.
■ Recalling what we did above, we need an estimator of this unknown expectation for
  EVERY λ in order to find the one for which it is minimal.

26
Cross validation LASSO (cont’d)
■ How did we get an estimator for the criterion function above? We split the sample
into n groups

Group 1 : x2 , x3 , . . . , xn
Group 2 : x1 , x3 , . . . , xn
...
Group n : x1 , x2 , x3 , . . . , xn−1

and calculated an estimator for f based on group 1, group 2, . . ., group n.


■ Here we will follow a similar strategy. We divide the sample
  (Y_1, x_1), . . . , (Y_n, x_n) into K > 1 groups. One could take, for instance, K = 10.
  The groups are then (for ease of notation we assume that n is divisible by K)

      Group 1 : (Y_1, x_1), . . . , (Y_{n/K}, x_{n/K});
      Group 2 : (Y_{n/K+1}, x_{n/K+1}), . . . , (Y_{2n/K}, x_{2n/K});
      . . .

  A splitting sketch in code follows below.
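A minimal sketch (assuming Python/numpy, not part of the lecture) of this split of the
index set 1, . . . , n into K contiguous groups.

    # Split the indices 0, ..., n-1 into K groups; with n divisible by K every
    # group contains exactly n/K observations.
    import numpy as np

    n, K = 200, 10
    folds = np.array_split(np.arange(n), K)   # folds[0] = group 1, folds[1] = group 2, ...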

27
Cross validation LASSO (cont’d)
■ Now to estimate the expectation we first leave out the first group and estimate β
  based on group 2, . . . , group K. Denote this leave-group-one-out estimate by β̂^1(λ).
  We then estimate the average squared prediction error by

      L^1(λ) = (K/n) ∑_{i=1}^{n/K} ( Y_i − ∑_{j=1}^{d} β̂_j^1(λ) x_{ij} )².

■ Next we leave out group 2 when estimating β. The estimate based on group 1,
  group 3, . . . , group K is denoted by β̂^2(λ). We then estimate the average squared
  prediction error by

      L^2(λ) = (K/n) ∑_{i=n/K+1}^{2n/K} ( Y_i − ∑_{j=1}^{d} β̂_j^2(λ) x_{ij} )².

■ This process is performed K times, giving us K estimates L^1(λ), . . . , L^K(λ) of the
  squared prediction error loss; a sketch follows below.
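A minimal sketch (assuming Python with numpy and scikit-learn) of the K fold losses for
one fixed λ; note that scikit-learn's alpha corresponds to the lecture's λ only up to the
scaling of the least-squares part in (3) of lecture 4.

    # Compute L^1(lam), ..., L^K(lam) for one fixed penalty level lam.
    import numpy as np
    from sklearn.linear_model import Lasso

    def fold_losses(X, Y, lam, K=10):
        n = X.shape[0]
        folds = np.array_split(np.arange(n), K)         # group 1, ..., group K
        losses = []
        for idx in folds:
            train = np.setdiff1d(np.arange(n), idx)     # leave the current group out
            beta_i = Lasso(alpha=lam, fit_intercept=False).fit(X[train], Y[train]).coef_
            resid = Y[idx] - X[idx] @ beta_i            # Y_i - sum_j beta_j^i(lam) x_ij
            losses.append((K / n) * np.sum(resid**2))   # L^i(lam)
        return losses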

28
Cross validation LASSO (cont’d)
■ Finally we average L^1(λ), . . . , L^K(λ) and get

      L̄(λ) = (1/K) ∑_{i=1}^{K} L^i(λ).

■ In words, L̄(λ) gives us an estimate of the expected squared prediction error loss if
  we use λ for the LASSO estimate of β.
■ Given L̄(λ) for every λ, or at least for a grid of λ values, we choose λ (recalling
  what we did above for density estimation) as the value for which L̄(λ) is minimal;
  mathematically we write this as

      λ̂_optimal = arg min_λ L̄(λ).

■ As for the density estimator, the important message of λ̂_optimal = arg min_λ L̄(λ) is
  that we turned the somewhat arbitrary λ into a data-dependent, or data-based, choice.
■ !!!! Note that this choice gives us a β̂ with low prediction error; the complete
  procedure is sketched below.
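A minimal sketch of the full cross-validation choice of λ, reusing the fold_losses helper
from the sketch above on simulated data; the grid of candidate λ values is an arbitrary
illustrative choice. In practice one would often call scikit-learn's LassoCV, which runs
the same K-fold procedure internally (with its own λ grid and penalty scaling).

    # Average the fold losses over a grid of lambdas and pick the minimizer.
    import numpy as np

    rng = np.random.default_rng(2)
    n, d = 200, 50
    X = rng.standard_normal((n, d))
    Y = X[:, 0] - 2 * X[:, 1] + rng.standard_normal(n)

    lambda_grid = np.logspace(-3, 1, 50)
    L_bar = [np.mean(fold_losses(X, Y, lam, K=10)) for lam in lambda_grid]  # L_bar(lambda)
    lambda_optimal = lambda_grid[int(np.argmin(L_bar))]                     # lambda_hat_optimal
    print(lambda_optimal)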
29
