
Mathematics Basics

Optimization Problems

Contents
 Mathematics and AI

 Linear Algebra

 Probability and Statistics

 Optimization Problems

• Classification of Optimization Problems


• Gradient Descent Method
• Newton's Method and Conjugate Gradient

Optimization Problems
 Optimization problem: the problem of choosing values of the parameters (decision variables) $x$ that minimize or maximize an objective function $f(x)$. It can be represented by

$$x^* = \arg\min_{x} f(x), \quad x = (x_1, x_2, \cdots, x_n)^T \in \mathbb{R}^n$$

$$\text{s.t.}\quad c_i(x) \geq 0,\ i = 1, 2, \cdots, m \quad \text{(inequality constraints)}$$

$$\qquad\;\; c_j(x) = 0,\ j = 1, 2, \cdots, p \quad \text{(equality constraints)}$$

 The constraints define a feasible region, which is assumed to be nonempty.
 Seeking a maximum of $f(x)$ is equivalent to seeking a minimum of $-f(x)$.
 If there are no constraints on the variables other than the objective function itself, the problem is called an unconstrained optimization problem; otherwise, it is called a constrained optimization problem.
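
As an illustration (an assumed example, not part of the original slides), this problem form maps directly onto SciPy's `scipy.optimize.minimize`, whose `'ineq'` constraints follow the same $c_i(x) \geq 0$ convention:

```python
# A minimal sketch, assuming SciPy is available; the objective and
# constraints are made-up examples, not from the slides.
import numpy as np
from scipy.optimize import minimize

f = lambda x: x[0]**2 + x[1]**2                            # objective f(x)
cons = [
    {'type': 'ineq', 'fun': lambda x: x[0] - 0.2},         # c_i(x) >= 0
    {'type': 'eq',   'fun': lambda x: x[0] + x[1] - 1.0},  # c_j(x) = 0
]
res = minimize(f, x0=np.array([0.0, 0.0]), method='SLSQP', constraints=cons)
print(res.x)  # optimal point x*, approximately [0.5, 0.5]
```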

Solutions to Optimization Problems
 Solutions to unconstrained optimization mainly fall into analytical methods and direct methods.
 Direct methods are usually used when the objective function has a complicated representation or cannot be written explicitly. Through numerical calculation over a series of iterations, they generate a sequence of points that searches for an optimal point.
 Analytical methods, also known as indirect methods, obtain the optimal solution from the analytical expression of the objective function of the unconstrained problem. They mainly include the gradient descent method, Newton's method, the quasi-Newton method, the conjugate direction method, and the conjugate gradient method.

Solutions to Optimization Problems
 Solutions to constrained optimization: the method of Lagrange multipliers is usually used for optimization problems subject to equality constraints, while the Karush–Kuhn–Tucker (KKT) conditions are used for problems subject to inequality constraints. These methods turn a constrained optimization problem with n variables and k constraints into an unconstrained problem with (n + k) variables; a short worked example follows below.
 In this course, we focus on the most common solutions to unconstrained optimization problems in deep learning, that is, the gradient descent method and Newton's method.
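
As a quick worked illustration (an assumed example, not from the slides), consider minimizing $f(x_1, x_2) = x_1^2 + x_2^2$ subject to the single equality constraint $x_1 + x_2 = 1$:

```latex
% Worked Lagrange-multiplier example (assumed, for illustration):
% minimize f(x1, x2) = x1^2 + x2^2  subject to  c(x) = x1 + x2 - 1 = 0.
\begin{align*}
L(x_1, x_2, \lambda) &= x_1^2 + x_2^2 - \lambda\,(x_1 + x_2 - 1)\\
\nabla L = 0 \;\Rightarrow\; & 2x_1 = \lambda,\quad 2x_2 = \lambda,\quad x_1 + x_2 = 1\\
\Rightarrow\; & x_1 = x_2 = \tfrac{1}{2},\quad \lambda = 1.
\end{align*}
```

The two-variable problem with one constraint becomes an unconstrained stationarity problem in the three variables $(x_1, x_2, \lambda)$, matching the $(n + k)$ count above.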

Extension to N Dimensions
 How big can N be?
 Problem sizes can vary from a handful of parameters to many thousands.
 We will consider examples for N = 2, so that cost-function surfaces can be visualized.

[Figure: a cost-function surface over a two-dimensional space, with the original point marked]

An Optimization Algorithm
 Start at $x_0$, $k = 0$.
1. Compute a search direction $p_k$.
2. Compute a step size $\alpha_k$ such that $f(x_k + \alpha_k p_k) < f(x_k)$.
3. Update $x_{k+1} = x_k + \alpha_k p_k$ and set $k = k + 1$.
4. Check for convergence (stopping criteria), e.g. $\nabla f(x) = 0$.
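
A minimal sketch of this generic loop in Python follows (illustrative only; the test function, the backtracking rule for $\alpha_k$, and all names are assumptions, not from the slides):

```python
# Generic descent loop: direction, step size, update, convergence check.
import numpy as np

def descend(f, grad, x0, alpha0=1.0, tol=1e-8, max_iter=1000):
    x = x0
    for _ in range(max_iter):
        p = -grad(x)                                  # 1. search direction
        if np.linalg.norm(p) < tol:                   # 4. gradient ~ 0: stop
            break
        alpha = alpha0
        while f(x + alpha * p) >= f(x) and alpha > 1e-12:
            alpha *= 0.5                              # 2. shrink until f drops
        x = x + alpha * p                             # 3. x_{k+1} = x_k + a*p
    return x

# Assumed example: f(x) = x1^2 + 4*x2^2, minimum at the origin.
f = lambda x: x[0]**2 + 4 * x[1]**2
grad = lambda x: np.array([2 * x[0], 8 * x[1]])
print(descend(f, grad, np.array([3.0, 2.0])))  # approximately [0, 0]
```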

Gradient Descent
 Convex function: if, for any $\lambda \in (0, 1)$ and any $x_1, x_2 \in \mathbb{R}^n$,

$$f(\lambda x_1 + (1 - \lambda) x_2) \leq \lambda f(x_1) + (1 - \lambda) f(x_2),$$

then $f(x)$ is called a convex function. The minimum of a convex function occurs at a stationary point.
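
A quick numeric sanity check of this inequality (an assumed example using $f(x) = x^2$, not from the slides):

```python
# Check f(lam*x1 + (1-lam)*x2) <= lam*f(x1) + (1-lam)*f(x2) for f(x) = x^2.
import numpy as np

f = lambda x: x**2
x1, x2 = -1.0, 3.0
for lam in np.linspace(0.1, 0.9, 9):
    lhs = f(lam * x1 + (1 - lam) * x2)
    rhs = lam * f(x1) + (1 - lam) * f(x2)
    assert lhs <= rhs + 1e-12, (lam, lhs, rhs)
print("convexity inequality holds at all sampled points")
```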

Gradient Descent
 The basic principle is to minimize the N-dimensional function by a series of 1D line minimizations:
$$x_{k+1} = x_k + \alpha_k p_k$$
 The gradient descent method chooses $p_k$ to point along the negative gradient (the direction of steepest descent):
$$p_k = -\nabla f(x_k)$$
 The step size $\alpha_k$, the learning rate, is a positive scalar chosen to minimize $f(x_k + \alpha_k p_k)$; for a quadratic objective with Hessian $H$, the exact line-search step is
$$\alpha_k = \frac{p_k^T p_k}{p_k^T H p_k}$$
 Gradient descent converges when every element of the gradient is zero or close to zero.
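
A minimal sketch of steepest descent with this exact step size on a quadratic (assumed example: $f(x) = \frac{1}{2} x^T H x - b^T x$, so $\nabla f(x) = Hx - b$; not from the slides):

```python
# Steepest descent with the exact step alpha_k = (p^T p)/(p^T H p).
import numpy as np

H = np.array([[3.0, 1.0], [1.0, 2.0]])   # symmetric positive definite
b = np.array([1.0, 1.0])
x = np.zeros(2)
for _ in range(100):
    p = -(H @ x - b)                     # p_k = -grad f(x_k)
    if np.linalg.norm(p) < 1e-10:
        break
    alpha = (p @ p) / (p @ H @ p)        # exact line search on a quadratic
    x = x + alpha * p
print(x, np.linalg.solve(H, b))          # both approximate the minimizer
```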

Gradient Descent
 The gradient is everywhere perpendicular to the contour lines.
 After each line minimization, the new gradient is orthogonal to the previous step direction. Therefore the iterates tend to zig-zag down the valley.

Newton's Method: 1D
 Fit a quadratic approximation to $f(x)$ using both gradient and curvature information at $x$.
 Expand $f(x)$ locally using a Taylor series:
$$f(x + \delta x) = f(x) + f'(x)\,\delta x + \frac{1}{2} f''(x)\,\delta x^2 + o(\delta x^2)$$
 Find the $\delta x$ that minimizes this local quadratic approximation:
$$\delta x = -\frac{f'(x)}{f''(x)}$$
 Update $x$:
$$x_{n+1} = x_n + \delta x = x_n - \frac{f'(x_n)}{f''(x_n)}$$
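
A minimal sketch of this 1D update (the test function and all names are assumed examples, not from the slides):

```python
# 1D Newton's method: repeatedly jump to the minimum of the local quadratic.
def newton_1d(fp, fpp, x, tol=1e-10, max_iter=50):
    """fp, fpp: first and second derivatives of f."""
    for _ in range(max_iter):
        dx = -fp(x) / fpp(x)          # minimizer of the quadratic model
        x += dx
        if abs(dx) < tol:
            break
    return x

# Assumed example: f(x) = x^4 - 3x^2, so f'(x) = 4x^3 - 6x, f''(x) = 12x^2 - 6.
print(newton_1d(lambda x: 4*x**3 - 6*x, lambda x: 12*x**2 - 6, x=2.0))
# converges to sqrt(1.5) ≈ 1.2247, a local minimum of f
```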

Newton's Method: N Dimensions
 Expand $f(x)$ locally using a Taylor series about $x_k$:
$$f(x_k + \delta x) = f(x_k) + g_k^T \delta x + \frac{1}{2}\,\delta x^T H_k\, \delta x$$
where the gradient is the vector
$$g_k = \nabla f(x_k) = \left( \frac{\partial f}{\partial x_1} \;\cdots\; \frac{\partial f}{\partial x_N} \right)^T$$
and the Hessian is the symmetric matrix
$$H_k = H(x_k) = \begin{pmatrix} \dfrac{\partial^2 f}{\partial x_1^2} & \cdots & \dfrac{\partial^2 f}{\partial x_1 \partial x_N} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial^2 f}{\partial x_N \partial x_1} & \cdots & \dfrac{\partial^2 f}{\partial x_N^2} \end{pmatrix}$$
Newton's Method: N Dimensions
 For a minimum we require $\nabla f(x) = 0$, and so $g_k + H_k \delta x = 0$,
 with the solution $\delta x = -H_k^{-1} g_k$. This gives the iterative update
$$x_{k+1} = x_k - H_k^{-1} g_k$$
 If $f(x)$ is quadratic, the solution is found in one step.
 The method has quadratic convergence (as in the 1D case).
 The step $\delta x = -H_k^{-1} g_k$ is a downhill direction provided that $H_k$ is positive definite.
 Rather than jumping straight to the predicted minimum, it is better to perform a line minimization, which helps ensure global convergence:
$$x_{k+1} = x_k - \alpha_k H_k^{-1} g_k$$
 If $H_k = I$, this reduces to gradient descent.
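
A minimal numeric sketch of the claim that one full Newton step solves a quadratic exactly (the matrix, vector, and starting point are assumed examples, not from the slides):

```python
# One Newton step on f(x) = 0.5 x^T A x - b^T x reaches the minimum exactly.
import numpy as np

A = np.array([[3.0, 1.0], [1.0, 2.0]])   # Hessian of the quadratic (SPD)
b = np.array([1.0, 1.0])
grad = lambda x: A @ x - b               # gradient g = A x - b

x = np.array([5.0, -7.0])                # arbitrary starting point
dx = np.linalg.solve(A, -grad(x))        # Newton step: solve H dx = -g
x = x + dx                               # full step (alpha = 1)
print(x, np.linalg.solve(A, b))          # identical: minimizer in one step
```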

Newton’s Method
 Quadratic convergence: the number of correct decimal digits roughly doubles at each iteration.
 Global convergence of Newton's method is poor if the starting point is too far from the minimum.
 In practice, it is combined with a globalization strategy that reduces the step size until a decrease in the function value is assured.

Conjugate Gradient
 Each direction $d_k$ is chosen to be conjugate to all previous directions with respect to the Hessian $H$:
$$d_i^T H d_j = 0,\ i \neq j; \qquad d_k = -\nabla f_k + \frac{\nabla f_k^T\, \nabla f_k}{\nabla f_{k-1}^T\, \nabla f_{k-1}}\, d_{k-1}$$
 Compute the step size $\alpha_k$ for $x_k$ using the Hessian $H$, set $x_{k+1} = x_k + \alpha_k d_k$, and evaluate $f_{k+1} = f(x_{k+1})$:
$$\alpha_k = \frac{g_k^T g_k}{d_k^T H d_k}$$
 If $\|\alpha_k d_k\| < \varepsilon$, output $x^* = x_{k+1}$ and $f(x^*) = f_{k+1}$, and stop.
 Otherwise, compute $g_{k+1}$ and $\beta_k = \dfrac{g_{k+1}^T\, g_{k+1}}{g_k^T\, g_k}$.
 Generate the new direction $d_{k+1} = -g_{k+1} + \beta_k d_k$.
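
A minimal sketch of these steps on a quadratic $f(x) = \frac{1}{2} x^T H x - b^T x$, where $g_k = H x_k - b$ (the matrix and vector are assumed examples, not from the slides):

```python
# Conjugate gradient (Fletcher-Reeves form) on a 2D quadratic.
import numpy as np

H = np.array([[4.0, 1.0], [1.0, 3.0]])   # symmetric positive definite
b = np.array([1.0, 2.0])
x = np.zeros(2)

g = H @ x - b                             # initial gradient
d = -g                                    # first direction: steepest descent
for _ in range(len(b)):                   # at most N steps on a quadratic
    alpha = (g @ g) / (d @ H @ d)         # exact step size
    x = x + alpha * d
    g_new = H @ x - b
    beta = (g_new @ g_new) / (g @ g)      # Fletcher-Reeves coefficient
    d = -g_new + beta * d                 # new conjugate direction
    g = g_new
print(x, np.linalg.solve(H, b))           # both give the exact minimizer
```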

Conjugate Gradient
 An N-dimensional quadratic form can be minimized in at most N conjugate descent
steps.

Summary

 This chapter mainly introduces the essential mathematics topics used in AI, including linear algebra, probability and statistics, and optimization problems. It lays a foundation for the other learning materials.

More Information

 Huawei e-Learning website:
https://support.huawei.com/learning/en/newindex.html

 Huawei support case library:
https://support.huawei.com/enterprise/en/index.html
Thank you.

Bring digital to every person, home, and organization for a fully connected, intelligent world.

Copyright © 2020 Huawei Technologies Co., Ltd. All Rights Reserved.

The information in this document may contain predictive statements including, without limitation, statements regarding the future financial and operating results, future product portfolio, new technology, etc. There are a number of factors that could cause actual results and developments to differ materially from those expressed or implied in the predictive statements. Therefore, such information is provided for reference purposes only and constitutes neither an offer nor an acceptance. Huawei may change the information at any time without notice.
