Optimization for Deep Learning
Prof. Seungchul Lee
Industrial AI Lab.
Optimization
Optimization: Mathematical Expression
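The formula on this slide did not survive extraction. As a hedged reminder, a common textbook way to write an optimization problem is shown below; the symbols x, f, g_i, n and m are assumptions used for illustration and may differ from the notation used in the lecture.

```latex
\begin{aligned}
\min_{x \in \mathbb{R}^n} \quad & f(x) \\
\text{subject to} \quad & g_i(x) \le 0, \qquad i = 1, \dots, m
\end{aligned}
```

Here x is the decision variable, f is the objective function, and the g_i are inequality constraints. With no constraints (m = 0), the problem reduces to minimizing f alone, which is the setting gradient descent addresses.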
Solving Optimization Problems
Descent Direction (1D)
Positive derivative: shift x to the left (decrease x)
Negative derivative: shift x to the right (increase x)
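A minimal sketch of the 1D rule above: the sign of the derivative decides which way to move. The example function f(x) = (x - 2)^2 and the helper names are hypothetical, chosen only for illustration.

```python
def derivative(x):
    # Hypothetical example: f(x) = (x - 2)^2, so f'(x) = 2 * (x - 2).
    return 2.0 * (x - 2.0)


def descent_direction(x):
    """Return -1 (move left), +1 (move right), or 0 (stationary point)."""
    g = derivative(x)
    if g > 0:
        return -1  # positive slope: f decreases to the left
    if g < 0:
        return 1   # negative slope: f decreases to the right
    return 0


print(descent_direction(5.0))  # -1: right of the minimum, move left
print(descent_direction(0.0))  # 1: left of the minimum, move right
```

In both cases the move is in the direction opposite to the sign of the derivative, which is what the minus sign in the gradient descent update encodes.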
Gradient Descent
…
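The slide's own update formula is not preserved here. The sketch below uses the standard gradient descent iteration, x <- x - step_size * f'(x), repeated a fixed number of times. The function name, default step size, and test function are assumptions for illustration.

```python
def gradient_descent(grad, x0, step_size=0.1, num_iters=100):
    """Repeat x <- x - step_size * grad(x) for a fixed number of iterations."""
    x = x0
    for _ in range(num_iters):
        x = x - step_size * grad(x)
    return x


# Hypothetical example: f(x) = (x - 2)^2 has gradient 2 * (x - 2)
# and its minimum at x = 2.
x_min = gradient_descent(lambda x: 2.0 * (x - 2.0), x0=10.0)
print(x_min)
```

Starting from x0 = 10, each step shrinks the distance to the minimizer by a constant factor, so the iterate ends up very close to 2.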
Stopping Criteria
…
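The specific criteria on this slide are not preserved. Two commonly used ones are sketched below as an assumption: stop when the gradient magnitude falls below a tolerance, or when an iteration budget is exhausted. The names grad_tol and max_iters are hypothetical.

```python
def gradient_descent_with_stopping(grad, x0, step_size=0.1,
                                   grad_tol=1e-6, max_iters=10_000):
    """Run gradient descent until |grad(x)| < grad_tol or max_iters is hit.

    Returns the final iterate and the number of update steps taken.
    """
    x = x0
    for k in range(max_iters):
        g = grad(x)
        if abs(g) < grad_tol:
            return x, k  # gradient is (nearly) zero: treat as converged
        x = x - step_size * g
    return x, max_iters  # budget exhausted without meeting the tolerance


# Hypothetical example: f(x) = (x - 2)^2.
x_final, steps = gradient_descent_with_stopping(lambda x: 2.0 * (x - 2.0), x0=10.0)
print(x_final, steps)
```

The iteration cap is a safeguard: if the step size is poorly chosen, the gradient test alone might never be satisfied.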
Step size too small: converges very slowly
Step size too large: overshoots and may even diverge
Reduce the step size over time
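The three points above can be sketched on f(x) = x^2 (gradient 2x). The step-size values and the decay schedule 1.1 / (1 + 0.5 k) are assumptions picked only to make each behavior visible; they are not from the lecture.

```python
def run_gd(step_size_at, x0=1.0, num_iters=50):
    """Minimize f(x) = x^2; step_size_at(k) gives the step size at iteration k."""
    x = x0
    for k in range(num_iters):
        x = x - step_size_at(k) * 2.0 * x
    return x


too_small = run_gd(lambda k: 0.001)              # barely moves from x0 = 1
moderate = run_gd(lambda k: 0.1)                 # ends close to the minimum at 0
too_big = run_gd(lambda k: 1.1)                  # overshoots more each step
decayed = run_gd(lambda k: 1.1 / (1.0 + 0.5 * k))  # starts large, then shrinks

for name, value in [("too small", too_small), ("moderate", moderate),
                    ("too big", too_big), ("decayed", decayed)]:
    print(name, value)
```

With a fixed step of 1.1 the iterates grow in magnitude, while the same initial step combined with decay settles near 0 in this example.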
Where Will We Converge?
Convex
Any local minimum is a global minimum
Non-convex
Multiple local minima may exist
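A small sketch of the contrast above: on a convex function, gradient descent reaches the same minimum from different starting points; on a non-convex one, the result depends on where it starts. Both test functions are hypothetical examples, not taken from the lecture.

```python
def gradient_descent(grad, x0, step_size, num_iters=2000):
    x = x0
    for _ in range(num_iters):
        x = x - step_size * grad(x)
    return x


# Convex example: f(x) = x^2, gradient 2x, single minimum at 0.
convex_left = gradient_descent(lambda x: 2.0 * x, x0=-2.0, step_size=0.1)
convex_right = gradient_descent(lambda x: 2.0 * x, x0=2.0, step_size=0.1)


# Non-convex example: f(x) = x^4 - 3x^2 + x has two local minima.
def f(x):
    return x**4 - 3.0 * x**2 + x


def f_grad(x):
    return 4.0 * x**3 - 6.0 * x + 1.0


nonconvex_left = gradient_descent(f_grad, x0=-2.0, step_size=0.01)
nonconvex_right = gradient_descent(f_grad, x0=2.0, step_size=0.01)

print(convex_left, convex_right)        # both near 0
print(nonconvex_left, nonconvex_right)  # two different local minima
print(f(nonconvex_left), f(nonconvex_right))
```

In the non-convex case the run started on the left ends in the lower of the two minima, while the run started on the right stops at a local minimum that is not global.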
Gradient Descent in High Dimension
…
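In more than one dimension the derivative becomes the gradient vector, and the same update is applied to every coordinate. The sketch below assumes a hypothetical two-variable quadratic and uses plain Python lists; the function names and step size are illustrative.

```python
def grad_f(x):
    # Hypothetical example: f(x1, x2) = (x1 - 1)^2 + 2 * (x2 + 3)^2.
    # The gradient collects the partial derivatives for each coordinate.
    return [2.0 * (x[0] - 1.0), 4.0 * (x[1] + 3.0)]


def gradient_descent_nd(grad, x0, step_size=0.1, num_iters=200):
    """Vector update: x <- x - step_size * grad(x), applied coordinate-wise."""
    x = list(x0)
    for _ in range(num_iters):
        g = grad(x)
        x = [xi - step_size * gi for xi, gi in zip(x, g)]
    return x


x_star = gradient_descent_nd(grad_f, [0.0, 0.0])
print(x_star)  # close to the minimizer [1, -3]
```

The loop is the same as in 1D; only the scalar derivative has been replaced by a list of partial derivatives.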
Practically Solving Optimization Problems
Gradient Descent