DLBasic: Components of Deep Learning
1
Introduction followed by a review of important Deep Learning topics including Activation: (ReLU, Sigmoid, Tanh, Softmax), Loss Function, Optimizer, Stochastic Gradient Descent, Back Propagation, One-hot Vector, Vanishing Gradient, Hyperparameter, Label Smoothing
Digital Science Center
Digital Science Center
DLBasic: Examples, Components and Types
1) Examples of use
2) Discussion of smaller building blocks or network components
3) Types of Network such as fully connected, convolutional or recurrent. This is just a short summary
2
Digital Science Center
Deep Learning Terms often Used
3
Digital Science Center
Activation Functions
4
ANN artificial Neuron (Node)
Nature’s Neuron
Digital Science Center
5
Digital Science Center
What is an Activation Function?
6
Digital Science Center
ReLU Rectified Linear Unit
7
Digital Science Center
The Purpose of ReLU
8
Digital Science Center
Sigmoid Activation Function
9
Digital Science Center
Tanh Activation Function
10
Digital Science Center
Softmax Activation Layer
11
Digital Science Center
Scaled Exponential Linear Unit. SELU
12
Digital Science Center
Loss Functions I
13
Digital Science Center
Loss Functions II
14
Digital Science Center
Loss Functions III
15
Digital Science Center
Optimization and Stochastic Gradient Descent SGD I
16
Digital Science Center
Optimization and Stochastic Gradient Descent SGD II
17
Digital Science Center
Optimization and Stochastic Gradient Descent SGD III
18
Digital Science Center
Back Propagation I
19
Output = f(signals from hidden layer 3)
Signals in hidden layer 3 = g(signals from hidden layer 2)
Digital Science Center
Back Propagation II
20
Output = f(signals from hidden layer 3)
Signals in hidden layer 3 = g(signals from hidden layer 2)
So recursion starts at the output (back) and precedes one layer at a time to the input
Digital Science Center
One-Hot Vector
21
Digital Science Center
Vanishing Gradient Problem
22
Digital Science Center
DeepAI on Hyperparameters
23
Digital Science Center
Label Smoothing
24
From Paper: LS is Label Smoothing
ResNet training for classifying 3 image categories: “beaver, dolphin and otter”…note the tremendous difference in cluster tightness.
Digital Science Center