STATS / DATA SCI 315
Lecture 05
Classification
Softmax
Cross-entropy loss
Classification Problems
Regression vs Classification
Toy problem
What does our model output?
Network Architecture
Compact matrix notation
Parameterization Cost
Parameters in a fully connected layer
Softmax Operation
Why softmax?
Softmax function
Properties of softmax function
Softmax output as conditional probabilities
Likelihood
Likelihood
Likelihood
Log Likelihood
Cross-entropy loss