Identifying Global Minimum of High-Dimensional Non-Convex functions
Motivation: Structural Biology
Can NNs identify the global minimum? No.
Protein Trajectory
Let’s model the protein’s trajectory with an SDE
How can we identify the global minimum?
Forcing Gradient Descent Trajectories to Converge: Options
Forcing Gradient Descent Trajectories to Converge: Try 1
Consistency between decoder and encoder for latent space point T(L)
Encoder
Decoder
Decoder
Encoder
x
x - f’(x)
After every step of gradient descent, the point in the latent space needs to converge. We apply a contractive function, T, with Lipschitz(T) < 1.
L
T(L)
x
x - f’(x)
Loss = MSE(x, D(E(x))) + MSE(x - f’(x), D(T(E(x)))) + MSE(E(D(T(L))), T(L))
Forcing Gradient Descent Trajectories to Converge: Try 2
Loss = MSE(D(E(x)), x) + MSE(D(E(x-f’(x)), x- f’(x)) - α
Encoder
Decoder
Decoder
Encoder
x
x - f’(x)
P
After every step of gradient descent, the latent space vector needs to be closer to a predefined “central” point P.
L1 = E(x)
L2 = E(x-f’(x))
Forcing Gradient Descent Trajectories to Converge: Try 3
Encoder
Decoder
Decoder
Encoder
x
x - f’(x)
P
After every step of gradient descent, the latent space vector needs to be closer to a predefined “central” point P.
L1 = E(x)
L2 = E(x-f’(x))
Broader Thoughts