Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Book
Ilija Ilievski, PhD candidate, Learning & Vision Lab
NGS, National University of Singapore
Deep Reinforcement Learning
2/33
ilija139.github.io
Deep Reinforcement Learning
3/33
ilija139.github.io
Deep Reinforcement Learning
4/33
ilija139.github.io
Deep Reinforcement Learning
5/33
ilija139.github.io
Deep Reinforcement Learning
6/33
ilija139.github.io
Deep Reinforcement Learning
7/33
ilija139.github.io
Deep Reinforcement Learning
8/33
ilija139.github.io
Deep Reinforcement Learning for Quant Finance?
9/33
ilija139.github.io
Deep Reinforcement Learning for Quant Finance?
10/33
ilija139.github.io
11
(Deep) Reinforcement Learning Essentials
12/33
ilija139.github.io
(Deep) Reinforcement Learning Essentials
a = π(s)
Qπ(s, a) = E[rt+1+γrt+2+γ2rt+3+... | s, a]
13/33
ilija139.github.io
Bellman Equation
Qπ(s, a) = E[rt+1+γrt+2+γ2rt+3+... | s, a] =
= Es',a'[r + γQπ(s', a') | s, a]
Q*(s, a) = E[r + γ maxQ*(s', a') | s, a]
14/33
ilija139.github.io
Deep Q-network
Q(s, a; θ) ≈ Qπ(s, a)
Loss(θ) = E[(r + γ maxQ(s', a'; θ) - Q(s, a; θ))2]
15/33
ilija139.github.io
Deep Q-network Problems
Loss(θ) = E[(r + γ maxQ(s', a'; θ) - Q(s, a; θ))2]
16/33
ilija139.github.io
Deep Q-network Solutions
Loss(θ) = E[(r + γ maxQ(s', a'; θtarget) - Q(s, a; θ))2]
17/33
ilija139.github.io
Q-learning Algorithm
18/33
ilija139.github.io
19
Order Placement in a Limit Order Book
20/33
ilija139.github.io
Order Placement Problem
spread cost vs order execution probability vs market impact
21/33
ilija139.github.io
22
Deep RL for Optimal Order Placement
23/33
ilija139.github.io
Deep RL for Optimal Order Placement
24/33
ilija139.github.io
Deep RL for Optimal Order Placement
25/33
Level-2 Limit Order Book
ilija139.github.io
Network Architectures: Execution Probability
26/33
ilija139.github.io
Network Architectures: LOB State Representation
27/33
ilija139.github.io
Network Architectures: Q(s, a) network
28/33
ilija139.github.io
Training Details
29/33
ilija139.github.io
Preliminary Results
30/33
ilija139.github.io
Deep Reinforcement Learning for Quant Finance?
31/33
Learn state representation with deep neural networks
Implement Q-value function as deep neural network
Do more experiments...
(I guess)
ilija139.github.io
Unresolved Issues (Future Work)
32/33
ilija139.github.io
Thank you
Questions?
ilija139.github.io