Logical Specifications meet�Learning-based Control
Jyo Deshmukh,
Thomas Lord Dept. of CS, University of Southern California (USC)
CDC Workshop: Formal Methods and Decision-making in the age of AI
Paradigm shift in design of autonomous systems
2/30
Classical approach: Model-based, Well-defined uncertainty models
Brave new world:
Highly uncertain environments, do not easily lend to modeling, data-driven and model-free
CDC Workshop: Formal Methods and Decision-making in the age of AI
Safe control problem
3/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
How do we solve these problems?
4/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
How do we solve these problems?
5/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Signal Temporal Logic1,2,3
6/30
0
100
50
1
3
Always between time 0 and 100
Eventually at some time t between time 20 and 60
From that time t, always till the end of the signal trace
0
100
1
-0.1
+0.1
60
[1] Maler, Oded, and Dejan Nickovic. "Monitoring temporal properties of continuous signals." In Formal Techniques, Modelling and Analysis of Timed and Fault-Tolerant Systems, pp. 152-166, 2004.
[2] Donzé, Alexandre, and Oded Maler. "Robust satisfaction of temporal logic over real-valued signals." International Conference on Formal Modeling and Analysis of Timed Systems, 2010.
[3] Georgios E. Fainekos, George J. Pappas: Robustness of temporal logic specifications for continuous-time signals. Theor. Comput. Sci. 410(42): 4262-4291 (2009)
CDC Workshop: Formal Methods and Decision-making in the age of AI
Specifying autonomous driving constraints and specifications
7/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
STL : Quantitative semantics
8/30
GOOD
GOOD
How far is
bad?
0
100
50
3
BAD
0
100
50
3
BAD
How bad is the violation?
CDC Workshop: Formal Methods and Decision-making in the age of AI
Quantitative Semantics of STL or “Robustness” value
9/30
| |
| |
| |
| |
| |
| |
| |
CDC Workshop: Formal Methods and Decision-making in the age of AI
How do we solve these problems?
10/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Neuro-symbolic Safe Controller Synthesis: High-level idea1,2
11/30
[1] N. Hashemi, X. Qin, J. Deshmukh, G. Fainekos, B. Hoxha, D. Prokhorov, T. Yamaguchi, Risk-Awareness in Learning Neural Controllers for Temporal Logic Objectives, Proc. of ACC’23.
[2] N. Hashemi, B. Hoxha, T. Yamaguchi, D. V. Prokhorov, G. Fainekos, J. V. Deshmukh:
A Neurosymbolic Approach to the Verification of Temporal Logic Properties of Learning-enabled Control Systems. Proc. of ICCPS 2023: 98-109
CDC Workshop: Formal Methods and Decision-making in the age of AI
Neuro-symbolic representation of STL
12/30
[1] N. Hashemi, B. Hoxha, T. Yamaguchi, D. V. Prokhorov, G. Fainekos, J. V. Deshmukh: A Neurosymbolic Approach to the Verification of Temporal Logic Properties of Learning-enabled Control Systems. Proc. of ICCPS 2023: 98-109
CDC Workshop: Formal Methods and Decision-making in the age of AI
Neuro-symbolic Safe Controller Synthesis: High-level idea1,2
13/30
[1] N. Hashemi, X. Qin, J. Deshmukh, G. Fainekos, B. Hoxha, D. Prokhorov, T. Yamaguchi, Risk-Awareness in Learning Neural Controllers for Temporal Logic Objectives, Proc. of ACC’23.
[2] N. Hashemi, B. Hoxha, T. Yamaguchi, D. V. Prokhorov, G. Fainekos, J. V. Deshmukh:
A Neurosymbolic Approach to the Verification of Temporal Logic Properties of Learning-enabled Control Systems. Proc. of ICCPS 2023: 98-109
CDC Workshop: Formal Methods and Decision-making in the age of AI
Results on synthesis for 6D and 3D quadcopter systems1
14/30
[1] N. Hashemi, B. Hoxha, T. Yamaguchi, D. Prokhorov, G. Fainekos, J. Deshmukh., A Neurosymbolic Approach to the Verification of Temporal Logic Properties of Learning-enabled Control Systems. In ICCPS 2023
CDC Workshop: Formal Methods and Decision-making in the age of AI
Meditation on results
Scalability of controller verification:
15/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Better optimization and longer horizons
Challenges:
Training Results (under submission):
16/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Verification problem
17/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Neuro-symbolic Controller Verification (Deterministic)
18/30
[1] N. Hashemi, B. Hoxha,, G. Fainekos, D. Prokhorov, T. Yamaguchi, J. Deshmukh : A Neurosymbolic Approach to the Verification of Temporal Logic Properties of Learning-enabled Control Systems, accepted for publication in the Proc. of ICCPS 2023.
CDC Workshop: Formal Methods and Decision-making in the age of AI
Scalability of verification
19/30
[1] N. Hashemi, B. Hoxha,, G. Fainekos, D. Prokhorov, T. Yamaguchi, J. Deshmukh : A Neurosymbolic Approach to the Verification of Temporal Logic Properties of Learning-enabled Control Systems, accepted for publication in the Proc. of ICCPS 2023.
Property | Time Horizon | Plant NN | Controller NN | Method | Verified? | Runtime (secs) |
| 100 | [3,10,10,10,2] | [2,50,1,2,1,2,1,1] | Exact | Y | 1167 |
Approx | N | 35 | ||||
| 50 | [4,10,10,3] | [3,100,1,2,1,2,1,1] | Exact | Y | 1903 |
Approx | N | 43 | ||||
| 53 | [2,10,10,2] | [5,20,20,20,1] | Exact | Y | 260 |
Approx | Y | 24 | ||||
| 32 | [4,8,2] | [2,8,2] | Exact | Y | 78 |
| 35 | Exact | Y | 1955 | ||
| 35 | Exact | Y | 2369 | ||
| 35 | Exact | Disproven | 1023 |
CDC Workshop: Formal Methods and Decision-making in the age of AI
Elephant in the room: the NN plant model
20/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
How do we solve these problems?
21/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
RL from Logical Specifications (LTL, STL, MTL)
22/30
…
CDC Workshop: Formal Methods and Decision-making in the age of AI
STL’s easier to digest cousin: Symbolic Automata1,2
23/30
[1] Anand Balakrishnan, Stefan Jakšić, Edgar A. Aguilar, Dejan Ničković, Jyotirmoy V. Deshmukh, Model-Free Reinforcement Learning for Symbolic Automata-encoded Objectives, to appear in the Proc. of CDC, 2023
[2] Jakšić, Stefan, Ezio Bartocci, Radu Grosu, Thang Nguyen, and Dejan Ničković. "Quantitative monitoring of STL with edit distance." Formal methods in system design 53 (2018): 83-112.
CDC Workshop: Formal Methods and Decision-making in the age of AI
Reward shaping with symbolic automaton objectives
Main ideas1:
Theorem [1]:
Optimal policy found by an RL algorithm w.r.t. symbolic potential-based reward functions maximizes the probability of satisfying the automaton objective*
Shameless plug 2: Talk on Friday (FrA23.2)
* The automaton must have a terminally accepting state, e.g., any finitely bounded horizon STL specification
24/30
[1] Anand Balakrishnan, Stefan Jakšić, Edgar A. Aguilar, Dejan Ničković, Jyotirmoy V. Deshmukh, Model-Free Reinforcement Learning for Symbolic Automata-encoded Objectives, CDC, 2023
CDC Workshop: Formal Methods and Decision-making in the age of AI
LfD + STL
25/30
STL specs + demos allows generalization over continuous and stochastic spaces while reducing sample complexity and automatically inferring rewards
A. G. Puranic, J. V. Deshmukh, and S. Nikolaidis. Learning from demonstrations using signal temporal logic. Conference on Robot Learning. (Oral Presentation at CORL 2021) PMLR, 2021.
A. G. Puranic, J. V. Deshmukh, and S. Nikolaidis, (2021). Learning from demonstrations using signal temporal logic in stochastic and continuous domains. IEEE Robotics and Automation Letters, 6(4), 6250-6257. (Oral Presentation at IROS 202)
A. G. Puranic , J. V. Deshmukh, and S. Nikolaidis. "Learning Performance Graphs From Demonstrations via Task-Based Evaluations." IEEE Robotics and Automation Letters 8, no. 1 (2022): 336-343. (To be presented at ICRA 2023)
CDC Workshop: Formal Methods and Decision-making in the age of AI
Meditation on results
26/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Ushering a neuro-symbolic era for safe autonomy?
27/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Confluence of logic and learning: will it be a happily ever after?
28/30
CDC Workshop: Formal Methods and Decision-making in the age of AI
Credits
CDC Workshop: Formal Methods and Decision-making in the age of AI
Thank you!
30/30
If a person is detected with a confidence score greater than or equal to c1 in a particular frame, then in the next 4 frames, the probability of detecting the same person should never drop below c2.
Some post-credits scenes
CDC Workshop: Formal Methods and Decision-making in the age of AI