1 of 23

Faithful Logical Reasoning via

Symbolic Chain-of-Thought

Jundong Xu, Hao Fei, Liangming Pan,

Qian Liu, Mong-Li Lee, Wynne Hsu

National University of Singapore, University of California,

University of Auckland

Accepted by ACL2024

2 of 23

2

Abstract

3 of 23

3

4 of 23

4

Evaluation

  • 5 standard datasets (PrOntoQA, ProofWriter, FOLIO, LogicalDeduction, AR-LSAT )
    • First-Order Logic (PrOntoQA, ProofWriter, FOLIO )
      • a formal system that uses predicates and quantifiers to express relationships between objects.
    • Constraint Optimization symbolic expressions (LogicalDeduction, AR-LSAT )
      • find optimal solutions by manipulating symbolic variables subject to defined constraints.
  • SymbCoT better than CoT

5 of 23

5

Introduction

6 of 23

6

7 of 23

7

SymbCoT for

Symbolic Reasoning

8 of 23

8

9 of 23

9

Prompt

+

Question

+

Premisse

First-order logic format

10 of 23

10

Prompt

+

Translator's output

Step-by-step

solution plan

11 of 23

11

Prompt

+

Translator's output

+

Planner's output

Step-by-step reasoning process

+

Final conclusion

12 of 23

12

Prompt

+

Original Q & context

+

Component’s I/O

Component results

+

Issue

+

Final Status

13 of 23

13

Datasets:

  • Five standard datasets: PrOntoQA, ProofWriter, FOLIO, LogicalDeduction, AR-LSAT.
  • Symbolic Structures: First-Order Logic (FOL) and Constraint Optimization (CO).

Baselines:

  • Naive Prompting, CoT, Logic-LM (GPT-3.5 and GPT-4).
  • CoT-SC, ToT, CR, DetermLR (GPT-4).

Metrics:

  • Accuracy (multiple-choice correctness).

Experiments

14 of 23

14

Results

  • Overall Performance:
    • SymbCoT significantly outperforms Naive, CoT, and Logic-LM baselines.
    • Demonstrates general versatility in different symbolic reasoning expressions.

15 of 23

15

Results

  • Ablation Study :

16 of 23

16

Analysis and Discussion

  • Reasoning Depth:
    • SymbCoT's improvement over CoT becomes more pronounced with increasing reasoning depth.

17 of 23

17

Analysis and Discussion

  • Robustness to Symbolic Syntax Errors:
    • SymbCoT achieves a high execution success rate compared to methods relying on external resolvers like Logic-LM.

18 of 23

18

Analysis and Discussion

  • Benefits of Hybrid Expression (Figure 6):
    • SymbCoT reduces errors caused by information loss and inaccurate translations by cross-referencing symbolic and natural language data.

19 of 23

19

Analysis and Discussion

  • Reasoning Faithfulness:
    • SymbCoT ensures credible, symbolic-based reasoning and reduces reliance on chance.

20 of 23

20

Analysis and Discussion

  • Impact of LLM Scale (Figure 8):
    • Performance gains are more significant when upgrading from GPT-3.5 to GPT-4.

21 of 23

21

Conclusion

Summary:

    • SymbCoT improves logical reasoning by integrating symbolic expressions and logical rules with CoT prompting.
    • Enhances vanilla CoT on logical reasoning with both FOL and CO symbolic expressions.

Significance:

    • Advances in faithfulness, flexibility, and explainability of logical reasoning.

22 of 23

22

Future Directions

Their perspective:

    • Combining SymbCoT with external solvers to leverage complementary strengths.
    • Evaluating more symbolic structures to ensure comprehensive evaluation.
    • Optimizing the framework's efficiency to reduce implementation costs.

My perspective:

    • Agent Design & Communication
    • Empowering Smaller LLMs
    • Integration with Reasoning Models(O-series, R1, S1, etc.)

23 of 23

Thanks for listening