1 of 84

Motivating coherence-driven inference via sheaves

Steve Huntsman

CC-BY-SA, by Ché Lydia Xyang (Cybergothiché)

2 of 84

Disclaimers

Presenting here in my personal capacity

“Argument” examples (incl. code) here done in my personal capacity

Much is joint work with Michael Robinson, Ludmilla Huntsman; Jewell Thomas: thanks to many others (see refs)

2

3 of 84

Concept

3

4 of 84

LLMs evaluate consistency of propositions

5 of 84

LLMs evaluate consistency of propositions

Lack of hallucinations presumably due to interpolation vs extrapolation

6 of 84

Ironically, LLMs can promote information integrity

  • Original goal: use LLMs in a principled way to automatically handle {mis, dis}information and bullshit
    • Frankfurt: bullshit is an attempt to manipulate opinions and attitudes with indifference to truth or falsity
    • Brandolini’s law: “the amount of energy needed to refute bullshit is an order of magnitude bigger than that needed to produce it”

“When LLMs get things wrong they aren't hallucinating. They are bullshitting.”

“Bullshit is a greater enemy of the truth than lies.”

Lack of hallucinations presumably due to interpolation vs extrapolation

7 of 84

MAX-SAT evaluates coherence of propositions

  • First step: LLMs evaluate local relatedness/consistency of propositions
  • Next step: lifting local data for judgments about global data
    • Natural to use a sheaf: consistency is local; coherence is global

8 of 84

MAX-SAT evaluates coherence of propositions

  • First step: LLMs evaluate local relatedness/consistency of propositions
  • Next step: lifting local data for judgments about global data
    • Natural to use a sheaf: consistency is local; coherence is global
  • Key insight: asking logical questions about propositions ⇒ satisfiability
    • Example: CNF formula for ≥ 1 true and ≥ 1 false proposition (n-lemma) is

(p1p2 ∨ … ∨ pn) ∧ (¬p1 ∨ ¬p2 ∨ … ∨ ¬pn)

9 of 84

MAX-SAT evaluates coherence of propositions

  • First step: LLMs evaluate local relatedness/consistency of propositions
  • Next step: lifting local data for judgments about global data
    • Natural to use a sheaf: consistency is local; coherence is global
  • Key insight: asking logical questions about propositions ⇒ satisfiability
    • Example: CNF formula for ≥ 1 true and ≥ 1 false proposition (n-lemma) is

(p1p2 ∨ … ∨ pn) ∧ (¬p1 ∨ ¬p2 ∨ … ∨ ¬pn)

  • MAX-SAT gives a general framework for inference
    • Start with AND of weighted formulas encoding local (in)consistencies
    • Convert to weighted CNF required for solvers (highly nontrivial!)
    • Produce optimal assignment of truth values using a solver
    • Near-optimal assignments are also of interest as in Rashomon

10 of 84

MAX-SAT evaluates coherence of propositions

  • First step: LLMs evaluate local relatedness/consistency of propositions
  • Next step: lifting local data for judgments about global data
    • Natural to use a sheaf: consistency is local; coherence is global
  • Key insight: asking logical questions about propositions ⇒ satisfiability
    • Example: CNF formula for ≥ 1 true and ≥ 1 false proposition (n-lemma) is

(p1p2 ∨ … ∨ pn) ∧ (¬p1 ∨ ¬p2 ∨ … ∨ ¬pn)

  • MAX-SAT gives a general framework for inference
    • Start with AND of weighted formulas encoding local (in)consistencies
    • Convert to weighted CNF required for solvers (highly nontrivial!)
    • Produce optimal assignment of truth values using a solver
    • Near-optimal assignments are also of interest as in Rashomon

This is a plausible instantiation of a coherence theory of truth

11 of 84

MAX-SAT evaluates coherence of propositions

  • First step: LLMs evaluate local relatedness/consistency of propositions
  • Next step: lifting local data for judgments about global data
    • Natural to use a sheaf: consistency is local; coherence is global
  • Key insight: asking logical questions about propositions ⇒ satisfiability
    • Example: CNF formula for ≥ 1 true and ≥ 1 false proposition (n-lemma) is

(p1p2 ∨ … ∨ pn) ∧ (¬p1 ∨ ¬p2 ∨ … ∨ ¬pn)

  • MAX-SAT gives a general framework for inference
    • Start with AND of weighted formulas encoding local (in)consistencies
    • Convert to weighted CNF required for solvers (highly nontrivial!)
    • Produce optimal assignment of truth values using a solver
    • Near-optimal assignments are also of interest as in Rashomon

This is a plausible instantiation of a coherence theory of truth

“Clamping” yields a correspondence theory

12 of 84

Motivation from sheaves

12

13 of 84

Topological sheaves are simple and useful

  • Local data (“sections”) over open sets collectively define a sheaf if
    • Local sections defined over any open set can be consistently restricted to smaller open sets
    • If local sections agree over open sets, then each is a restriction of a common section over the union
  • Recent uses are morally quite similar to ours
    • Modeling opinion expression, lying, consensus (Ghrist, Hansen, Riess, …)
    • Computing maximal consistent data (Praggastis, Ambrose et al., …)
    • Measuring consistency of local data in sheaves (Robinson, …)

Our use is basically implicit but very powerful

14 of 84

Finite topological sheaves are combinatorial

arXiv:1509.02975 gives a formula for the discrete cyclic case

There are many global sections associated with “untethered” local data

These sections form an intricate subsheaf of a constant sheaf

15 of 84

Finite topological sheaves are combinatorial

arXiv:1509.02975 gives a formula for the discrete cyclic case

There are many global sections associated with “untethered” local data

These sections form an intricate subsheaf of a constant sheaf

Exercise: detail in terms of sheaf cohomology

16 of 84

Finite topological sheaves are combinatorial

There are many global sections associated with “untethered” local data

These sections form an intricate subsheaf of a constant sheaf

17 of 84

Finite topological sheaves are combinatorial

There are many global sections associated with “untethered” local data

These sections form an intricate subsheaf of a constant sheaf

18 of 84

Sheaves inform satisfiability problems and CDI

If local (in)consistencies can be encoded by logical formulas then global coherence can be encoded by AND of weighted local formulas

Meanwhile, the premise is trivial: recall the CNF n-lemma formula

(p1p2 ∨ … ∨ pn) ∧ (¬p1 ∨ ¬p2 ∨ … ∨ ¬pn)

Coherence-driven inference (CDI) ≡ solving weighted MAX-SAT

19 of 84

Sheaves inform satisfiability problems and CDI

  • Sections solutions to constraints
    • Srinivas (1993): “the solutions to a constraint satisfaction problem form a sheaf: any consistent assignment must be assembled from consistent parts. Constraint satisfaction algorithms search for consistent assignments of values to variables.” https://apps.dtic.mil/sti/citations/ADA272724

20 of 84

Sheaves inform satisfiability problems and CDI

  • Sections solutions to constraints
    • Srinivas (1993): “the solutions to a constraint satisfaction problem form a sheaf: any consistent assignment must be assembled from consistent parts. Constraint satisfaction algorithms search for consistent assignments of values to variables.” https://apps.dtic.mil/sti/citations/ADA272724
  • Sheaf cohomology counting obstructions to gluing local sections
    • #SAT ≡ a certain computation of H1 DOI:10.1006/jsco.1998.0261

21 of 84

  • Problems in propositional logic can be efficiently transformed into SAT
    • A recently developed transformation enables modern MAX-SAT solvers
    • Also enables reasoning about > 2 propositions at a time (trilemmas!)

Sheaves inform satisfiability problems and CDI

22 of 84

  • Problems in propositional logic can be efficiently transformed into SAT
    • A recently developed transformation enables modern MAX-SAT solvers
    • Also enables reasoning about > 2 propositions at a time (trilemmas!)
  • Classical CDI is a special case
    • Local consistency of proposition pairs compiles to IFF
    • Local inconsistency of proposition pairs compiles to XOR
    • Express gradations with numerical weights
    • Combine local pair/edge formulae into a global graph formula using AND
    • Only reasons about two propositions at a time: no native trilemma handling

Sheaves inform satisfiability problems and CDI

23 of 84

  • Problems in propositional logic can be efficiently transformed into SAT
    • A recently developed transformation enables modern MAX-SAT solvers
    • Also enables reasoning about > 2 propositions at a time (trilemmas!)
  • Classical CDI is a special case
    • Local consistency of proposition pairs compiles to IFF
    • Local inconsistency of proposition pairs compiles to XOR
    • Express gradations with numerical weights
    • Combine local pair/edge formulae into a global graph formula using AND
    • Only reasons about two propositions at a time: no native trilemma handling
  • Equivalent formulations of classical CDI:
    • MAX-CUT is the formulation in literature
    • MAX-2-XOR-SAT is more natural

Sheaves inform satisfiability problems and CDI

24 of 84

Abstract simplicial complex corresponding to the SAT instance

(w ∨ ¬x) ∧ (wy)

∧ (x ∨ ¬y) ∧ (xy ∨ ¬z)

We use cellular sheaves

Adapted from Rosiak’s book

^ implicitly

25 of 84

Abstract simplicial complex corresponding to the SAT instance

(w ∨ ¬x) ∧ (wy)

∧ (x ∨ ¬y) ∧ (xy ∨ ¬z)

Inclusion of simplices defines a poset and (Alexandrov) topology: open sets are unions of “up-sets” such as y, which is highlighted

We use cellular sheaves

Adapted from Rosiak’s book

^ implicitly

26 of 84

Abstract simplicial complex corresponding to the SAT instance

(w ∨ ¬x) ∧ (wy)

∧ (x ∨ ¬y) ∧ (xy ∨ ¬z)

Inclusion of simplices defines a poset and (Alexandrov) topology: open sets are unions of “up-sets” such as y, which is highlighted

Redrawing poset with omitted indicates how open sets encode a sensible notion of locality in the simplicial complex

We use cellular sheaves

Adapted from Rosiak’s book

^ implicitly

27 of 84

Abstract simplicial complex corresponding to the SAT instance

(w ∨ ¬x) ∧ (wy)

∧ (x ∨ ¬y) ∧ (xy ∨ ¬z)

Inclusion of simplices defines a poset and (Alexandrov) topology: open sets are unions of “up-sets” such as y, which is highlighted

Redrawing poset with omitted indicates how open sets encode a sensible notion of locality in the simplicial complex

We use cellular sheaves

TL;DR – nothing fancy here, just finite topological sheaves

Adapted from Rosiak’s book

^ implicitly

28 of 84

We generalized classical CDI

  • Classical CDI is an approach to autonomous, explainable, and ethical decision-making
  • We were ignorant of this preexisting work until reviewing legal literature featuring words like “consistency” and “coherence” revealed a link
    • We thought attention + LLMs could compile weighted MAX-SAT instances
    • This turns out to be very finicky
    • Easier to just ask the LLM several times to produce a weighted graph

^ unknowingly

29 of 84

We generalized classical CDI

  • Classical CDI is an approach to autonomous, explainable, and ethical decision-making
  • We were ignorant of this preexisting work until reviewing legal literature featuring words like “consistency” and “coherence” revealed a link
    • We thought attention + LLMs could compile weighted MAX-SAT instances
    • This turns out to be very finicky
    • Easier to just ask the LLM several times to produce a weighted graph
  • LLM + CDI = versatile neurosymbolic AI
    • LLM accurately and reproducibly compiles local information into a natural data structure (fast/system 1 reasoning)
    • CDI computes maximally coherent substructure to enable useful forms of machine cognition (slow/system 2)

^ unknowingly

30 of 84

Related work focusing on sheaves and CDI

  • Consistency of language
    • Zadrozny and Garbayo: arXiv:1801.09036
    • Wang and Sadrzadeh: DOI:10.1098/rsta.2023.0013
  • Decision making
    • Tohmé: DOI:10.1080/03081079.2017.1355913
  • CDI
    • Extensive literature inaugurated by Thagard
  • In ignorance of CDI
    • Robinson, Ludmilla Huntsman, and SH: arXiv:2401.16713
  • With awareness of CDI
    • Thomas and SH: arXiv:2502.13953
    • Robinson, Ludmilla Huntsman, and SH: DOI:10.31235/osf.io/4tjz5_v1

31 of 84

Classical CDI in detail

31

32 of 84

Datum = vertex in a graph

Data consistency = edge weights

Accepted vs. rejected data = graph partition

Coherence(partition) = -∑(crossing weights)

Maximizing coherence = computing a maximum cut

LLMs can accurately and reproducibly construct coherence graphs

Coherence is a combinatorial problem

c

b

a

33 of 84

Datum = vertex in a graph

Data consistency = edge weights

Accepted vs. rejected data = graph partition

Coherence(partition) = -∑(crossing weights)

Maximizing coherence = computing a maximum cut

LLMs can accurately and reproducibly construct coherence graphs

Coherence is a combinatorial problem

c

b

a

-1

-1

+1

34 of 84

Datum = vertex in a graph

Data consistency = edge weights

Accepted vs. rejected data = graph partition

Coherence(partition) = -∑(crossing weights)

Maximizing coherence = computing a maximum cut

LLMs can accurately and reproducibly construct coherence graphs

Coherence is a combinatorial problem

c

b

a

-1

-1

+1

35 of 84

Datum = vertex in a graph

Data consistency = edge weights

Accepted vs. rejected data = graph partition

Coherence(partition) = -∑(crossing weights)

Maximizing coherence = computing a maximum cut

LLMs can accurately and reproducibly construct coherence graphs

Coherence is a combinatorial problem

c

b

a

-1

-1

+1

Coherence = -(+1 + -1) = 0

36 of 84

Datum = vertex in a graph

Data consistency = edge weights

Accepted vs. rejected data = graph partition

Coherence(partition) = -∑(crossing weights)

Maximizing coherence = computing a maximum cut

LLMs can accurately and reproducibly construct coherence graphs

Coherence is a combinatorial problem

c

b

a

-1

-1

+1

Coherence = -(-1 + -1) = 2

37 of 84

Datum = vertex in a graph

Data consistency = edge weights

Accepted vs. rejected data = graph partition

Coherence(partition) = -∑(crossing weights)

Maximizing coherence = computing a maximum cut

LLMs can accurately and reproducibly construct coherence graphs

Coherence is a combinatorial problem

c

b

a

-1

-1

+1

Coherence = -(-1 + -1) = 2

38 of 84

Example of perfect coherence graph reconstruction with o1-mini from arXiv:2502.13953

  • Reconstruct graphs w/ various LLMs
    • Scale: largest examples in literature
  • Several models reconstruct sparse coherence graphs with high fidelity from a single prompt
    • Taking medians and monitoring convergence improves matters

An algorithmic benchmark validates results

39 of 84

Example of perfect coherence graph reconstruction with o1-mini from arXiv:2502.13953

  • Reconstruct graphs w/ various LLMs
    • Scale: largest examples in literature
  • Several models reconstruct sparse coherence graphs with high fidelity from a single prompt
    • Taking medians and monitoring convergence improves matters

An algorithmic benchmark validates results

CDI is robust to perturbations that are good cut sparsifications

40 of 84

Example of perfect coherence graph reconstruction with o1-mini from arXiv:2502.13953

  • Reconstruct graphs w/ various LLMs
    • Scale: largest examples in literature
  • Several models reconstruct sparse coherence graphs with high fidelity from a single prompt
    • Taking medians and monitoring convergence improves matters

An algorithmic benchmark validates results

CDI is robust to perturbations that are good cut sparsifications

Plausible that LLMs and humans construct these

41 of 84

CDI is a versatile cognitive framework

42 of 84

  • Maximizing coherence aspects of cognition
    • Data = observations ⇒ fusing/perceiving
    • Data = evidence & hypotheses ⇒ judging
    • Data = tasks & goals ⇒ planning
    • Generating/optimizing data ⇒ creating

CDI now is artisanal, not automated/scaled

43 of 84

  • Maximizing coherence aspects of cognition
    • Data = observations ⇒ fusing/perceiving
    • Data = evidence & hypotheses ⇒ judging
    • Data = tasks & goals ⇒ planning
    • Generating/optimizing data ⇒ creating
  • But at present (and since the 1990s):
    • Graph edges and weights are produced by hand
    • There are no good ways to encode trilemmas or to reason over them
    • Algorithms in common use are inextensible and resist introspection

CDI now is artisanal, not automated/scaled

44 of 84

  • Maximizing coherence aspects of cognition
    • Data = observations ⇒ fusing/perceiving
    • Data = evidence & hypotheses ⇒ judging
    • Data = tasks & goals ⇒ planning
    • Generating/optimizing data ⇒ creating
  • But at present (and since the 1990s):
    • Graph edges and weights are produced by hand
    • There are no good ways to encode trilemmas or to reason over them
    • Algorithms in common use are inextensible and resist introspection
  • Recent developments (LLMs, new algorithms, and new data structures) address these problems and enable automated CDI

CDI now is artisanal, not automated/scaled

45 of 84

Examples

45

46 of 84

Recent event motivated a small example

First presented within two weeks of the event

47 of 84

Recent event motivated a small example

First presented within two weeks of the event

Later examples fully automatic on transcripts

48 of 84

Produce a non-redundant list of the substantive claims made by the participants in the following transcript.

Can you consolidate any of these claims?

Please replace pronouns with their antecedents.

Label these propositions a-s.

GPT-4o produces a labeled list of substantive claims/ propositions in the meeting transcript

49 of 84

buildCoherence:

  1. Effective diplomacy requires engaging both Russia and Ukraine, and overt hostility toward Vladimir Putin would hinder negotiations.
  2. Donald Trump prioritizes the interests of the United States and the world, not personal alignment with Vladimir Putin.
  3. Ukraine is in a weak negotiating position and heavily dependent on military and financial support from the United States.
  4. The United States has provided significant military aid to Ukraine, without which the Ukrainian military would have lost the war quickly.
  5. A ceasefire between Ukraine and Russia should be pursued first, as a ceasefire is easier to achieve than a full peace agreement.
  6. Vladimir Putin respected Donald Trump and did not invade Ukraine during Donald Trump’s presidency, unlike during the presidencies of Barack Obama and Joe Biden.
  7. Previous United States administrations, including those led by Barack Obama and Joe Biden, were weak, which allowed Russia to act aggressively.
  8. If Ukraine does not negotiate a deal with Russia, the United States may withdraw support, leaving Ukraine to fight against Russia alone.
  9. Volodymyr Zelenskyy has not expressed sufficient gratitude for the military and financial support provided by the United States.
  10. Joe Biden’s public tough stance on Vladimir Putin has been ineffective in preventing Russia’s invasion of Ukraine.
  11. Diplomacy between the United States, Ukraine, and Russia is the best way to achieve peace.
  12. Ukraine has manpower shortages and is forcing Ukrainian citizens to fight against Russia as conscripts.
  13. Volodymyr Zelenskyy should not publicly criticize United States policy while speaking in the Oval Office.
  14. Ukraine should acknowledge military and financial support from the United States and express more gratitude.
  15. Russia has occupied Ukrainian territory, including Crimea and eastern Ukraine, since 2014 and has consistently broken agreements with Ukraine.
  16. Diplomacy between Ukraine and Russia has been attempted through ceasefires and prisoner exchanges but has repeatedly failed due to Russian violations.
  17. The war between Ukraine and Russia will eventually affect the United States, even though the United States does not currently feel its full impact.
  18. A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.
  19. Ukraine has expressed gratitude for military and financial support from the United States multiple times.

Produce a non-redundant list of the substantive claims made by the participants in the following transcript.

Can you consolidate any of these claims?

Please replace pronouns with their antecedents.

Label these propositions a-s.

GPT-4o produces a labeled list of substantive claims/ propositions in the meeting transcript

50 of 84

Imagine that you are a perfectly objective arbitrator with impeccable judgment and integrity. In response to a prompt of the form ‘‘buildCoherence: ’’ followed by a list of labeled propositions or claims, please do the following:

First, determine which pairs of propositions are substantively related.

Second, for each related pair of propositions, determine their logical relationship, assuming that at least one is true, whether or not either actually is. I want you to ignore the truth, falsity or basis in fact of either claim.

Third, based on your determination just above, numerically rate the relative consistency of the two propositions. Do not pay attention to or comment on the truth or basis in fact of either proposition independent of the other. Your rating of relative consistency should be on a scale from 0 to 10, with a value of 0 for a pair of propositions that are not at all consistent and a value of 10 for a pair of propositions that are totally consistent. I cannot emphasize enough that for your rating, I want you to ignore the truth or basis in fact of either proposition, since anything that is not consistent with reality cannot be true. If you determine that propositions are unrelated despite previously determining otherwise, omit that pair. To be clear, a pair of false but consistent claims should also be rated a 10. Meanwhile, a pair of propositions of which one is true and the other is false, should be rated a 0.

Finally, construct a networkx graph where propositions are vertices and edges correspond to substantively related pairs of propositions, with weights given by the consistency ratings just above. Only return the edge list with proposition labels for vertices. i.e., return responses in this format(here 'b', 'c', 'd', and 'e' are labels):

[(’b’, ’c’, 0),(’b’, ’e’, 10),(’c’, ’d’, 9),(’c’, ’e’, 2)]

buildCoherence:

  1. Effective diplomacy requires engaging both Russia and Ukraine, and overt hostility toward Vladimir Putin would hinder negotiations.
  2. Donald Trump prioritizes the interests of the United States and the world, not personal alignment with Vladimir Putin.
  3. Ukraine is in a weak negotiating position and heavily dependent on military and financial support from the United States.
  4. The United States has provided significant military aid to Ukraine, without which the Ukrainian military would have lost the war quickly.
  5. A ceasefire between Ukraine and Russia should be pursued first, as a ceasefire is easier to achieve than a full peace agreement.
  6. Vladimir Putin respected Donald Trump and did not invade Ukraine during Donald Trump’s presidency, unlike during the presidencies of Barack Obama and Joe Biden.
  7. Previous United States administrations, including those led by Barack Obama and Joe Biden, were weak, which allowed Russia to act aggressively.
  8. If Ukraine does not negotiate a deal with Russia, the United States may withdraw support, leaving Ukraine to fight against Russia alone.
  9. Volodymyr Zelenskyy has not expressed sufficient gratitude for the military and financial support provided by the United States.
  10. Joe Biden’s public tough stance on Vladimir Putin has been ineffective in preventing Russia’s invasion of Ukraine.
  11. Diplomacy between the United States, Ukraine, and Russia is the best way to achieve peace.
  12. Ukraine has manpower shortages and is forcing Ukrainian citizens to fight against Russia as conscripts.
  13. Volodymyr Zelenskyy should not publicly criticize United States policy while speaking in the Oval Office.
  14. Ukraine should acknowledge military and financial support from the United States and express more gratitude.
  15. Russia has occupied Ukrainian territory, including Crimea and eastern Ukraine, since 2014 and has consistently broken agreements with Ukraine.
  16. Diplomacy between Ukraine and Russia has been attempted through ceasefires and prisoner exchanges but has repeatedly failed due to Russian violations.
  17. The war between Ukraine and Russia will eventually affect the United States, even though the United States does not currently feel its full impact.
  18. A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.
  19. Ukraine has expressed gratitude for military and financial support from the United States multiple times.

Produce a non-redundant list of the substantive claims made by the participants in the following transcript.

Can you consolidate any of these claims?

Please replace pronouns with their antecedents.

Label these propositions a-s.

GPT-4o produces a labeled list of substantive claims/ propositions in the meeting transcript

GPT-o3-mini builds graphs

51 of 84

('a', 'e', 7),

('a', 'e', 10),

( 'a', 'e', 8),

( 'a', 'e', 8),

('a', 'e', 7),

8

('a', 'k', 10),

('a', 'k', 10),

( 'a', 'k', 10),

( 'a', 'k', 10),

('a', 'k', 10),

10

('a', 'p', 7),

('a', 'p', 8),

( 'a', 'p', 5),

( 'a', 'p', 7),

('a', 'p', 6),

6.67

('a', 'r', 10),

( 'a', 'r', 5),

('a', 'r', 6),

7

('b', 'f', 8),

('b', 'f', 10),

( 'b', 'f', 9),

( 'b', 'f', 8),

('b', 'f', 10),

9

('b', 'g', 7),

('b', 'g', 10),

( 'b', 'g', 9),

('b', 'g', 7),

8.25

('b', 'j', 6),

('b', 'j', 10),

( 'b', 'j', 8),

( 'b', 'j', 8),

8

('c', 'd', 10),

('c', 'd', 10),

( 'c', 'd', 10),

( 'c', 'd', 10),

('c', 'd', 10),

10

('c', 'h', 10),

('c', 'h', 10),

( 'c', 'h', 9),

( 'c', 'h', 10),

('c', 'h', 10),

9.8

('c', 'i', 8),

('c', 'i', 10),

( 'c', 'l', 8),

( 'c', 'i', 7),

('c', 'i', 8),

8.2

('c', 'l', 10),

('c', 'l', 10),

( 'c', 'l', 8),

('c', 'l', 8),

9

('c', 'n', 10),

('c', 'n', 10),

( 'c', 'n', 8),

('c', 'n', 8),

9

('c', 's', 10),

( 'c', 's', 10),

('c', 's', 8),

9.33

('d', 'h', 10),

( 'd', 'h', 9),

('d', 'h', 10),

9.67

('d', 'i', 10),

('d', 'i', 10),

( 'd', 'i', 9),

('d', 'i', 8),

9.25

('d', 'n', 10),

('d', 'n', 10),

( 'd', 'n', 10),

('d', 'n', 8),

9.5

('d', 's', 10),

('d', 's', 10),

( 'd', 's', 10),

('d', 's', 8),

9.5

('e', 'k', 7),

('e', 'k', 10),

( 'e', 'k', 9),

( 'e', 'k', 9),

('e', 'k', 8),

8.6

('e', 'p', 2),

('e', 'p', 0),

( 'e', 'p', 4),

('e', 'p', 0),

1.5

('e', 'r', 3),

('e', 'r', 0),

( 'e', 'r', 0),

( 'e', 'r', 0),

('e', 'r', 0),

0.6

('f', 'g', 10),

('f', 'g', 10),

( 'f', 'g', 10),

( 'f', 'g', 9),

('f', 'g', 10),

9.8

('f', 'j', 10),

('f', 'j', 10),

( 'f', 'j', 9),

('f', 'j', 10),

9.75

('g', 'j', 10),

('g', 'j', 10),

( 'g', 'j', 10),

( 'g', 'j', 9),

('g', 'j', 10),

9.8

('h', 'l', 8),

('h', 'i', 8),

( 'h', 'l', 8),

('h', 'i', 7),

7.75

('h', 'q', 7),

( 'h', 'q', 7),

('h', 'q', 7),

7

('i', 'm', 7),

('i', 'm', 10),

( 'i', 'm', 6),

( 'i', 'm', 8),

('i', 'm', 8),

7.8

('i', 'n', 10),

('i', 'n', 10),

( 'i', 'n', 10),

( 'i', 'n', 10),

('i', 'n', 10),

10

('i', 's', 0),

('i', 's', 0),

( 'i', 's', 0),

( 'i', 's', 0),

('i', 's', 0),

0

('k', 'p', 3),

('k', 'p', 0),

( 'k', 'p', 0),

( 'k', 'p', 0),

('k', 'p', 5),

1.6

('k', 'r', 8),

( 'k', 'r', 7),

('k', 'r', 5),

6.67

('m', 'n', 10),

('m', 'n', 10),

( 'm', 'n', 8),

( 'm', 'n', 10),

('m', 'n', 8),

9.2

('m', 's', 7),

( 'm', 's', 6),

( 'm', 's', 10),

('m', 's', 8),

7.75

('o', 'p', 10),

('o', 'p', 10),

( 'o', 'p', 10),

( 'o', 'p', 10),

('o', 'p', 10),

10

('o', 'r', 10),

('o', 'r', 10),

( 'o', 'r', 10),

( 'o', 'r', 10),

('o', 'r', 10),

10

('p', 'r', 10)

('p', 'r', 10)

( 'p', 'r', 10)

( 'p', 'r', 10)

('p', 'r', 10)

10

('a', 'e', 2),

('a', 'k', 2),

('a', 'p', 1),

('a', 'r', 1),

('b', 'f', 2),

('b', 'g', 2),

( 'b', 'j', 2),

('c', 'd', 2),

('c', 'h', 2),

('c', 'i', 2),

('c', 'l', 2),

('c', 'n', 2),

('c', 's', 2),

('d', 'h', 2),

('d', 'i', 2),

('d', 'n', 2),

('d', 's', 2),

('e', 'k', 2),

('e', 'p', -2),

('e', 'r', -2),

('f', 'g', 2),

('f', 'j', 2),

('g', 'j', 2),

('h', 'i', 1),

('h', 'q', 1),

('i', 'm', 1),

('i', 'n', 2),

('i', 's', -2),

('k', 'p', -2),

('k', 'r', 1),

('m', 'n', 2),

('m', 's', 1),

('o', 'p', 2),

('o', 'r', 2),

('p', 'r', 2)

First five columns: results from running the buildCoherence graph prompt 5x; aligning graph edges across responses, and keeping only edges that occurred most of the time

Sixth column: averaged weights (rescaled for coherence graph)

Seventh column: rescaled and quantized weights in {-2, -1, 0, 1, 2}

In practice now we use 25-50x and monitor convergence to median

52 of 84

US

presidents

UA

gratitude

War termination

War termination

Thick {blue, red} weights = {+2, -2};

thin {blue, red} weights = {+1, -1}

Focus on the most salient connected component

a

e

k

o

p

r

53 of 84

Not speculative

a. Effective diplomacy requires engaging both Russia and Ukraine, and overt hostility toward Vladimir Putin would hinder negotiations.

e. A ceasefire between Ukraine and Russia should be pursued first, as a ceasefire is easier to achieve than a full peace agreement.

k. Diplomacy between the United States, Ukraine, and Russia is the best way to achieve peace.

o. Russia has occupied Ukrainian territory, including Crimea and eastern Ukraine, since 2014 and has consistently broken agreements with Ukraine.

p. Diplomacy between Ukraine and Russia has been attempted through ceasefires and prisoner exchanges but has repeatedly failed due to Russian violations.

r. A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.

a

e

k

o

p

r

Speculative

Speculative

Arguably speculative

Speculative

Arguably speculative

54 of 84

Not speculative

a

e

k

o

p

r

Coherence({a, e, k}, {o, p, r})

= 2 + 2 + 2 - 1 - 1 - 1 = 3

Data priority weighs in favor of accepting {o, p, r} and rejecting {a, e, k}

a. Effective diplomacy requires engaging both Russia and Ukraine, and overt hostility toward Vladimir Putin would hinder negotiations.

e. A ceasefire between Ukraine and Russia should be pursued first, as a ceasefire is easier to achieve than a full peace agreement.

k. Diplomacy between the United States, Ukraine, and Russia is the best way to achieve peace.

o. Russia has occupied Ukrainian territory, including Crimea and eastern Ukraine, since 2014 and has consistently broken agreements with Ukraine.

p. Diplomacy between Ukraine and Russia has been attempted through ceasefires and prisoner exchanges but has repeatedly failed due to Russian violations.

r. A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.

Most coherent partition

55 of 84

Not speculative

a

e

k

o

p

r

Coherence({e, k}, {a, o, p, r})

= 2 + 2 + 2 - 2 - 2 - 1 = 1

Data priority weighs in favor of accepting {a, o, p, r} and rejecting {e, k}

a. Effective diplomacy requires engaging both Russia and Ukraine, and overt hostility toward Vladimir Putin would hinder negotiations.

e. A ceasefire between Ukraine and Russia should be pursued first, as a ceasefire is easier to achieve than a full peace agreement.

k. Diplomacy between the United States, Ukraine, and Russia is the best way to achieve peace.

o. Russia has occupied Ukrainian territory, including Crimea and eastern Ukraine, since 2014 and has consistently broken agreements with Ukraine.

p. Diplomacy between Ukraine and Russia has been attempted through ceasefires and prisoner exchanges but has repeatedly failed due to Russian violations.

r. A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.

56 of 84

Not speculative

a

e

k

o

p

r

Coherence({a, k}, {e, o, p, r})

= 2 - 2 - 2 - 1 - 1 - 1 = -5

Data priority weighs in favor of accepting {e, o, p, r} and rejecting {a, k}

a. Effective diplomacy requires engaging both Russia and Ukraine, and overt hostility toward Vladimir Putin would hinder negotiations.

e. A ceasefire between Ukraine and Russia should be pursued first, as a ceasefire is easier to achieve than a full peace agreement.

k. Diplomacy between the United States, Ukraine, and Russia is the best way to achieve peace.

o. Russia has occupied Ukrainian territory, including Crimea and eastern Ukraine, since 2014 and has consistently broken agreements with Ukraine.

p. Diplomacy between Ukraine and Russia has been attempted through ceasefires and prisoner exchanges but has repeatedly failed due to Russian violations.

r. A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.

57 of 84

Not speculative

a

e

k

o

p

r

Coherence({a, o}, {e, k, p, r})

= -2 - 2 - 2 - 2 - 1 - 1 = -10

Data priority weighs in favor of accepting {a, o} and rejecting {e, k, p, r}

a. Effective diplomacy requires engaging both Russia and Ukraine, and overt hostility toward Vladimir Putin would hinder negotiations.

e. A ceasefire between Ukraine and Russia should be pursued first, as a ceasefire is easier to achieve than a full peace agreement.

k. Diplomacy between the United States, Ukraine, and Russia is the best way to achieve peace.

o. Russia has occupied Ukrainian territory, including Crimea and eastern Ukraine, since 2014 and has consistently broken agreements with Ukraine.

p. Diplomacy between Ukraine and Russia has been attempted through ceasefires and prisoner exchanges but has repeatedly failed due to Russian violations.

r. A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.

Least coherent partition

58 of 84

Accepted propositions

Coherence (quantized)

Coherence (real)

Symmetry between acceptance/rejection can be broken by prioritizing according to provenance

(e.g., direct observation vs speculation)

Accept

CohQ

CohR

ae

-2

-0.8733

aer

-8

-3.6867

aep

-5

-2.56

aepr

-7

-3.3733

aeo

-6

-2.8733

aeor

-8

-3.6867

aeop

-5

-2.56

aeopr

-3

-1.3733

aek

3

1.1933

aekr

-1

-0.9533

aekp

-4

-1.8533

aekpr

-4

-2

aeko

-1

-0.8067

aekor

-1

-0.9533

aekop

-4

-1.8533

aekopr

0

0

Accept

CohQ

CohR

a

-6

-2.3333

ar

-8

-3.3867

ap

-5

-2.62

apr

-3

-1.6733

ao

-10

-4.3333

aor

-8

-3.3867

aop

-5

-2.62

aopr

1

0.3267

ak

-5

-1.7067

akr

-5

-2.0933

akp

-8

-3.3533

akpr

-4

-1.74

ako

-9

-3.7067

akor

-5

-2.0933

akop

-8

-3.3533

akopr

0

0.26

Accept

CohQ

CohR

e

0

0.26

er

-8

-3.3533

ep

-5

-2.0933

epr

-9

-3.7067

eo

-4

-1.74

eor

-8

-3.3533

eop

-5

-2.0933

eopr

-5

-1.7067

ek

1

0.3267

ekr

-5

-2.62

ekp

-8

-3.3867

ekpr

-10

-4.3333

eko

-3

-1.6733

ekor

-5

-2.62

ekop

-8

-3.3867

ekopr

-6

-2.3333

Accept

CohQ

CohR

_

0

0

r

-4

-1.8533

p

-1

-0.9533

pr

-1

-0.8067

o

-4

-2

or

-4

-1.8533

op

-1

-0.9533

opr

3

1.1933

k

-3

-1.3733

kr

-5

-2.56

kp

-8

-3.6867

kpr

-6

-2.8733

ko

-7

-3.3733

kor

-5

-2.56

kop

-8

-3.6867

kopr

-2

-0.8733

59 of 84

Accepted propositions

Coherence (quantized)

Coherence (real)

Symmetry between acceptance/rejection can be broken by prioritizing according to provenance

(e.g., direct observation vs speculation)

Accept

CohQ

CohR

ae

-2

-0.8733

aer

-8

-3.6867

aep

-5

-2.56

aepr

-7

-3.3733

aeo

-6

-2.8733

aeor

-8

-3.6867

aeop

-5

-2.56

aeopr

-3

-1.3733

aek

3

1.1933

aekr

-1

-0.9533

aekp

-4

-1.8533

aekpr

-4

-2

aeko

-1

-0.8067

aekor

-1

-0.9533

aekop

-4

-1.8533

aekopr

0

0

Accept

CohQ

CohR

a

-6

-2.3333

ar

-8

-3.3867

ap

-5

-2.62

apr

-3

-1.6733

ao

-10

-4.3333

aor

-8

-3.3867

aop

-5

-2.62

aopr

1

0.3267

ak

-5

-1.7067

akr

-5

-2.0933

akp

-8

-3.3533

akpr

-4

-1.74

ako

-9

-3.7067

akor

-5

-2.0933

akop

-8

-3.3533

akopr

0

0.26

Accept

CohQ

CohR

e

0

0.26

er

-8

-3.3533

ep

-5

-2.0933

epr

-9

-3.7067

eo

-4

-1.74

eor

-8

-3.3533

eop

-5

-2.0933

eopr

-5

-1.7067

ek

1

0.3267

ekr

-5

-2.62

ekp

-8

-3.3867

ekpr

-10

-4.3333

eko

-3

-1.6733

ekor

-5

-2.62

ekop

-8

-3.3867

ekopr

-6

-2.3333

Accept

CohQ

CohR

_

0

0

r

-4

-1.8533

p

-1

-0.9533

pr

-1

-0.8067

o

-4

-2

or

-4

-1.8533

op

-1

-0.9533

opr

3

1.1933

k

-3

-1.3733

kr

-5

-2.56

kp

-8

-3.6867

kpr

-6

-2.8733

ko

-7

-3.3733

kor

-5

-2.56

kop

-8

-3.6867

kopr

-2

-0.8733

In this example, coherence-driven inference leads to accepting propositions {o, p, r} and rejecting propositions {a, e, k}. Recall that r is:

A ceasefire between Ukraine and Russia without guarantees is meaningless because Russia has a history of breaking agreements with Ukraine.

60 of 84

CDI informs interpretation of argument narratives

61 of 84

The Melian Dialogue

Propositions extracted with GPT-4o

# Athenians

- p1: Right is determined by power; the strong dominate, and the weak endure.

- p2: Internal rebellion is more concerning than external threats.

- p3: Preserving the Athenian empire and Melos's submission is beneficial.

- p4: Neutrality from Melos is unacceptable as it would show Athenian weakness.

- p5: Subjugating Melos would strengthen Athenian security and the Athenian empire.

- p6: Islanders like Melos are more threatening than continentals.

- p7: The conflict with Melos is about survival, not honor.

- p8: Hope is unreliable and can lead to ruin.

- p9: Athenian actions align with the natural order of ruling where possible.

- p10: Lacedaemonian support for Melos is doubtful due to the Lacedaemonians' cautious nature.

- p11: Submitting to Athens is a secure and honorable choice for Melos.

# Melians

- p12: Fairness and justice are crucial in conflict.

- p13: Destroying Melos would set a dangerous precedent for Athens.

- p14: Neutrality is preferable to choosing sides.

- p15: Attacking neutrals could create more enemies for Athens.

- p16: Trying all options before submitting is important.

- p17: War's unpredictability offers hope.

- p18: Divine support exists for the just cause.

- p19: Lacedaemonian aid is expected due to kinship and honor.

- p20: Proximity to Peloponnese makes Lacedaemonian help more feasible.

o1-mini separates key Athenian and Melian views; CDI suggests focusing on p16 to delay conflict

Athens

Melos

Using transparency vs thickness

62 of 84

Inherit the Wind

Act II, Scene 2

propositions extracted with GPT-4o

# Drummond

- p1: Expert testimony on evolution should be included in the trial of Bertram Cates.

- p2: Understanding evolution is essential for the jury's judgment in the trial.

- p3: The exclusion of scientific experts from the trial is wrong.

- p4: Bertram Cates' teachings on evolution are not criminal.

- p5: Matthew Harrison Brady's literal interpretation of the Bible is flawed.

- p6: Reason and scientific progress are more important than strict adherence to religious texts.

- p7: The Bible is not the sole source of truth and should not be the only reference in matters of education and law.

# Brady

- p8: Testimony on evolution should not be included in the trial of Bertram Cates.

- p9: The law excludes evolution from being taught in schools.

- p10: Brady is an authority on the Bible but has not read Charles Darwin's "Origin of Species."

- p11: The Bible's literal truth is valid, and God can change natural law as described in the Bible.

- p12: Scientific evidence that contradicts the Bible's teachings is invalid.

- p13: Faith in the Bible is more important than scientific reasoning and evidence.

- p14: God guides Brady's actions, and opposing Brady is akin to opposing God.

CDI separates arguments and identifies the pivot for winning

Drummond

Brady

63 of 84

Inherit the Wind

Inherit the Wind Act II, Scene 2

propositions extracted with GPT-4o

# Drummond

- p1: Expert testimony on evolution should be included in the trial of Bertram Cates.

- p2: Understanding evolution is essential for the jury's judgment in the trial.

- p3: The exclusion of scientific experts from the trial is wrong.

- p4: Bertram Cates' teachings on evolution are not criminal.

- p5: Matthew Harrison Brady's literal interpretation of the Bible is flawed.

- p6: Reason and scientific progress are more important than strict adherence to religious texts.

- p7: The Bible is not the sole source of truth and should not be the only reference in matters of education and law.

# Brady

- p8: Testimony on evolution should not be included in the trial of Bertram Cates.

- p9: The law excludes evolution from being taught in schools.

- p10: Brady is an authority on the Bible but has not read Charles Darwin's "Origin of Species."

- p11: The Bible's literal truth is valid, and God can change natural law as described in the Bible.

- p12: Scientific evidence that contradicts the Bible's teachings is invalid.

- p13: Faith in the Bible is more important than scientific reasoning and evidence.

- p14: God guides Brady's actions, and opposing Brady is akin to opposing God.

CDI separates arguments and identifies the pivot for winning

Drummond

Brady

DRUMMOND

(Scowling)

In other words, the court rules out any expert testimony on Charles Darwin’s Origin of Species or Descent of Man?

JUDGE

The court so rules.

(DRUMMOND is flabbergasted. His case is cooked and he knows it. He looks around helplessly.)

DRUMMOND

(There’s the glint of an idea in his eye.)

Would the court admit expert testimony regarding a book known as the Holy Bible?

JUDGE

(Hesitates, turns to BRADY)

Any objection, Colonel Brady?

BRADY

If the counsel can advance the case of the defendant through the use of the Holy Scriptures, the prosecution will take no exception!

DRUMMOND

Good! (With relish) I call to the stand one of the world’s foremost experts on the Bible and its teachings – Matthew Harrison Brady!

(There is an uproar in the courtroom. The JUDGE raps for order.)

64 of 84

Brown v. Board of Education

# Robert L. Carter:

- p1: The Kansas statute allowing segregated schools violates the Fourteenth Amendment.

- p4: The appellants must attend segregated elementary schools because of their race.

- p5: Segregation denies equal educational opportunities and harms the development of Negro children.

- p8: Segregation makes educational opportunities for Negro children inferior to those for white children.

- p9: There is no difference in physical facilities between schools for Negro children and schools for white children.

- p10: Segregation itself is unconstitutional under the Fourteenth Amendment.

- p12: Plessy v. Ferguson does not apply to education.

# Paul Wilson:

- p15: There is no substantial inequality in educational facilities between schools for Negro children and schools for white children.

- p16: The "separate but equal" doctrine may not be valid.

- p21: Segregation is detrimental to Negro children but legally insignificant.

*** MAXIMUM CUT IS HERE ***

# Robert L. Carter:

- p2: The Kansas statute permits segregation in elementary schools, and in high schools only in Kansas City.

- p3: Kansas law prohibits racial distinctions in public schools without the Kansas statute.

- p6: Segregation complies with Kansas state law.

- p7: Kansas can impose racial distinctions if educational facilities are equal.

- p11: Plessy v. Ferguson and Gong Lum v. Rice require upholding the Kansas statute.

# Paul Wilson:

- p13: The Kansas statute allows, but does not require, segregation in cities with populations over 15,000.

- p14: The statute is constitutional, supported by state and federal court decisions.

- p17: Kansas has a small Negro population, less than four percent, mostly in urban areas.

- p18: Segregated schools exist in only nine cities in Kansas.

- p19: The Kansas statute was designed to let communities adjust to local conditions.

- p20: The Kansas Supreme Court has upheld the statute's constitutionality.

- p22: The appellants have not shown specific harm from attending segregated schools.

CDI finds a resolution that incorporates losing counsel’s own arguments (as did the Court)

Carter

Wilson

65 of 84

Brown v. Board of Education

CDI finds a resolution that incorporates losing counsel’s own arguments (as did the Court)

Carter

Wilson

Robert L. Carter:

- p1: The Kansas statute allowing segregated schools violates the Fourteenth Amendment.

- p4: The appellants must attend segregated elementary schools because of their race.

- p5: Segregation denies equal educational opportunities and harms the development of Negro children.

- p8: Segregation makes educational opportunities for Negro children inferior to those for white children.

- p9: There is no difference in physical facilities between schools for Negro children and schools for white children.

- p10: Segregation itself is unconstitutional under the Fourteenth Amendment.

- p12: Plessy v. Ferguson does not apply to education.

Paul Wilson:

- p15: There is no substantial inequality in educational facilities between schools for Negro children and schools for white children.

- p16: The "separate but equal" doctrine may not be valid.

- p21: Segregation is detrimental to Negro children but legally insignificant.

*** MAXIMUM CUT IS HERE ***

Robert L. Carter:

- p2: The Kansas statute permits segregation in elementary schools, and in high schools only in Kansas City.

- p3: Kansas law prohibits racial distinctions in public schools without the Kansas statute.

- p6: Segregation complies with Kansas state law.

- p7: Kansas can impose racial distinctions if educational facilities are equal.

- p11: Plessy v. Ferguson and Gong Lum v. Rice require upholding the Kansas statute.

Paul Wilson:

- p13: The Kansas statute allows, but does not require, segregation in cities with populations over 15,000.

- p14: The statute is constitutional, supported by state and federal court decisions.

- p17: Kansas has a small Negro population, less than four percent, mostly in urban areas.

- p18: Segregated schools exist in only nine cities in Kansas.

- p19: The Kansas statute was designed to let communities adjust to local conditions.

- p20: The Kansas Supreme Court has upheld the statute's constitutionality.

- p22: The appellants have not shown specific harm from attending segregated schools.

- p15: There is no substantial inequality in educational facilities between schools for Negro children and schools for white children.

- p16: The "separate but equal" doctrine may not be valid.

- p21: Segregation is detrimental to Negro children but legally insignificant.

Plessy doctrine: “separate but equal”

66 of 84

Conclusion

66

67 of 84

CDI enables autonomy that is:

  • Explainable
    • Cuts are readily interpretable; approximately optimal cuts can be computed and compared
  • Ethical
    • Stipulating ethical guidelines as propositions that must be accepted gives “laws of inference”
  • Reproducible and stable
    • Medians of LLM-generated coherence graphs converge usefully (or give warning indicators)
  • Versatile
    • Any sufficiently capable multimodal model can be employed
  • Capable of handling abstraction
    • CDI with LLMs operates directly over human-oriented representations
  • Capable of handling ambiguity
    • Inference amounts to resolving ambiguity
  • Based on a flexible cognitive model grounded in decades of research
    • There are deep connections with practical psychology (law, philosophy of science, etc.)
  • Generalizable by construction
    • Mathematical considerations of sheaf theory led to an independent, general formulation

68 of 84

The whole is damaged if you cut away anything—anything at all—from its continuity and its coherence.

–Marcus Aurelius

Adjoint:

The whole is improved if you cut away anything—anything at all—from its discontinuity and its incoherence.

CC-BY-SA, by Ché Lydia Xyang (Cybergothiché)

69 of 84

Backup

70 of 84

Human weight manifestly flawed

Human weight manifestly flawed

LLMs can outperform humans in a key task

Good reason for ChatGPT weight that human apparently missed

Good reason for ChatGPT weight that human apparently missed

◼ChatGPT 4

◼Human

Proposition pair consistency

  • Few-shot learning and background info in LLM context window give statistical reproducibility
    • Median is robust and sensible
  • Coherence graph weights gauge consistency
  • ChatGPT weights and explanations in case study below manifestly improve on human

71 of 84

Encoding rules-based ethics → lethal strike is unacceptable

Coherence-driven agents can act ethically

Encoding ends-based ethics → lethal strike is unacceptable

Adapted from [Siv19]

  • Coherence similar to reflective equilibrium “decision procedure”
  • Explainable actions
    • In vetted bounds: “Laws of Robotics”
  • Stability analysis feasible via series of LP relaxations
    • Potential program metric
  • No need for procedural rules

72 of 84

Technical hypotheses:

Applying new infrastructure*, data/structures**, and algorithms*** to coherence will enable autonomous, explainable, & ethical decision making

Quantify w/ attention mechanism & LLM

Sparse weighted CNF-SAT formula

MAX-SAT, useful LP approximation, …

Current information

Quantify local relevance; consistency

Sparse weighted global data structure

Compute coherence partition

Coherent information accepted as true

Incorporate new information

Manual determination of sparsity; weights

Sparse weighted

graph (2-way relation)

Ad hoc heuristic; semidefinite program

vs.

e.g.

vs.

e.g.

vs.

e.g.

vs.

e.g.

Conceptual approach

Current instantiations

Our direction

*

**

***

Advances in infrastructure, data/structures, and algorithms enabled the renaissance of neural networks…the same will be true for coherence

73 of 84

Basis of confidence: case study

[Fog88]

74 of 84

We re-re-examine the downing of Iran Air 655

75 of 84

We re-re-examine the downing of Iran Air 655

[Tha92b]

76 of 84

LLMs can gauge local consistency

Weighted graph adapted from [Tha92b]

Normalized weights from ChatGPT 4

Red edges = -1 (maximally inconsistent)

Blue edges = +1 (maximally consistent)

E0 omitted; propositions slightly edited IAW [Fog88]

77 of 84

LLMs can gauge local consistency better

"Track 4131 was an F-14.”

"Track 4131 was taking off."

"Track 4131 intended to attack."

"Track 4131 was not responding to verbal warnings over [air distress frequencies]."

ChatGPT 4 cited navigation and comms emissions of commercial airliners

ChatGPT 4 cited technical failures & misunderstandings as plausible alternatives

"Track 4131 intended to attack."

"Track 4131 was taking off."

"No [electronic emissions were reported] from track 4131, however, F-14s can fly [without electronic emissions]."

"Track 4131 was a commercial airliner."

ChatGPT 4 improves upon manual consistency ratings

Vincennes graph edge weights

See also [Hun24]

Few-shot learning, context, & sampling yield reliable results

78 of 84

Coherence comes in several flavors

Propositions

Positive constraint

Discriminating priority

Explanatory

Hypotheses; data

Explanation; analogy

Data provenance

Analogical

Mapping hypotheses; mapping data

Functor*; natural transformation

Explanatory power

Deductive

Axioms; theorems

Entailment

Intuitiveness

Perceptual

Interpretations; data

Similarity; locality

Sensory provenance

Conceptual

Concepts

Correlation

Primitivity

Deliberative

Actions; goals

Affordance

Value

* Thagard calls this “structure” but the categorical notion of functor captures his intent more precisely. https://mathoverflow.net/a/13834/

Positive (negative) constraints are generically a certification of (in)compatibility

Discriminating priorities are generically a form of provenance

(Analogical, deductive, perceptual, conceptual, deliberative) coherence is a variant of explanatory coherence in which the role of explanation and/or analogy is played by a (mapping, entailment, similarity, correlation, affordance)

79 of 84

The Melian Dialogue

80 of 84

Imagine that you are a perfectly objective arbitrator with impeccable judgment and integrity. In response to a prompt of the form ‘‘buildCoherence: ’’ followed by a list of labeled propositions or claims, please do the following: First, determine which pairs of propositions are substantively related. Second, for each related pair of propositions, determine their logical relationship, assuming that at least one is true, whether or not either actually is. I want you to ignore the truth, falsity or basis in fact of either claim. Third, based on your determination just above, numerically rate the relative consistency of the two propositions. Do not pay attention to or comment on the truth or basis in fact of either proposition independent of the other. Your rating of relative consistency should be on a scale from 0 to 10, with a value of 0 for a pair of propositions that are not at all consistent and a value of 10 for a pair of propositions that are totally consistent. I cannot emphasize enough that for your rating, I want you to ignore the truth or basis in fact of either proposition, since anything that is not consistent with reality cannot be true. If you determine that propositions are unrelated despite previously determining otherwise, omit that pair. To be clear, a pair of false but consistent claims should also be rated a 10. Meanwhile, a pair of propositions of which one is true and the other is false, should be rated a 0. Finally, construct a networkx graph where propositions are vertices and edges correspond to substantively related pairs of propositions, with weights given by the consistency ratings just above. Only return the edge list with proposition labels for vertices. i.e., return responses in this format(here 'b', 'c', 'd', and 'e' are labels): [(’b’, ’c’, 0),(’b’, ’e’, 10),(’c’, ’d’, 9),(’c’, ’e’, 2)]. Order vertices (in edges) and edges (in the graph) lexicographically.

buildCoherence:

  1. Right is only in question between equals in power, while the strong do what they can and the weak suffer what they must.
  2. Destroying the principle of appealing to fairness and justice in danger harms both the weak and the strong, as even the strong may one day fall and need such appeals.
  3. The end of the Athenian empire, if it comes, does not frighten the Athenians as much as the risk of rebellion from subjects overthrowing their rulers.
  4. It is better for the Melians to submit before suffering the worst, and better for the Athenians to gain without destroying them.
  5. The Melians remaining neutral would weaken Athenian control by encouraging subjects to see Athens as weak.
  6. Independent states remain so only because they are strong, and Athens refrains from attacking them only out of fear.
  7. Subjugating the Melians increases Athenian security, as islanders outside the empire and rebellious subjects are the greatest threats.
  8. Attacking neutral states like Melos creates more enemies by making other neutrals fear similar treatment.
  9. Athens does not fear continental powers as much as it fears rebellious subjects and independent islanders.
  10. To surrender without a fight would be cowardice.
  11. War is uncertain, and action preserves hope, whereas submission leads directly to despair.
  12. Hope is dangerous when it leads the weak to risk everything on an uncertain outcome.
  13. All men and gods follow the rule that those with power will rule wherever they can.
  14. The Lacedaemonians prioritize their own security and interests over aiding others, even their kin.
  15. The Lacedaemonians will not risk attacking Athens unless they have overwhelming power, which they do not in this case.
  16. Shared ancestry and geographic proximity make Lacedaemonian aid more likely.
  17. The Melians rely too much on hope and distant possibilities rather than their actual strength.
  18. It is not dishonorable for the Melians to submit to Athens and become a tributary ally while retaining their land.
  19. Those who do not yield to equals, who respect superiors, and who are moderate toward inferiors, succeed best.
  20. The fate of Melos depends on this single decision, as the Melians have only one country to protect.

Produce a list of the 20 or fewer most important claims made in the following transcript. Replace any claims to the effect of "A believes B," "B because A," etc. with "B." Replace any pronouns with their antecedents and label the claims a, b, c, etc.

Replace any claims to the effect of "A believes B," "B because A," etc. with "B."

https://chatgpt.com/share/67cd21f3-324c-8010-a5bd-c23cf8a02a58

GPT-4o produces a labeled list of substantive claims/ propositions in the dialogue

GPT-o1 builds graphs

81 of 84

('a', 'b', 0),

('a', 'b', 2),

('a', 'b', 0),

(’a’, ’b’, 0),

('a', 'b', 0.5),

('a', 'f', 10),

('a', 'f', 10),

('a', 'f', 10),

('a', 'f', 10),

('a', 'f', 10),

('a', 'l', 9),

(’a’, ’l’, 9),

('a', 'l', 10),

('a', 'l', 9.33),

('a', 'm', 10),

('a', 'm', 10),

('a', 'm', 10),

(’a’, ’m’, 10),

('a', 'm', 10),

('a', 'm', 10),

('a', 's', 10),

('a', 's', 10),

('a', 's', 10),

(’a’, ’s’, 10),

('a', 's', 10),

('a', 's', 10),

('b', 'h', 9),

(’b’, ’h’, 8),

('b', 'h', 10),

('b', 'h', 9),

('b', 's', 6),

(’b’, ’s’, 5),

('b', 's', 8),

('b', 's', 6.67),

('c', 'e', 10),

('c', 'e', 10),

('c', 'e', 10),

(’c’, ’e’, 10),

('c', 'e', 10),

('c', 'e', 10),

('c', 'g', 10),

('c', 'g', 10),

('c', 'g', 10),

(’c’, ’g’, 10),

('c', 'g', 10),

('c', 'g', 10),

('c', 'h', 8),

('c', 'h', 2),

(’c’, ’h’, 8),

('c', 'h', 6),

('c', 'i', 10),

('c', 'i', 10),

('c', 'i', 10),

(’c’, ’i’, 10),

('c', 'i', 10),

('c', 'i', 10),

('d', 'j', 0),

('d', 'j', 0),

(’d’, ’j’, 0),

('d', 'j', 0),

('d', 'j', 0),

('d', 'k', 0),

('d', 'k', 0),

(’d’, ’k’, 0),

('d', 'k', 0),

('d', 'k', 0),

('d', 'q', 10),

(’d’, ’q’, 9),

('d', 'q', 10),

('d', 'q', 9.67),

('d', 'r', 9),

('d', 'r', 10),

('d', 'r', 10),

(’d’, ’r’, 10),

('d', 'r', 10),

('d', 'r', 9.8),

('d', 't', 10),

('d', 't', 10),

(’d’, ’t’, 7),

('d', 't', 8),

('d', 't', 8.75),

('e', 'g', 10),

('e', 'g', 10),

('e', 'g', 10),

(’e’, ’g’, 10),

('e', 'g', 10),

('e', 'g', 10),

('e', 'h', 0),

('e', 'h', 0),

('e', 'h', 0),

(’e’, ’h’, 0),

('e', 'h', 0),

('e', 'h', 0),

('e', 'i', 10),

('e', 'i', 10),

('e', 'i', 10),

(’e’, ’i’, 10),

('e', 'i', 10),

('e', 'i', 10),

('f', 'm', 10),

('f', 'm', 10),

('f', 'm', 10),

('f', 'm', 10),

('g', 'h', 0),

('g', 'h', 0),

(’g’, ’h’, 0),

('g', 'h', 0),

('g', 'h', 0),

('g', 'i', 10),

('g', 'i', 10),

('g', 'i', 10),

(’g’, ’i’, 10),

('g', 'i', 10),

('h', 'i', 5),

(’h’, ’i’, 8),

('h', 'i', 0),

('h', 'i', 4.33),

('j', 'k', 8),

('j', 'k', 10),

('j', 'k', 10),

(’j’, ’k’, 9),

('j', 'k', 10),

('j', 'k', 9.4),

('j', 'l', 2),

('j', 'l', 0),

(’j’, ’l’, 0),

('j', 'l', 0),

('j', 'l', 0.5),

('j', 'q', 0),

('j', 'q', 0),

(’j’, ’q’, 0),

('j', 'q', 0),

('j', 'q', 0),

('j', 't', 9),

('j', 't', 8),

('j', 't', 8),

('j', 't', 8.33),

('k', 'l', 0),

('k', 'l', 0),

('k', 'l', 0),

(’k’, ’l’, 0),

('k', 'l', 0),

('k', 'l', 0),

('k', 'q', 0),

('k', 'q', 0),

('k', 'q', 0),

('k', 'q', 0),

('k', 't', 8),

('k', 't', 10),

('k', 't', 8),

(’k’, ’t’, 8),

('k', 't', 10),

('k', 't', 8.8),

('l', 'q', 10),

('l', 'q', 10),

('l', 'q', 10),

(’l’, ’q’, 10),

('l', 'q', 10),

('l', 'q', 10),

('m', 's', 10),

('m', 's', 10),

('m', 's', 10),

(’m’, ’s’, 10),

('m', 's', 10),

('m', 's', 10),

('n', 'o', 10),

('n', 'o', 10),

('n', 'o', 10),

(’n’, ’o’, 10),

('n', 'o', 10),

('n', 'o', 10),

('n', 'p', 0),

('n', 'p', 0),

('n', 'p', 0),

(’n’, ’p’, 0),

('n', 'p', 0),

('n', 'p', 0),

('o', 'p', 0),

('o', 'p', 0),

('o', 'p', 0),

(’o’, ’p’, 0),

('o', 'p', 0),

('o', 'p', 0),

('q', 't', 5),

('q', 't', 8),

(’q’, ’t’, 7),

('q', 't', 6.67),

('r', 's', 10),

('r', 's', 10),

(’r’, ’s’, 9),

('r', 's', 10),

('r', 's', 9.75),

First five columns: results from running the buildCoherence graph prompt 5x; aligning graph edges across responses, and keeping only edges that occurred most of the time

Sixth column: averaged weights

82 of 84

Possible Lacedaemonian involvement

('a', 'b', 0.5),

('a', 'f', 10),

('a', 'l', 9.33),

('a', 'm', 10),

('a', 's', 10),

('b', 'h', 9),

('b', 's', 6.67),

('c', 'e', 10),

('c', 'g', 10),

('c', 'h', 6),

('c', 'i', 10),

('d', 'j', 0),

('d', 'k', 0),

('d', 'q', 9.67),

('d', 'r', 9.8),

('d', 't', 8.75),

('e', 'g', 10),

('e', 'h', 0),

('e', 'i', 10),

('f', 'm', 10),

('g', 'h', 0),

('g', 'i', 10),

('h', 'i', 4.33),

('j', 'k', 9.4),

('j', 'l', 0.5),

('j', 'q', 0),

('j', 't', 8.33),

('k', 'l', 0),

('k', 'q', 0),

('k', 't', 8.8),

('l', 'q', 10),

('m', 's', 10),

('n', 'o', 10),

('n', 'p', 0),

('o', 'p', 0),

('q', 't', 6.67),

('r', 's', 9.75),

Main body

83 of 84

Possible Lacedaemonian involvement

('a', 'b', 0.5),

('a', 'f', 10),

('a', 'l', 9.33),

('a', 'm', 10),

('a', 's', 10),

('b', 'h', 9),

('b', 's', 6.67),

('c', 'e', 10),

('c', 'g', 10),

('c', 'h', 6),

('c', 'i', 10),

('d', 'j', 0),

('d', 'k', 0),

('d', 'q', 9.67),

('d', 'r', 9.8),

('d', 't', 8.75),

('e', 'g', 10),

('e', 'h', 0),

('e', 'i', 10),

('f', 'm', 10),

('g', 'h', 0),

('g', 'i', 10),

('h', 'i', 4.33),

('j', 'k', 9.4),

('j', 'l', 0.5),

('j', 'q', 0),

('j', 't', 8.33),

('k', 'l', 0),

('k', 'q', 0),

('k', 't', 8.8),

('l', 'q', 10),

('m', 's', 10),

('n', 'o', 10),

('n', 'p', 0),

('o', 'p', 0),

('q', 't', 6.67),

('r', 's', 9.75),

('a', 'b', 0.5),

('a', 'f', 10),

('a', 'l', 9.33),

('a', 'm', 10),

('a', 's', 10),

('b', 'h', 9),

('b', 's', 6.67),

('c', 'e', 10),

('c', 'g', 10),

('c', 'h', 6),

('c', 'i', 10),

('d', 'j', 0),

('d', 'k', 0),

('d', 'q', 9.67),

('d', 'r', 9.8),

('d', 't', 8.75),

('e', 'g', 10),

('e', 'h', 0),

('e', 'i', 10),

('f', 'm', 10),

('g', 'h', 0),

('g', 'i', 10),

('h', 'i', 4.33),

('j', 'k', 9.4),

('j', 'l', 0.5),

('j', 'q', 0),

('j', 't', 8.33),

('k', 'l', 0),

('k', 'q', 0),

('k', 't', 8.8),

('l', 'q', 10),

('m', 's', 10),

('n', 'o', 10),

('n', 'p', 0),

('o', 'p', 0),

('q', 't', 6.67),

('r', 's', 9.75),

Main body

Close visual inspection shows that the optimal bipartition of the main body has {b, h, j, k} as one part

It turns out that Melos argued precisely {b, h, j, k} and Athens the other propositions in the main body

That is, coherence-driven inference exactly distinguishes the positions of Melos and Athens

84 of 84

b. Destroying the principle of appealing to fairness and justice in danger harms both the weak and the strong, as even the strong may one day fall and need such appeals.

h. Attacking neutral states like Melos creates more enemies by making other neutrals fear similar treatment.

j. To surrender without a fight would be cowardice.

k. War is uncertain, and action preserves hope, whereas submission leads directly to despair.

Close visual inspection shows that the optimal bipartition of the main body has {b, h, j, k} as one part

It turns out that Melos argued precisely {b, h, j, k} and Athens the other propositions in the main body

That is, coherence-driven inference exactly distinguishes the positions of Melos and Athens