1 of 67

🧠 Intelligence Beyond Commitment Devices 🗳️

Xinyuan Sun (Xyn)

Research, Flashbots ⚡🤖

2 of 67

Alignment and Coordination

Alignment of AI is choosing desired outcomes.

AI <> AI

AI <> human

AI + human <> AI + human

Choosing desired outcomes is coordination.

Along the way we make some over-approximations as compromises.

One of the most popular way to implement coordination, is using commitments (e.g., decision theory, mechanism design, commitment devices).

How do we study commitments? Will commitments for AIs exhibit unique properties? If so, what are they? How do we leverage/mitigate those properties?

3 of 67

Alignment and Coordination

Alignment of AI is choosing desired outcomes.

AI <> AI

AI <> human

AI + human <> AI + human

Choosing desired outcomes is coordination.

Along the way we make some over-approximations as compromises.

One of the most popular way to implement coordination, is using commitments (e.g., decision theory, mechanism design, commitment devices).

How do we study commitments? Will commitments for AIs exhibit unique properties? If so, what are they? How do we leverage/mitigate those properties?

4 of 67

Alignment and Coordination

Alignment of AI is choosing desired outcomes.

AI <> AI

AI <> human

AI + human <> AI + human

Choosing desired outcomes is coordination.

Along the way we make some over-approximations as compromises.

One of the most popular way to implement coordination, is using commitments (e.g., decision theory, mechanism design, commitment devices).

How do we study commitments? Will commitments for AIs exhibit unique properties? If so, what are they? How do we leverage/mitigate those properties?

5 of 67

This is Kim

Tips for nerds: this box is a commitment device, since it is common knowledge that this commitment is credible, we say it is credible commitment device

6 of 67

This is Kim

Kim has a box.

This box allows him to delegate arbitrary actions, and it is common knowledge that this box executes things faithfully.

Now Kim wants to use this box to improve the efficiency of the games that he is playing with his frienemy Don.

Tips for nerds: this box is a commitment device, since it is common knowledge that this commitment is credible, we say it is credible commitment device

7 of 67

This is Kim

Kim has a box.

This box allows him to delegate arbitrary actions, and it is common knowledge that this box executes things faithfully.

Now Kim wants to use this box to improve the efficiency of the games that he is playing with his frienemy Don.

Tips for nerds: this box is a commitment device, since it is common knowledge that this commitment is credible, we say it is credible commitment device

8 of 67

Prisoner’s Dilemma

Kim would want to delegate the box prior to game begins.

9 of 67

Prisoner’s Dilemma

Now box makes the game work, great!

Tips for nerds: Folk theorem for commitment device states that any payoff within the convex hull of individually rational payoff sets are achievable (for single-shot games even)

10 of 67

Commitment Devices

Enforcement (strategic certainty)
Common Knowledge of enforcement

11 of 67

Commitment Devices

Enforcement (strategic certainty)
Common Knowledge of enforcement

12 of 67

Crypto-economic Commitment Devices

Enforcement (strategic certainty)

Via delegated game play to EVM semantics secured by disinterested parties holding the commitment device

Common Knowledge of enforcement

Via consensus protocols to make public announcement broadcasts

Enforcement means commitments can credibly (with high confidence) predicate/predict agent’s behavior. E.g., smoking, grim-trigger in cartels, …

Common knowledge of enforcements means the increased certainty (predicated strategy space of agents) can shift equilibria of the game.

The computation of delegation and implementation of common knowledge takes time.

The blocktime.

Let’s call the blocktime t.

13 of 67

Crypto-economic Commitment Devices

Enforcement (strategic certainty)

Via delegated game play to EVM semantics secured by disinterested parties holding the commitment device

Common Knowledge of enforcement

Via consensus protocols to make public announcement broadcasts

Enforcement means commitments can credibly (with high confidence) predicate/predict agent’s behavior. E.g., smoking, grim-trigger in cartels, …

Common knowledge of enforcements means the increased certainty (predicated strategy space of agents) can shift equilibria of the game.

The computation of delegation and implementation of common knowledge takes time.

The blocktime.

Let’s call the blocktime t.

14 of 67

Crypto-economic Commitment Devices

Enforcement (strategic certainty)

Via delegated game play to EVM semantics secured by disinterested parties holding the commitment device

Common Knowledge of enforcement

Via consensus protocols to make public announcement broadcasts

Enforcement means commitments can credibly (with high confidence) predicate/predict agent’s behavior. E.g., smoking, grim-trigger in cartels, …

Common knowledge of enforcements means the increased certainty (predicated strategy space of agents) can shift equilibria of the game.

The computation of delegation and implementation of common knowledge takes time.

The blocktime.

Let’s call the blocktime t.

15 of 67

Crypto-economic Commitment Devices

Suppose we start at time T, within blocktime t:

Transactions (commitments, denoted by Com) are sent to the commitment device.

At the end of the blocktime, the commitment device implements common knowledge of the commitments and their settlement (denoted by a function F mapping a list of all commitments COM to a commitment device state) by making a public broadcast of the results.

Tips for nerds: Here I’m equating consensus broadcast at the end of each slot as finality of the transactions, this is true only on blockchains that have single-slot finality.

16 of 67

Crypto-economic Commitment Devices

Suppose we start at time T, within blocktime t:

Transactions (commitments, denoted by Com) are sent to the commitment device.

Tips for nerds: Here I’m equating consensus broadcast at the end of each slot as finality of the transactions, this is true only on blockchains that have single-slot finality.

17 of 67

Crypto-economic Commitment Devices

Formally, we can define crypto as a Permissionless Credible Commitment Device (PCCD) that consists of:

State S that includes all settled commitments
Commitment constructors CCom that produces commitments Com1, Com2, ...
Commitment semantics/settlement function F: [Com] -> S
Blocktime t the time it needs to collect/compute/settle/broadcast commitments.

PCCDs implement coordination via common knowledge.

But for games played within blocktime, t’ < t, it cannot coordinate or align! Because it cannot choose the desired outcome for lack of enforcement and common knowledge.

Also, the mediator/auctioneer’s (who execute the commitment semantics) strategy is not certain (action space is not predicated). Because it’s a game played in t’. And often, the mediator is a set of profit-seeking or even malicious agents that could collude! E.g., algorithmic pricing, google ad auction, …

18 of 67

Problem with Prisoner’s Dilemma

Suppose now the Commitment Constructor CCom allows payments.

Tips for nerds: this is the classic Stackelberg competition scenario, where the leader or the mechanism designer gets an asymmetric payoff

19 of 67

Problem with Prisoner’s Dilemma

Now the only payoff possible is (3,1), where Don is indifferent between using the device and not.

20 of 67

Problem with Prisoner’s Dilemma

Suppose both Kim and Don can use the box, now (3,1) (1,3) are the set of implementable payoffs. Not sure if desirable.

21 of 67

Cooperative Games

Let’s abstract away the mediator - “person with box,” call her C, and we call Kim&Don A&B.

C controls the commitment semantics F because her strategy is not certain (because C’s choice/computation/settlement of commitments is a game played within blocktime t.)

We have the box, which is a credible commitment device, so it feels natural to model this as cooperative games. Specifically, suppose ABC use the box to form coalitions.

v(A) = v(B) = 1, v(C) = 0

v(AB) = 2, v(AC) = v(BC) = 3

v(ABC) = 4

Core of this game (stable, no sub-coalition could deviate profitably) is (A: 1, B: 1, C: 2)

22 of 67

Cooperative Games

Let’s abstract away the mediator - “person with box,” call her C, and we call Kim&Don A&B.

C controls the commitment semantics F because her strategy is not certain (because C’s choice/computation/settlement of commitments is a game played within blocktime t.)

Now, will the commitment game be different? We start by modeling the “box” as a PCCD for playing cooperative games. Specifically, suppose ABC use the box to form coalitions.

v(A) = v(B) = 1, v(C) = 0

v(AB) = 2, v(AC) = v(BC) = 3

v(ABC) = 4

Core of this game (stable, no sub-coalition could deviate profitably) is (A: 1, B: 1, C: 2)

23 of 67

Cooperative Games

Let’s abstract away the mediator - “person with box,” call her C, and we call Kim&Don A&B.

C controls the commitment semantics F because her strategy is not certain (because C’s choice/computation/settlement of commitments is a game played within blocktime t.)

Now, will the commitment game be different? We start by modeling the “box” as a PCCD for playing cooperative games. Specifically, suppose ABC use the box to form coalitions.

v(A) = v(B) = 1, v(C) = 0

v(AB) = 2, v(AC) = v(BC) = 3

v(ABC) = 4

Core of this game (stable, no sub-coalition could deviate profitably) is (A: 1, B: 1, C: 2)

24 of 67

Core

The core manifests itself, e.g., Kim and Don can both use the box and bid in first price auction.

25 of 67

Core

There is no point in using the box anymore. The box is useless.

26 of 67

Concrete Example - Trading

At time T, There exists some liquidity on an Automated Market Maker (AMM). User wants to trade some assets potentially utilizing the AMM liquidity.

Suppose users have time preferences and want to finish trade in one block.

Currently, on Ethereum base layer protocol, most users just send a swap trading against the AMM. And their swaps gets picked off by sophisticated parties.

One big reason is because users cannot coordinate with other users on trading against each other first and then settling against the AMM liquidity (walrasian style).

This coordination game between users happens at time t’ < t the blocktime.

Obviously the latter is a “desired outcome” where higher welfare is achieved (trade with less fees), and we have “alignment” across users of the PCCD.

27 of 67

Concrete Example - Trading

At time T, There exists some liquidity on an Automated Market Maker (AMM). User wants to trade some assets potentially utilizing the AMM liquidity.

Suppose users have time preferences and want to finish trade in one block.

Currently, on Ethereum base layer protocol, most users just send a swap trading against the AMM. And their swaps gets picked off by sophisticated parties (bots).

One big reason is because users cannot coordinate with other users on trading against each other first and then settling against the AMM liquidity (walrasian style).

This coordination game between users happens at time t’ < t the blocktime.

Obviously the latter is a “desired outcome” where higher welfare is achieved (trade with less fees), and we have “alignment” across users of the PCCD.

28 of 67

Concrete Example - Trading

At time T, There exists some liquidity on an Automated Market Maker (AMM). User wants to trade some assets potentially utilizing the AMM liquidity.

Suppose users have time preferences and want to finish trade in one block.

Currently, on Ethereum base layer protocol, most users just send a swap trading against the AMM. And their swaps gets picked off by sophisticated parties (bots).

One big reason is because users cannot coordinate with other users on trading against each other first and then settling against the AMM liquidity (walrasian style).

This coordination game between users happens at time t’ < t the blocktime.

Obviously the latter is a “desired outcome” where higher welfare is achieved (trade with less fees), and we have “alignment” across users of the PCCD. But we cannot.

29 of 67

Maximal Extractable Value (MEV) games

In both the Prisoner’s Dilemma and the AMM liquidity example, we see the PCCD commitment game achieve undesirable outcomes because there are some games played within the blocktime t. And since the “speed of commitments” is slower than the speed at which those games are played, the PCCD fails to align/coordinate.

And in those games, some value/welfare is transferred unfairly (handed to the mafia and the monarch) or destroyed (handed to the moloch).

We call those value MEV. And we call those games the MEV game.

MEV is a big industry, with active players and adversarial incentives.

30 of 67

Maximal Extractable Value (MEV) games

And in those games, some value/welfare is transferred undesirably (handed to the rent-seeking mafia and the monarch) or destroyed (burnt to tribute the moloch).

We call those value MEV. And we call those games the MEV game of a PCCD.

MEV is a big industry, with active players and adversarial incentives.

MEV games are exactly same as the AI alignment/coordination game.

31 of 67

Maximal Extractable Value (MEV) games

And in those games, some value/welfare is transferred undesirably (handed to the rent-seeking mafia and the monarch) or destroyed (burnt to tribute the moloch).

We call those value MEV. And we call those games the MEV game of a PCCD.

MEV is a big industry, with active players and adversarial incentives.

MEV games are exactly same as the AI alignment/coordination game.

32 of 67

Correspondence

The study of MEV enables

AI alignment and cooperative AI.

Why? On a high-level

Commitments carve out a large section of inefficiencies, but it’s not enough
MEV games are good theoretical frameworks to think about alignment, especially decision theory problems, e.g., ond-boxing in Newcomb’s paradox as MEV game
MEV is dealing with intelligence beyond the commitments
MEV games are specifically about how dumb user transactions interact with misaligned smart “searcher” (HFT trading firm) transactions.
MEV “searchers” are actually pretty good estimations of AI or AGI (optimizor)

33 of 67

Correspondence

The study of MEV enables

AI alignment and cooperative AI.

Why? On a high-level

Commitments carve out a large section of inefficiencies, but it’s not enough
MEV games are good theoretical frameworks to think about alignment, especially decision theory problems, e.g., ond-boxing in Newcomb’s paradox as MEV game
MEV is dealing with intelligence beyond the commitments
MEV games are specifically about how dumb user transactions interact with misaligned smart “searcher” (HFT trading firm) transactions.
MEV “searchers” are actually pretty good estimations of AI or AGI (optimizor)

34 of 67

Elaboration

One-boxing in Newcomb’s paradox as MEV game. Credible commitment by the predictor and the decision in the (causal but not MEV-time) past conditions on the decision in the future.

It’s a flashloan!

35 of 67

Correspondence

The study of MEV enables

AI alignment and cooperative AI.

Why? On a high-level

Commitments carve out a large section of inefficiencies, but it’s not enough
MEV games are good theoretical frameworks to think about alignment, especially decision theory problems, e.g., ond-boxing in Newcomb’s paradox as MEV game
MEV is dealing with intelligence beyond commitments, absence of interpretability
MEV games are specifically about how dumb user transactions interact with misaligned smart “searcher” (HFT trading firm) transactions.
MEV “searchers” are actually pretty good estimations of AI or AGI (optimizor)

36 of 67

Correspondence

The study of MEV enables

AI alignment and cooperative AI.

Why? On a high-level

Commitments carve out a large section of inefficiencies, but it’s not enough
MEV games are good theoretical frameworks to think about alignment, especially decision theory problems, e.g., ond-boxing in Newcomb’s paradox as MEV game
MEV is dealing with intelligence beyond commitments, absence of interpretability
MEV games are specifically about how dumb user transactions interact with misaligned smart “searcher” (HFT trading firm) transactions.
MEV “searchers” are actually pretty good estimations of AI or AGI (optimizor)

37 of 67

Correspondence

The study of MEV enables

AI alignment and cooperative AI.

Why? On a high-level

Commitments carve out a large section of inefficiencies, but it’s not enough
MEV games are good theoretical frameworks to think about alignment, especially decision theory problems, e.g., ond-boxing in Newcomb’s paradox as MEV game
MEV is dealing with intelligence beyond commitments, absence of interpretability
MEV games are specifically about how dumb user transactions interact with misaligned smart “searcher” (HFT trading firm) transactions.
MEV “searchers” are actually pretty good estimations of AI or AGI (optimizor)

38 of 67

"But whoever lives by the truth comes into the light, so that it may be seen plainly that what they have done has been done in the sight of God."

John 3:21 (NIV)

39 of 67

The boundary of the light cone is defined by the speed of light, which is the fastest possible speed at which information can be transmitted between events. Events that are within the light cone are causally connected and can potentially influence each other, while events outside the light cone are causally disconnected and unmeasurable.

40 of 67

The ordering and settlement of commitments in a MEV game is not certain, in that there exists no “happens-before” relation like the one of “causal ordering” in distributed systems.

MEV games are defined by the fact that they are played faster than the speed of commitments. That there exists maximal value to be extracted by misaligned intelligences where light cannot shed.

41 of 67

MEV = aligning misaligned superior intelligence hiding beyond the commitment cone

AI alignment = aligning misaligned superior intelligence hiding beyond the commitment cone

42 of 67

Certainty has limit

Most technologies of implementing certainty has a speed limit, if not a fundamental physical bound.

The technology of commitments is no exception.

43 of 67

Intelligence has (almost) no limit

Intelligent agents will collaborate and compete in ultra-refine high-frequency games.

Those games are not bounded (except for physical limits) by any “speed of certainty/commitment.”

44 of 67

There will be misalignment

Games that are played faster than certainty are MEV games, and they almost always are not aligned.

Those games WILL be played by misaligned intelligent agents, hiding beyond the commitment cone, cuz that’s where there is most x-domain MEV (correlated yet not regulated games).

MEV is a phenomena.

45 of 67

Misalignment is here, concrete

Bot PvP. Hyper-financialization.

Real-incentives with billion$ value on Ethereum from MEV games.

The slower intelligence becomes the stale quote (snail commitment) to be sniped off by super-intelligences (commitments with more agency) for MEV.

We care about the slower intelligence.

46 of 67

Proof Sketch

Commitments implement alignment and coordination via enforcement and the common knowledge of enforcement.
There is a speed limit to commitment devices.
For any “slow” game that is played longer than the speed of commitments, we can largely align and we know how to do it.
Intelligence has no speed limit.
From 2 and 4, intelligent agents play games that start and finish before the commitment device could react.
Many of those games have misaligned outcomes (some PoA) and correlated payoffs with slower games.
From 5 and 6, there will always be MEV/misalignment, because we fail to align those “fast” games.

47 of 67

Proof Sketch

Commitments implement alignment and coordination via enforcement and the common knowledge of enforcement.
There is a speed limit to commitment devices.
For any “slow” game that is played longer than the speed of commitments, we can largely align and we know how to do it.
Intelligence has no speed limit.
From 2 and 4, intelligent agents play games that start and finish before the commitment device could react.
Many of those games have misaligned outcomes (some PoA) and correlated payoffs with slower games.
From 5 and 6, there will always be MEV/misalignment, because we fail to align those “fast” games.

48 of 67

Proof Sketch

Commitments implement alignment and coordination via enforcement and the common knowledge of enforcement.
There is a speed limit to commitment devices.
For any “slow” game that is played longer than the speed of commitments, we can largely align and we know how to do it.
Intelligence has no speed limit.
From 2 and 4, intelligent agents play games that start and finish before the commitment device could react.
Many of those games have misaligned outcomes (some PoA) and correlated payoffs with slower games.
From 5 and 6, there will always be MEV/misalignment, because we fail to align those “fast” games.

49 of 67

Proof Sketch

Commitments implement alignment and coordination via enforcement and the common knowledge of enforcement.
There is a speed limit to commitment devices.
For any “slow” game that is played longer than the speed of commitments, we can largely align and we know how to do it.
Intelligence has no speed limit.
From 2 and 4, intelligent agents play games that start and finish before the commitment device could react.
Many of those games have misaligned outcomes (some PoA) and correlated payoffs with slower games.
From 5 and 6, there will always be MEV/misalignment, because we fail to align those “fast” games.

50 of 67

Proof Sketch

Commitments implement alignment and coordination via enforcement and the common knowledge of enforcement.
There is a speed limit to commitment devices.
For any “slow” game that is played longer than the speed of commitments, we can largely align and we know how to do it.
Intelligence has no speed limit.
From 2 and 4, intelligent agents play games that start and finish before the commitment device could react.
Many of those games have misaligned outcomes (some PoA) and correlated payoffs with slower games.
From 5 and 6, there will always be MEV/misalignment, because we fail to align those “fast” games.

51 of 67

Proof Sketch

Commitments implement alignment and coordination via enforcement and the common knowledge of enforcement.
There is a speed limit to commitment devices.
For any “slow” game that is played longer than the speed of commitments, we can largely align and we know how to do it.
Intelligence has no speed limit.
From 2 and 4, intelligent agents play games that start and finish before the commitment device could react.
Many of those games have misaligned outcomes (some PoA) and correlated payoffs with slower games.
From 5 and 6, there will always be MEV/misalignment, because we fail to align those “fast” games.

52 of 67

Proof Sketch

Commitments implement alignment and coordination via enforcement and the common knowledge of enforcement.
There is a speed limit to commitment devices.
For any “slow” game that is played longer than the speed of commitments, we can largely align and we know how to do it.
Intelligence has no speed limit.
From 2 and 4, intelligent agents play games that start and finish before the commitment device could react.
Many of those games have misaligned outcomes (some PoA) and correlated payoffs with slower games.
From 5 and 6, there will always be MEV/misalignment, because we fail to align those “fast” games.

53 of 67

How do we align those MEV games?

How do we align the intelligence hiding behind the commitment cone, with their speed of intelligence travelling at a speed faster than the speed of commitment/certainty/common knowledge?

Either

i) slow down the speed of intelligence,

ii) speed up the speed of commitment,

iii) use technologies other than commitments to implement coordination that is not subject to speed limits,

or iv) bound the amount of misaligned games!

54 of 67

How do we align fast MEV games?

First, make more games aligned via commitments.

Privacy achieves meta-game freeness, i.e., there exists no game that is unaligned and has correlated payoff to the already aligned games. Because privacy fixes the game that is being played by over-approximation.

Essentially, you are leveraging commitments of information to threaten to “dumb the game down” so there is not much high-speed intelligences can do!

55 of 67

This is Kim

Kim has a box.

The box can mitigate coordination/alignment issues with intelligence.

With MEV, the box is useless or even harmful.

Kim is sad.

We are all Kim

56 of 67

This is Kim

Kim has a box.

The box can mitigate coordination/alignment issues with intelligence.

With MEV, the box is useless or even harmful.

Kim is sad.

We are all Kim

57 of 67

This is Kim

Kim has a box.

The box can mitigate coordination/alignment issues with intelligence.

With MEV, the box is useless or even harmful.

Kim is sad.

We are all Kim

58 of 67

This is Kim

Kim has a box.

The box can mitigate coordination/alignment issues with intelligence.

With MEV, the box is useless or even harmful.

Kim is sad.

We are all Kim

59 of 67

This is Kim

Kim has a box.

The box can mitigate coordination/alignment issues with intelligence.

With MEV, the box is useless or even harmful.

Kim is sad.

We are all Kim

60 of 67

This is Kim

Kim has a box.

The box can mitigate coordination/alignment issues with intelligence.

With MEV, the box is useless or even harmful.

Kim is sad.

We are all Kim

So let’s work together to capture

the intelligence beyond commitments.

61 of 67

@sxysun1

xinyuan@flashbots.net

63 of 67

Backup Slides

64 of 67

Crypto-economic Commitment Devices

Formally, we can define crypto as a Permissionless Credible Commitment Device (PCCD) that consists of:

State S that includes all settled commitments
Commitment constructors CCom that produces commitments Com1, Com2, ...
Commitment semantics/settlement function F: [Com] -> S
Blocktime t the time it needs to collect/compute/settle/broadcast commitments.

Tips for nerds: Here I’m equating consensus broadcast at the end of each slot as finality of the transactions, this is true only on blockchains that have single-slot finality.

65 of 67

Prisoner’s Dilemma

State S = strategies that agents play in the PD game; mapping from PlayerID to Strat

Commitment constructor CCom = takes in either a function from PD game strategy to PD game strategy (Strat -> Strat), or a PD game strategy (Strat)

Commitment semantics F: [Com] -> S = checks the list of commitments, apply commitments of type Strat to commitments of type Strat -> Strat, then puts result into the mapping of S

66 of 67

Prisoner’s Dilemma

State S = strategies that agents play in the PD game; mapping from PlayerID to Strat

Commitment constructor CCom = takes in either a function from PD game strategy to PD game strategy (Strat -> Strat), or a PD game strategy (Strat)

Commitment semantics F: [Com] -> S = checks the list of commitments, apply commitments of type Strat to commitments of type Strat -> Strat, then puts result into the mapping of S

What about blocktime t?

67 of 67

🧠 Intelligence Beyond Commitment Devices 🗳️

Xinyuan Sun (Xyn)

Research, Flashbots ⚡🤖

1 of 67

2 of 67

3 of 67

4 of 67

5 of 67

6 of 67

7 of 67

8 of 67

9 of 67

10 of 67

11 of 67

12 of 67

13 of 67

14 of 67

15 of 67

16 of 67

17 of 67

18 of 67

19 of 67

20 of 67

21 of 67

22 of 67

23 of 67

24 of 67

25 of 67

26 of 67

27 of 67

28 of 67

29 of 67

30 of 67

31 of 67

32 of 67

33 of 67

34 of 67

35 of 67

36 of 67

37 of 67

38 of 67

39 of 67

40 of 67

41 of 67

42 of 67

43 of 67

44 of 67

45 of 67

46 of 67

47 of 67

48 of 67

49 of 67

50 of 67

51 of 67

52 of 67

53 of 67

54 of 67

55 of 67

56 of 67

57 of 67

58 of 67

59 of 67

60 of 67

61 of 67

62 of 67

63 of 67

64 of 67

65 of 67

66 of 67

67 of 67