What does a compute centric perspective say about AI takeoff speeds?
What is takeoff speed?
Capabilities takeoff speed: How quickly will AI improve as we approach and surpass human-level AI?
Impact takeoff speed: How quickly will AI’s impact on the world increase as we approach and surpass human-level AI?
A “compute centric” framework
Theoretical framework
Difficulty gap
Speed crossing difficulty gap
AI can automate 20% of tasks
Effective compute used in training
More compute, better algorithms
AI can automate 100% of tasks
Theoretical framework
Difficulty gap
Speed crossing difficulty gap
AI can automate 20% of tasks
Effective compute used in training
More compute, better algorithms
AI can automate 100% of tasks
What is the meaning of the difficulty gap?
Hans Moravec's "rising tide of AI capacity" can illuminate the meaning of the difficulty gap
How much more effective compute to train [AI that can automate 100% of tasks] than to train [AI that can automate 20% of tasks]?
What is the meaning of the difficulty gap?
How wide is the hump?
Effective compute used to train AI
AI can automate 100% of tasks
AI can automate 20% of tasks
How big is the difficulty gap?
Arguments for a small difficulty gap (~1 - 2 OOMs):
Arguments for a big difficulty gap (~4 - 8 OOMs):
My median: ~3-4 OOMs.
Theoretical framework: fitting the framework
Difficulty gap
Speed crossing difficulty gap
AI can automate 20% of tasks
Effective compute used in training
More compute, better algorithms
AI can automate 100% of tasks
How fast will we cross the difficulty gap?
In the last decade (h/t Epoch):
Speed crossing difficulty gap
More spending, cheaper compute, better algorithms
How fast will we cross the difficulty gap?
What might change while we cross the difficulty gap?
| Speed crossing the gap (OOM/year) | |||
| More spending | Cheaper compute | Better algorithms | Total |
Last decade | ~0.5 | ~0.1 | ~0.4 | ~1 |
Crossing the difficulty gap | ~0.3 | ~0.2 | ~0.8 | ~1.3 |
Speed crossing difficulty gap
More spending, cheaper compute, better algorithms
Very tentative conclusions
| Difficulty gap How much more effective compute to train [AI that can automate 100% of tasks] than to train [AI that can automate 20% of tasks]? | Speed crossing difficulty gap How quickly will the effective compute used in training increase? | Takeoff speed Time from [AI that can automate 20% of tasks] to [AI that can automate 100% of tasks] |
Median | ~3.5 OOMs | ~1.3 OOMs/year | ~3 years |
Aggressive | ~1 OOM | ~2 OOMs/year | ~0.5 years |
Conservative | ~8 OOMs | ~0.5 OOMs/year | ~16 years |
So what?
This is pretty fast, and no discontinuities are baked into the model.
A faster takeoff means:
“20% is an arbitrary startpoint”
“20% is an arbitrary startpoint”
| Doubling times (median) X years before AI can readily automate 100% of tasks | |||
Quantity | 1 year | 2 years | 5 years | 10 years |
FLOP/$ | 0.3 | 1.2 | 2.7 | 3.1 |
Software | 0.2 | 1.2 | 2.4 | 2.9 |
GWP | 0.7 | 2.4 | 8.4 | 21.1 |
Time from 100% automation to superintelligence?
Reasons for “very fast”:
Reasons against:
My guess: more likely than not this happens in <12 months.
Limitations
Questions