Sustainable Computing in an Era of Rising Hardware Costs and Slowing Per-Core Progress
Luca Atzori, Ian Fisk, Maria Girone
Sustainable
”Sustainable”
Money
Time
Technology
Outline
Computing Evolution
Replacement Models
Cost Evolution
Impact
Computing Evolution
4
Denard Scaling
Computer clock speed rose exponentially from 1970 to 2005
In the last 20 years, computer clock speeds have increased by a factor of 2-3
Performance of the individual cores increases with the wider instruction sets and more calculations per clock cycle
Multicore
Unable to increase the clock and still able to increase the density of transistors and size of the silicon, multi-core CPUs became standard
This pushed many processes per system or highly parallel code
Watts
Increasing the density of silicon reduces the watts per core as the feature size decreases and the efficiency improves
A modern CPU socket is 400W-500W
The amount of air a fan can move goes as the square of the radius
Drives toward direct liquid cooling
Enter the GPU
First GPU used in a Top500 Supercomputer was TITAN in 2012
GPUs were originally designed to apply rotation matrices to objects for graphics rendering
GPU Performance per Watt
The very first Green500 list had a GPU accelerated system at the top
The 2025 list the top 100 slots are GPU accelerated
2013
2013
Today
The concentration from the manufacturers has been on performance
GPU Advances
The FP32 performance of a modern GPU has increased at 25% a year for the last decade
GPU Leveling
AI can benefit from low precision calculations
GPU and CPU Watts
Modern processing devices use a lot of power
They are effectively impossible to cool with air
Large scale HPC installations and AI clusters are all moving to DLC
Status
Evolution of system improvement is slowing
Efficiency in terms of Flops/W has improved, but the rate of improvement is also slowed
Our economic models are based on costs remaining constant or slowly decreasing
Replacement Models
Replacement Models CPUs
How did we get to the 5-6 year replacement cycle for CPUs?
Historical
Practical
Operations
Replacement Models GPUs
GPUs are typically on a 3-4 year placement cycle
Historical
Practical
Cost Evolution
Current Investments in AI
US industry is expected to invest $500B-$700B in data centers, AI Facilities, and research this year
The US government will invest roughly $3—$5B in AI research
It’s not just that we aren’t driving anymore, we’re barely influencing
Impact Memory and Storage
GPUs need memory to store large and complex models
AI hyperscale installations are buying all the memory and fast storage
Impact Memory
Micron makes memory and their stock is up nearly 700% in the last year
It is not possible to buy 128GB MR DIMMs in 2026
All the Flatiron Purchase orders from the fall were cancelled by the manufacturer because they would lose too much money
A tale of two drives
The hard disk
The solid-state drive
Costs
HPC/HTC Computing
8-way NVIDIA HGX Node
HPC/HTC Node
It’s easier and more affordable to buy AI optimized hardware than HPC/HTC
GPU Cost Calculus
Previously
Now
Changing memory costs changes the economics of what to use
Possible Evolution
It is very hard to buy computers this year
What will it look like moving forward
The AI bubble completely bursts
The AI investment accelerates
It stops getting worse
Impact
26
New Status
Impact: Resources
Impact: Replacement Models
With a significant increase in cost and a general slowing of improvement the model for replacement needs to be revisited
8-10 year cycles for CPU based systems
Transitions
I am reminded of a previous time where CERN was dependent on an old and slowly evolving paradigm, which was getting expensive
Impact: Flexibility
32
Impact
The calculus for when to replace systems needs to be completely redone
Prices are changing rapidly. In the US vendor quotes are now valid for 2 weeks and no one will commit to a price until it ships
Our estimates for what hardware costs needs to be revised
The cost benefit of GPUs needs to be recalculated
We are unlikely to be able to grow as much as we would like.
Outlook (1/2)
It’s a difficult time
AI is driving, and like any disruptive technology its driving rather recklessly
Outlook (2/2)
To sustainably sustain our mission something will have to give