How Far Can AI Progress Before Hitting Effective Physical Limits?
Released on 14th February 2025
Tom Davidson
Rose Hadshar
Will MacAskill
Tom did the original thinking; Will and then Rose helped with later thinking and writing. Thanks to Max Dalton, Oscar Delaney, Ege Erdil, Daniel Eth, John Halstead, Anson Ho, Fin Moorhouse, Ben Todd and the governing explosive growth seminar group for helpful comments.
This is a rough research note – we’re sharing it for feedback and to spark discussion. We’re less confident in its methods and conclusions.
Once AI systems can themselves design and build even more capable AI systems, this will create feedback loops where AI builds better AI which builds better AI. More specifically, we can think about three important feedback loops:
A software feedback loop, where AI develops better software. Software includes AI training algorithms, post-training enhancements, ways to leverage runtime compute (like o1), synthetic data, and any other non-compute improvements.
A chip technology feedback loop, where AI designs better computer chips. Chip technology includes all the cognitive research and design work done by NVIDIA, TSMC, ASML, and other semiconductor companies.
A chip production feedback loop, where AI and robots build more computer chips.
If one or several of these feedback loops cause AI progress to become very fast, we could call this an “intelligence explosion”.
In simplified models of intelligence explosions, progress goes to infinity. But in reality, this is of course implausible. We will eventually hit effective limits on all of software, chip technology and chip production improvements, and so progress from each of these sources will eventually plateau. And progress could slow long before effective physical limits are reached, because of diminishing marginal returns, regulation, or other factors.
In this essay, we set aside human constraints and focus on effective physical limits, as these are easier to analyse.
How far can AI progress before hitting effective physical limits? If we are already close to the limits, then the potential capability increases will be smaller. But if the limits are very high, then ultimate capability levels (and the maximum pace of progress to reach them) could be extreme.
We can operationalize the distance to physical limits in terms of effective compute.
We argue that, setting aside human constraints:
Software could likely increase effective compute by ~13 OOMs, possibly more.
Chip technology could likely increase effective compute by ~2 OOMs within the current paradigm, and by a total of ~6 OOMs if technology approaches Landauer’s limit (a physical constraint on the energy efficiency of computation). Reversible computing could conceivably go further still.
Chip production could scale by ~5 OOMs using earth-based energy capture, and by a further ~9 OOMs if space-based solar could capture all the energy emitted by the sun.
There are three ways to increase effective compute in our framework: better software, better chip technology, and more chip production.
Software progress
Software could likely increase effective compute by ~13 OOMs, possibly more.
Focusing just on the room for improvement for training efficiency, how much better could software get before it reaches effective physical limits?
Human lifetime learning takes 1e24 FLOP. Assuming the relative gap between GPT-4 and GPT-6 will be the same as the gap between GPT-2 and GPT-4, GPT-6 will be trained with 1e29 FLOP.1
If each GPT-6 computation were as useful as a human computation, then GPT-6 would be ~5 OOMs less training efficient than human lifetime learning, suggesting at least 5 more OOMs to physical limits.2 Of course, GPT-6 might be more or less useful per computation as a human brain, but we’ll use this as a rough estimate.
But the human brain is not the physical limit of training efficiency. Firstly, the data used for human learning could be greatly improved upon: the brain is severely undertrained, humans spend only a small fraction of their time on focussed academic learning,3 and data quality could be much higher. Secondly, brain algorithms could be improved upon. Brains have to satisfy biological physical constraints, and humans have to coordinate via language. Combined with the significant variation between humans and the fact that evolution is a blind search process, it seems that there is significant room for improvement. Davidson (forthcoming) estimates that these factors add another 4-12 OOMs, so 9-17 OOMs total. The midpoint of this range is ~13 OOMs.
There will likely be other kinds of gain besides training efficiency, which would increase this estimate somewhat.
Chip technology progress
Chip technology could likely increase effective compute by ~2 OOMs within the current paradigm, and by a total of ~6 OOMs if technology approaches Landauer’s limit. Reversible computing could go further still.
We’ll measure this in units of FLOP/J – how many floating point operations chips can do per joule of energy. (This is consistent with our calculation of the limits to chip production below, where we consider the scale at which we could increase the energy used by chips.)
Epoch estimates that within the current paradigm, there is room for a ~200X improvement, which is ~2 OOMs.4
But beyond the current paradigm, there could be room for further improvements. Landauer’s limit implies that it’s not physically possible to do more than 3e20 bit erasures per Joule. We want to convert this limit to FLOP/J, so we need to estimate the number of bit erasures per FLOP. As a rough ballpark estimate for the limits of non-reversible computing, if we use 8-bit precision numbers, we might have slightly more than 8 bit erasures per FLOP – 8 because a FLOP takes in two 8-bit numbers but outputs only one; and a little more for some intermediate calculations along the way. Let’s round it to 10 bit erasures per FLOP. That implies we could achieve 3e19 FLOP/J.
Today you can get ~1e13 FLOP/J, which implies ~6.5 OOMs of improvement before reaching effective physical limits.
If reversible computing is possible, the ceiling could be higher still: Landauer’s limit only applies to irreversible logical operations, so reversible computing could surpass this limit.
Chip production progress
Chip production could scale by ~5 OOMs using earth-based energy capture, and by a further ~9 OOMs if space-based solar could capture all the energy emitted by the sun:
Data centres currently consume 1-1.5% of global electricity, and 10-20% of data centres are used for AI, so AI chips currently consume 0.1-0.3% of global electricity.5 This share could perhaps increase 100X, to 10-30%.6
Earth-based solar power could increase the current economy’s energy capture by around 3000X.7
Taken together this is 300,000X, which is ~5 OOMs.
Space-based solar power within this solar system could increase total energy capture a billionfold more again,8 which is a further ~9 OOMs.
Combined progress
These are our estimated limits for each of the three feedback loops:
Our estimates of the total room for improvement for each feedback loop before hitting effective physical limits. The limits for software and chip technology might be higher.
What does this mean for how far an intelligence explosion could go? Elsewhere, we argue that these feedback loops could lead to three particularly plausible kinds of intelligence explosion (IE):
A software IE, where AI-driven software improvements alone are sufficient for rapid and accelerating AI progress.
An AI-technology IE, where AI-driven improvements in both software and chip technology are needed, but AI-driven improvements in chip production are not.
A full-stack IE, where AI-driven improvements in all of software, chip technology and chip production are needed.
To see how far these three intelligence explosions could go, we can simply add up the limits of the feedback loops to estimate the limits for each IE:
The software IE could increase effective compute by ~13 OOMs, possibly more.
The AI-technology IE could increase effective compute by ~19 OOMs or more.
The full-stack IE could increase effective compute by ~24 OOMs using earth-based energy, or ~33 OOMs using all solar energy.
Our estimates of the total room for improvement for each intelligence explosion before hitting effective physical limits. All the limits might be higher.