
The cycle’s engagement with AI capabilities requires a theory of why emergent capabilities appear where they do and not elsewhere, and why blocking one approach to a capability rarely prevents the capability from appearing. Equifinality is the theoretical foundation for both observations. The scaling curve is not a unique road to a fixed destination; it is evidence that frontier-scale competence is an attractor reachable from many starting points. This is why scaling laws are so consistent across architectures: the law is a property of the attractor, not of the specific training procedure.
Equifinality also reframes what it means for a capability to be dangerous. If a dangerous capability is an equifinal attractor, then it is dangerous in a deep structural sense: not dangerous because one organization or one architecture enables it, but dangerous because many starting conditions converge on it. This is the governance insight the concept forces: the interventions that matter are those that affect the attractor itself—the conditions of the regime under which the attractor exists—not the interventions that block specific paths to it. Norbert Wiener’s cybernetics offers a complementary angle: feedback-controlled systems can be steered, but they must be steered at the level of the control signal rather than at the level of any particular mechanical linkage.
Bertalanffy introduced equifinality in his work on developmental biology in the 1920s and 1930s, drawing on Hans Driesch’s experiments with sea urchin embryos: Driesch had shown that a bisected blastula could develop into two complete, smaller organisms rather than two half-organisms, and that development could proceed normally even after significant early perturbation. Classical mechanism had no account of this; the classical model said that the developmental program was encoded in the initial conditions, so disturbing the conditions should disturb the outcome. Equifinality named the observed fact that this was wrong: the end state was in some sense “aimed at” by the system as a whole, reached by multiple paths, and robust to perturbation of the starting point.
The formalization came through Bertalanffy’s open-system thermodynamics. A closed system can reach only one equilibrium state from given initial conditions; an open system, constantly exchanging matter and energy with its environment, can approach the same steady state from many starting points. This is the formal basis of equifinality: it is a property of open systems, and of the specific kind of open system that living things are, maintained far from equilibrium by continuous exchange.
Path-independence of the attractor. The competence that a sufficiently trained model achieves is more a property of the training regime—the scale, the data distribution, the broad family of methods—than of the specific architecture or the specific training run. Different paths through the same regime converge on the same attractor. This is why representational convergence research finds that independently trained networks often learn representations that are recognizably the same up to a rotation: the data’s statistical structure is pulling every sufficiently large learner toward the same internal organization.
Governance implications. If dangerous capabilities are equifinal attractors, blocking one approach to them is like blocking one road to a city while leaving all the others open. The governance interventions that matter are those that affect the attractor’s existence—the computational and data conditions under which the regime produces the attractor—not those that restrict specific architectural choices or specific training procedures.
Equifinality and the consciousness question. Bertalanffy saw equifinality as evidence for functionalism avant la lettre: if many different physical realizations converge on the same competence, then competence is multiply realizable—a property of organization rather than substrate. This is the foundational wager of AI: that intelligence is a pattern of organization that silicon can instantiate as well as carbon. Bertalanffy would have endorsed the wager for competence while insisting on the open question of inwardness—whether the organization of mind and the experience of mind are equifinal in the same sense, or whether inwardness is something that organization, however achieved, does not by itself produce.
The sharpest debate equifinality provokes is whether representational convergence in independently trained networks is genuine equifinality—convergence on a common organizational attractor dictated by data structure—or an artifact of shared architectural inductive biases. If different architectures trained on the same data converge on similar representations, does this show that the data imposes the organization, as Bertalanffy would predict, or that the architectural choices made by the field have been more uniform than the diversity of apparent options suggests? The empirical evidence for convergence is robust; the interpretation remains contested. A second debate concerns the governance implications. Equifinality does not mean that capability attractors cannot be managed at all; it means that the management must target the regime rather than the path. Some AI governance proposals, particularly those focused on compute thresholds and data access rather than specific model architectures, are implicitly Bertalanffian in this sense: they attempt to shift the conditions under which capability attractors exist rather than blocking any particular road to them. Whether this is politically feasible is a separate question from whether it is theoretically correct, and Herbert Simon’s work on near-decomposability suggests that even equifinal systems may have leverage points where intervention is more effective than in others.