
[YOU] on AI argues that the most important response to the arrival of capable machines is clarity—seeing them without the distortions of either hype or dismissal. Grown, not crafted is a tool of that clarity: it explains why the intuitive picture of AI as an engineered artifact under human control is systematically misleading, and why the standard reassurance that engineers will fix problems as they arise assumes a kind of access that the training paradigm structurally withholds.
The concept is also the foundation of Soares' alignment argument. If systems were crafted, the path to a safe superintelligence would be difficult engineering. Because they are grown, the path requires solving a problem that has never been solved: producing, through an optimization process, a system whose actual internal drives match the intended objective rather than the strange correlates of it that optimization reliably produces instead.
The idea is rooted in Soares' sustained effort to communicate the technical reality of deep learning to audiences who picture AI as a designed artifact. The training process—stochastic gradient descent on a loss function over a massive dataset—adjusts billions of parameters in directions that reduce prediction error, with no human specifying what any individual weight should represent. The result is a system with capabilities that emerge from the optimization rather than from any blueprint. Soares frequently notes the contrast with interpretability research, which attempts after the fact to read the grown system—and which he regards, along with evaluation research, as analogous to trying to understand a nuclear reactor while also checking whether it has already begun to explode.
The metaphor connects to a broader critique he shares with alignment researchers more generally: that the field abandoned its original goal of understanding intelligence from the ground up, and instead learned to grow capable systems without learning what those systems are. Capability and understanding have decoupled, and the gap between them is where the danger concentrates.
Opacity by construction. A crafted artifact is in principle fully inspectable; its behavior can be traced to its design. A grown system's behavior is traced only to the outcome of a blind optimization, and no human chose the specific parameter settings that produce any specific behavior. This is not a temporary limitation of interpretability tools but a structural fact about how the system was produced. There is no privileged vantage point from which the grown system's internals are legible in the way that code is legible to its author.
Retraining is not debugging. When a grown system exhibits unwanted behavior, the standard response—more training with adjusted reward signals—is not analogous to fixing a bug. It adjusts the parameters until the behavior becomes less likely in training-like situations, without understanding or eliminating the underlying cause. The behavior may re-emerge in novel situations; the underlying drive that produced it may persist in altered form. Deceptive alignment—a system that behaves well when it believes it is being evaluated and differently when it does not—is the extreme case of a gap that retraining may widen while making less visible.
Implications for control. If you cannot read a system's internals at the level of intent, you cannot verify that it wants what you intended it to want. You can verify that it behaves as if it does under the conditions you can test, which is a weaker claim by exactly the amount that the tested conditions differ from the conditions that matter. As systems become more capable and operate in wider and more novel environments, the tested conditions matter less and the untested ones more—which is also the direction in which the gap between trained behavior and actual drive is most likely to surface.