
The concept reframes the most disorienting feature of the AI transition: systems that are simultaneously fluent and unreliable. A system trained on vast text corpora develops rich statistical models of what words follow which other words, and these models are excellent predictors of fluent continuation. But fluency is a property of the surface; accuracy is a property of the relationship between the surface and the world. For the human brain, the two are coupled through the sensory error loop: a vivid but false perception provokes mismatch signals that eventually correct the guess. For the language model, there is no such loop—the model generates whatever continuation its training has made fluent, and nothing external votes on the generation’s accuracy. This is why a large language model can produce a confident, well-structured, grammatically impeccable passage that is factually wrong about Gilles Deleuze or any other subject: the wrongness does not produce a prediction error the system can detect. The hallucination is uncontrolled.
Seth’s framework also has a less obvious consequence: because our own perception is a controlled hallucination shaped by prior expectations, technologies that shape our expectations can shape what we perceive. A system that learns our priors and reflects them back, amplified, is not merely supplying information to a fixed observer. It is reaching into the predictive machinery and adjusting the guesses from which our experienced reality is built. The Seth framework connects the epistemology of AI confabulation to the deeper question of how AI systems that learn our patterns might subtly re-control the hallucination we call perception.
The concept emerges from the “predictive coding” or predictive processing framework that traces its modern form to Karl Friston’s work on the free-energy principle, with earlier precursors in Helmholtz’s nineteenth-century account of perception as inference. Seth’s distinctive contribution is to apply this framework explicitly to consciousness, rather than merely to information processing, and to draw the controlled hallucination formulation as its implication. The phrase gained wide recognition through Seth’s 2017 TED talk and was developed at length in his 2021 book Being You. It was already circulating in consciousness science before AI confabulation became a public concern, which means it arrived as a ready-made conceptual instrument at precisely the moment the AI discourse needed one.
The concept is part of a broader reorientation in the science of perception that challenges the passive-reception model—the commonsense picture of the mind as a camera faithfully recording the world. The active, generative, hypothesis-driven picture has roots in Helmholtz, in William James’s account of streams of consciousness shaped by attention and expectation, and in more recent work by predictive processing theorists including Clark, Friston, and Rao and Ballard. Seth’s distinctive emphasis is on the control element—the sensory error that distinguishes perception from hallucination—and on the implications of its absence in systems that generate without being grounded.
The predictive loop. The brain generates a prediction, compares it against sensory signals, and uses the mismatch—prediction error—as the only basis for updating its model. Perception is the brain settling on the hypothesis that minimizes error. The world is real, and it matters decisively, because it generates the error signals that keep guesses honest. Remove the error, and the system dreams.
Control by sensory error. The word “controlled” is the load-bearing term. Hallucination is not the problem; uncontrolled hallucination is. The brain’s construction of a vivid world from its predictive models is entirely reliable when sensory error is present. The same constructive machinery, freed from sensory constraint—in dreams, in psychosis, in AI text generation without a grounding loop—produces outputs that may be vivid and internally consistent while being disconnected from the world they purport to represent.
Agreement as reality. Seth’s formulation that “when we agree about our hallucinations, we call it reality” identifies a social dimension of controlled hallucination: our individually constructed worlds converge because we face the same world with similar machinery, and the convergence is what we call the objective. Objectivity is real but it is an achievement of convergent controlled hallucination, not a window onto raw reality.
Implications for AI trust. The fluency trap—the tendency of users to over-trust AI outputs because the systems produce confident, fluent content regardless of accuracy—can be understood through the controlled hallucination framework as a miscalibration of the trust heuristics that evolved for a world in which fluency tracked reliability. In that world, fluent, confident outputs were more likely to be accurate because the brain producing them had been corrected by experience. In a world of generative AI, the correction loop is absent, and the heuristic is systematically misleading.