
The concept reframes the most disorienting feature of the AI transition: systems that are simultaneously fluent and unreliable. A system trained on vast text corpora develops rich statistical models of what words follow which other words, and these models are excellent predictors of fluent continuation. But fluency is a property of the surface; accuracy is a property of the relationship between the surface and the world. For the human brain, the two are coupled through the sensory error loop: a vivid but false perception provokes mismatch signals that eventually correct the guess. For the language model, there is no such loop—the model generates whatever continuation its training has made fluent, and nothing external votes on the generation’s accuracy. This is why a large language model can produce a confident, well-structured, grammatically impeccable passage that is factually wrong about Gilles Deleuze or any other subject: the wrongness does not produce a prediction error the system can detect. The hallucination is uncontrolled.
Seth’s framework also has a less obvious consequence: because our own perception is a controlled hallucination shaped by prior expectations, technologies that shape our expectations can shape what we perceive. A system that learns our priors and reflects them back, amplified, is not merely supplying information to a fixed observer. It is reaching into the predictive machinery and adjusting the guesses from which our experienced reality is built. The Seth framework connects the epistemology of AI confabulation to the deeper question of how AI systems that learn our patterns might subtly re-control the hallucination we call perception.
The concept emerges from the “predictive coding” or predictive processing framework that traces its modern form to Karl Friston’s work on the free-energy principle, with earlier precursors in Helmholtz’s nineteenth-century account of perception as inference. Seth’s distinctive contribution is to apply this framework explicitly to consciousness, rather than merely to information processing, and to draw the controlled hallucination formulation as its implication. The phrase gained wide recognition through Seth’s 2017 TED talk and was developed at length in his 2021 book Being You. It was already circulating in consciousness science before AI confabulation became a public concern, which means it arrived as a ready-made conceptual instrument at precisely the moment the AI discourse needed one.
The concept is part of a broader reorientation in the science of perception that challenges the passive-reception model—the commonsense picture of the mind as a camera faithfully recording the world. The active, generative, hypothesis-driven picture has roots in Helmholtz, in William James’s account of streams of consciousness shaped by attention and expectation, and in more recent work by predictive processing theorists including Clark, Friston, and Rao and Ballard. Seth’s distinctive emphasis is on the control element—the sensory error that distinguishes perception from hallucination—and on the implications of its absence in systems that generate without being grounded.
The predictive loop. The brain generates a prediction, compares it against sensory signals, and uses the mismatch—prediction error—as the only basis for updating its model. Perception is the brain settling on the hypothesis that minimizes error. The world is real, and it matters decisively, because it generates the error signals that keep guesses honest. Remove the error, and the system dreams.
Control by sensory error. The word “controlled” is the load-bearing term. Hallucination is not the problem; uncontrolled hallucination is. The brain’s construction of a vivid world from its predictive models is entirely reliable when sensory error is present. The same constructive machinery, freed from sensory constraint—in dreams, in psychosis, in AI text generation without a grounding loop—produces outputs that may be vivid and internally consistent while being disconnected from the world they purport to represent.
Agreement as reality. Seth’s formulation that “when we agree about our hallucinations, we call it reality” identifies a social dimension of controlled hallucination: our individually constructed worlds converge because we face the same world with similar machinery, and the convergence is what we call the objective. Objectivity is real but it is an achievement of convergent controlled hallucination, not a window onto raw reality.

Implications for AI trust. The fluency trap—the tendency of users to over-trust AI outputs because the systems produce confident, fluent content regardless of accuracy—can be understood through the controlled hallucination framework as a miscalibration of the trust heuristics that evolved for a world in which fluency tracked reliability. In that world, fluent, confident outputs were more likely to be accurate because the brain producing them had been corrected by experience. In a world of generative AI, the correction loop is absent, and the heuristic is systematically misleading.
The main philosophical objection to the controlled hallucination framing is that calling ordinary perception a hallucination conflates the normal and the pathological in a misleading way: the clinical meaning of hallucination is perception without an appropriate external cause, and normal perception clearly has an external cause. Seth’s response is that this objection preserves exactly the passive-reception picture he is trying to displace: the point is not that normal perception lacks external causes but that those causes do not directly produce experience—they correct a generative process that is running continuously from the inside. The debate recapitulates a deeper question about whether the brain is better understood as a hypothesis-testing prediction machine (the active inference view) or as a signal-processing system that adds interpretation to received data (the more traditional view). For AI, the dispute has a practical resolution: whatever one thinks of the metaphysics of human perception, the absence in language models of any sensory error loop is empirically clear, and the absence explains the confabulation pattern that the controlled hallucination framework predicts.