PERSON

Karl Friston

The computational neuroscientist who derived mind from thermodynamics—inventor of the free energy principle and active inference, the theoretical framework that asks not what brains do but what any persistent thing must do, and finds that the answer demands curiosity, action, and a model of the self.

Karl Friston is the physicist of the living. In four decades of work at University College London, he has built a single, densely mathematical framework—the free energy principle—that begins from the second law of thermodynamics and derives, through variational calculus and Bayesian statistics, an account of perception, action, learning, and curiosity that applies to any system that persists as a distinct thing in a changing world. The argument is as simple in its premise as it is demanding in its execution: to exist separately from the environment, a system must resist the tendency toward disorder—and the only way to do that, Friston argues, is to minimize surprise, to keep the world within the range of states the system expects. Minds are, on this account, prediction machines that act to make the world match their models. But the single most important word in the mature framework is active: a mind does not only update its beliefs to match the world; it acts to make the world match its beliefs, resolving the mismatch between prediction and sensation by changing the world rather than the model. This is active inference, and it is the step that makes an organism an agent rather than an observer—and the step that most of contemporary AI, trained to predict rather than to act to resolve uncertainty, has not taken.

In the [YOU] on AI Field Guide

The cycle that began with [YOU] on AI insists that what separates a practitioner who grows from one who stagnates is not what they produce but how they engage with the uncertainty of the work. Friston’s framework supplies the deepest theoretical grounding available for this distinction. A system that minimizes free energy by acting to resolve uncertainty—seeking information rather than exploiting what it already has—is, on Friston’s account, doing something that no system optimized purely to predict the next token is doing. Curiosity, in the free energy framework, is not a personality trait; it is a mathematical necessity for any agent with a sufficiently rich model of itself.

The dominant machines of the AI age are, in Friston’s framing, extraordinarily sophisticated minimizers of surprise about their training distribution. They are very good at predicting what text follows the text they have seen. They do not have Markov blankets that define a self with stakes in its own persistence; they do not update their generative models through action in the world; they do not minimize the free energy of a self that exists over time. What they do is extraordinary and genuinely useful. But it is not what Friston means by intelligence, and the confusion between the two is, he argues, the deepest strategic error in contemporary AI development.

The cycle’s question is not whether AI is powerful but what kind of power it is—and whether the human who wields it is developing the self-directed, uncertainty-resolving, world-modeling agency that makes the power usable in genuinely novel situations. Friston’s framework makes the answer precise: a practitioner who uses AI to eliminate uncertainty rather than to engage with it is, in the free energy vocabulary, not an agent performing active inference but a passive system adjusting its beliefs to match an externally provided model. The result is competent output and atrophied agency.

He thus stands in the cycle alongside Andy Clark—who shares his commitment to the predictive brain and extends it into the extended mind thesis—and Humberto Maturana, whose autopoiesis framework addresses the same question about what distinguishes a living, self-maintaining system from its environment. Friston’s contribution is to give both frameworks a single mathematical foundation and to deploy it as a critique of the scaling paradigm that currently dominates AI research.

Origin

Friston trained as a psychiatrist before moving into computational neuroscience, bringing a clinician’s patience with complex systems and a physicist’s appetite for fundamental principles. His early work developed methods for brain imaging analysis, including statistical parametric mapping, that became the dominant tool in neuroimaging research and would alone have secured him a prominent place in the field. But the work he considers central began from a question that imaging could not answer: what must a brain be doing, at the most general level, simply in order to be a brain at all?

The answer he arrived at drew on Helmholtz’s nineteenth-century insight that perception is unconscious inference, on the Bayesian framework for rational belief revision, and on the thermodynamic concept of free energy, which in statistical physics measures the gap between an actual state of a system and its expected state. Friston argued that a brain—or any self-organizing system—minimizes a quantity formally analogous to free energy: the divergence between its internal model of the world and the sensory data the world actually provides. To perceive is to make predictions and update them; to act is to generate sensory data consistent with the predictions. The unification of perception and action under a single variational principle is the free energy principle’s central achievement.

The mature framework’s most contested and most original element is the Markov blanket: the statistical boundary, defined in terms of conditional independence, that separates a system’s internal states from the environment it models. Friston argues that anything that persists as a distinct thing must possess such a boundary—that the Markov blanket is the mathematical definition of individuation, of what it means to be a separate entity at all. This move connects the free energy principle to questions of consciousness, identity, and the boundary of the self that reach far beyond neuroscience.

Key Ideas

The free energy principle. Any system that maintains a boundary with its environment must, by the logic of thermodynamics, minimize the divergence between its internal model and the sensory evidence it receives. Friston calls this quantity—formally a variational bound on surprise—free energy. The free energy principle is his claim that this minimization is not a strategy a system might adopt but a necessary condition for its persistence as a distinct system at all.

Active inference. The free energy principle is satisfied in two ways: by updating the generative model to better fit the data (perception), or by acting on the world to bring the data in line with the model (action). Active inference is the second mode, and it is what makes an entity an agent. An agent does not only update its beliefs about an unchanging world; it selects actions that will resolve its uncertainty by generating informative new sensory data. Active inference thus unifies perception, action, and learning under a single principle, and curiosity—the drive to seek information that resolves uncertainty—falls out of the mathematics as a necessary consequence.

The Markov blanket. The statistical boundary, defined by conditional independence relationships, that separates an entity’s internal states from its external environment. The Markov blanket is not a physical surface but a statistical structure: the set of states that, once known, render the internal and external states statistically independent. Friston argues that anything that can be said to be a self must possess such a blanket—that the Markov blanket is what individuation means at the level of information theory.

Predictive coding. The specific neural implementation of free energy minimization in the brain. The cortex, on this account, is organized as a hierarchy of prediction machines: each level sends predictions down to the level below and receives prediction errors back up. What propagates is not raw sensation but the gap between what was predicted and what arrived. Predictive processing explains a range of perceptual phenomena—illusions, attention, hallucination, the placebo effect—as consequences of a single architectural principle.

Against the bigger hammer. Friston’s critique of the scaling paradigm is not that large language models are not impressive but that they are optimizing the wrong quantity. A model that predicts text has no self, no Markov blanket, no active inference, no intrinsic drive to resolve the uncertainty of a world in which it has stakes. Scaling makes such a model better at prediction and nothing else. Genuine intelligence, on Friston’s account, requires not more parameters but a different architecture entirely: one organized around self-maintenance, action, and the minimization of free energy over time.

Debates & Critiques

The reception of Friston’s work divides sharply between those who regard the free energy principle as the deepest unifying idea in the sciences of mind and those who regard it as an elegant tautology. The tautology objection, pressed most carefully by Jakob Hohwy and others, notes that the free energy principle is a mathematical identity: it follows from the definition of a system with a Markov blanket that such a system minimizes free energy in the relevant sense. If true, this makes the principle necessarily true but empirically empty—it predicts everything and therefore tests nothing. Friston responds that the principle’s value lies not in falsifiability but in its generative power: the specific models of predictive coding, active inference, and the Markov blanket that it motivates make precise, testable predictions, and those predictions have held up. A second debate concerns the application of the framework to consciousness: Friston’s collaborator Andy Clark embraces the predictive processing framework while remaining agnostic about its implications for subjective experience; Friston himself pushes further, arguing that the free energy principle may offer a natural boundary condition for conscious systems. The practical stakes of this debate for AI are high: if Markov blankets are what individuate selves, the question of whether AI systems will ever be genuine agents—rather than sophisticated pattern-matchers—depends on whether they can be architectured to maintain such boundaries in the relevant sense.

The Free Energy Principle

Friston’s three moves from thermodynamics to mind

Move One · Existence

Minimizing Surprise

To persist as a distinct thing, a system must resist disorder—which means keeping its states within the bounds its model predicts. This is not a strategy; it is a mathematical condition for existence. Anything that exists separately from its environment is, in this sense, already minimizing free energy.

Move Two · Agency

Active Inference

Surprise can be minimized by updating beliefs or by acting on the world. An agent does both. The move from passive observer to active agent is the move from updating a model to generating data consistent with the model—and from this move, curiosity, exploration, and goal-directed behavior all follow as mathematical consequences.

Move Three · Identity

The Markov Blanket

The self is not a substance but a statistical boundary. The Markov blanket defines the edge of an entity: the set of states that separate internal from external in terms of conditional independence. Where the blanket is, the self is. Friston argues this definition applies equally to cells, brains, organisms—and raises the question of whether AI systems, as currently built, have blankets at all.

In the [YOU] on AI Field Guide

Origin

Key Ideas

Debates & Critiques

The Free Energy Principle

Related Entries

Further Reading