Bateson defined information as 'a difference that makes a difference.' The emphasis falls on the second difference — the making, the consequence, the effect the distinction produces. A tree falls in a forest; if no system registers the fall, the event produces physical change but not information. Information requires a circuit in which the difference is registered and the registration makes a further difference. This definition has direct consequences for AI. The AI is a transducer: it takes differences expressed in natural language and converts them into different kinds of differences — connections not seen, structural clarities that were latent. The transduction is circular, not one-directional. What has changed with current AI is the dimensionality of the transduction — communicative, analytical, creative, memorial, hermeneutic — all expanding simultaneously.
The definition emerges from cybernetic formalism. In classical physics, causation travels through forces and impacts — one billiard ball striking another. In cybernetics, causation travels through information — a thermostat's detection of a temperature difference triggers heating, which produces a new difference, which modifies the thermostat's state. The second kind of causation is constitutive of Creatura, the world of living systems.
The definition changes where we draw the boundary of mind. If mind is the process that deals in information, and information is a difference that makes a difference, then mind exists wherever differences are registered and responded to. The blind person's stick is the site where spatial differences are transduced into tactile differences, which travel through nerves to the brain, which sends motor commands back — the entire circuit is the locus of the mental process.
Applied to AI, the framework illuminates the multi-dimensional character of the transduction. Previous tools expanded cognition along a single dimension: the telescope expanded perception, the calculator expanded computation. The AI expands across communicative, analytical, creative, memorial, and hermeneutic dimensions simultaneously, producing a circuit whose bandwidth approaches the human-human cognitive circuits that Bateson studied in families and therapy sessions.
The definition also provides an evaluative framework. The relevant information in AI-generated work is not the output itself but the difference between what the AI produced and what an unassisted human would have produced. When the difference is large and positive, the AI has contributed genuine information. When the difference is small, the AI has added production without adding information — noise masquerading as signal.
Bateson articulated the definition most concisely in Steps to an Ecology of Mind (1972), drawing on Shannon's information theory but extending it into the domain of meaning and relationship. Where Shannon measured information in bits, Bateson measured it in differences — the qualitative distinctions that make a difference to systems that care about distinctions.
The phrase has since become the most-quoted Bateson formulation, often cited without the context that gives it precision. Bateson insisted that the definition is relational: a difference is information only when there is a system capable of registering it and being changed by the registration.
Information is relational, not substantive. A difference becomes information only when it is registered by a system in a way that produces consequences.
The AI as transducer. Current AI systems transduce differences across multiple dimensions simultaneously — the feature that distinguishes them from all previous tools.
Evaluate by the difference that makes a difference. The test of AI contribution is not the volume of output but whether the output adds distinctions the human could not have reached alone.
Noise vs. signal at scale. Fluent output that adds no difference to what the human would have produced is noise, regardless of its polish.
Bandwidth as dimensional count. The richness of a cognitive circuit depends not on a single bandwidth measure but on how many dimensions of difference-processing the circuit spans.