CONCEPT

The Is-Ought Gap

David Hume’s 1739 observation that no description of how things are can logically entail a conclusion about how they ought to be—a gap that no amount of training data, behavioral imitation, or preference learning can bridge, and that makes the specification of machine values an irreducibly human act of choice rather than a technical derivation.

In a paragraph so brief it has the texture of an aside, David Hume noticed something in 1739 that remained unresolved for three centuries and has now become the deepest structural problem in AI alignment. Writers on morality, he observed, begin by making claims about what is the case and then suddenly shift to claims about what ought to be, as if the second follows from the first without explanation. He pointed out that it never does. A valid argument cannot contain in its conclusion what is not present, at least implicitly, in its premises. If every premise is a statement of fact, the conclusion can only be a further statement of fact; there is no rule of inference that takes you from description to prescription, from “this is how the world is” to “this is how the world should be.” Values introduce something new, and that element must come from somewhere other than the facts. It cannot be derived; it must be supplied. This is the is-ought gap, and it applies to every attempt to specify the values of an artificial system. The training data is entirely descriptive—it records what people have done, what they have chosen, what language they have produced. The claim that what they did is what ought to be done is an additional normative premise, not present in the data, supplied by whoever chose which data to treat as authoritative. The values do not emerge from the facts; they are injected by a human decision that is often invisible and always irreducibly normative. No scaling, no sophisticated preference learning, no RLHF procedure bridges this gap, because the gap is logical rather than empirical. AI alignment is not, at its root, a technical problem. It is a problem about the is-ought gap, and Hume named it first.

In the [YOU] on AI Field Guide

The [YOU] on AI cycle asks what it means to live well among machines. The is-ought gap forces the most fundamental answer: living well requires making normative choices that cannot be outsourced to data, derived from behavior, or delegated to a system that will make them automatically. Every value a machine acts on is there because a human put it there, crossing the gap by an act of will rather than a derivation. The pretense that values are learned from data disguises a profound responsibility as a technical procedure.

The gap also explains a structural feature of the AI moment that the cycle returns to repeatedly: the tendency to treat descriptive AI capabilities as if they established normative conclusions. The observation that a system can perform a task better than a human does not entail that the system ought to perform that task, or that the human ought to defer to it, or that the arrangement is good. These conclusions require normative premises that are not present in the performance data. Hume’s point is that the inference is invalid as stated, and that making it invisible does not make it valid.

The cycle’s practical upshot is that value specification is irreducibly political, ethical, and human. Who decides what the machine ought to optimize? What process legitimates that decision? Whose values are represented in the training data treated as authoritative? These are not engineering questions awaiting technical solutions. They are normative questions that must be answered normatively, by human beings who accept responsibility for crossing the gap—and who cannot, by invoking the sophistication of the technical apparatus, pretend the crossing did not happen.

Origin

The observation appears in section 3.1.1 of Hume’s Treatise of Human Nature (1739), in the opening book of his discussion of morals. Hume had spent the previous sections arguing that moral distinctions are not derived from reason but from moral sentiment—from the feelings of approval and disapproval that arise from our constitution as social creatures. The is-ought observation is the negative side of that doctrine: since values come from neither reason alone nor the facts of the world, they must arise from the passional nature that is prior to any calculation.

The observation was not widely noticed in Hume’s own century; it became central to twentieth-century metaethics under the label “Hume’s guillotine” or the “naturalistic fallacy” (the latter term coined by G.E. Moore in 1903 for the related error of equating ‘good’ with any natural property). In the twenty-first century, as AI researchers attempt to specify machine values from training data, the gap has acquired new urgency and new engineering stakes. It is no longer a puzzle for philosophers alone but a structural obstacle in the most consequential technical project of the age.

Key Ideas

The logical structure. Valid deductive arguments cannot produce conclusions that go beyond what is present in the premises. Factual premises yield only factual conclusions. Normative conclusions require at least one normative premise. Any argument that moves from a purely factual description to a normative conclusion has made an implicit normative assumption that should be made explicit and examined. When that assumption is hidden—as it typically is in discussions of what AI “should” do based on data about what people have done—the responsibility for the assumption is also hidden.

Applied to preference learning. The dominant approach to AI value alignment involves learning from human preferences—ratings, rankings, demonstrations, feedback. Hume’s gap explains why this approach cannot, by itself, close the value specification problem. The data encodes what people have preferred; the system learns to predict and replicate those preferences; the claim that those preferences are what ought to be pursued is a normative addition supplied by the designers’ decision to treat observed preferences as authoritative. That decision is itself a value choice—the choice to defer to revealed preference rather than considered preference, to the preferences of those in the training set rather than those excluded from it. The is-ought gap is not bridged by more data; it is crossed by the decisions about which data counts.

Responsibility follows. If values cannot be derived from facts, every value a machine acts on was put there by a human choice. This means the responsibility for those values is fully human and cannot be dissolved into the sophistication of the technical process. An organization that deploys a system with values that cause harm cannot claim the system “learned its values from data;” it chose which data to treat as authoritative, chose how to weight conflicting preferences, and chose when the system was value-aligned enough to deploy. The is-ought gap makes these choices visible as choices rather than as derivations, and assigns responsibility accordingly.

Debates & Critiques

The main philosophical debate is whether the gap is absolute or merely marks the inadequacy of purely deductive derivation of values. Some philosophers argue that there are better and worse normative arguments, that coherentism about ethics can produce constrained value choices without pretending to derive them logically from pure facts, and that the gap does not make moral epistemology hopeless. Others maintain that the gap remains: coherentism still requires normative inputs that cannot themselves be derived from facts, and the regress terminates in choices rather than derivations. A more practical debate in the AI context concerns whether the gap matters operationally. Critics argue that preference learning, despite its philosophical incompleteness, produces systems that behave in value-aligned ways across a wide range of situations, and that the philosophical point about logical derivation is less important than empirical track record. Hume’s defenders respond that the cases where the gap bites hardest are precisely the cases where empirical track record is least reliable: novel situations, edge cases, conflicts between different populations’ preferences, and the systematic exclusions built into any training corpus. These are also the cases where alignment failures are most consequential, which is why the philosophical point has practical stakes.

In the [YOU] on AI Field Guide

Origin

Key Ideas

Debates & Critiques

Related Entries

Further Reading