CONCEPT

Thick Alignment

Alondra Nelson’s rebuke to the dominant definition of AI alignment—insisting that the values a system is aligned to must themselves be examined, contested, and chosen through a process that includes the people affected, not merely the people who built it.

Thin alignment asks whether an AI system does what its builders intend. Thick alignment asks the prior question: whether what the builders intend is itself aligned with the values, contexts, and lives of the people the system will touch. The distinction, introduced by sociologist Alondra Nelson, cuts directly against the dominant framing of AI safety, which treats the hard problem as technical—how to make the model reliably pursue the specified objective—and treats the specification of the objective as settled or obvious. But the specification is precisely what is not obvious. Whose values get encoded. Whose notion of safety counts. Whose harms register as harms. These are not engineering questions, and they cannot be answered by the people doing the engineering, because those people are a tiny, unrepresentative sliver of the humanity the systems will affect. Thick alignment draws on the sociotechnical tradition, which refuses to separate the technical system from the social world it operates in and instead studies them as a single fabric. A thickly aligned system is one whose builders’ wants have been subjected to scrutiny by the broader public whose lives are at stake, so that the objective being pursued has some claim to legitimacy beyond the preferences of those who happened to be in the room. On the question of values, the affected public is also an expert—an expert in its own lives, its own values, its own experience of being governed by large language models—and a democracy that forgets this is no longer governing its technology at all.

In the [YOU] on AI Field Guide

[YOU] on AI rests on the conviction that these tools will belong to all of us only if all of us insist on a hand in shaping them. Thick alignment names the mechanism by which that insistence becomes structural rather than rhetorical. It reframes what “safe AI” means: not a system that does not deceive or harm its builders, but a system whose objectives have been legitimated by the people it governs. The cycle asks what it would mean to see the machine clearly; thick alignment asks what it would mean to govern it well, and argues that the two questions cannot be separated.

Where algorithmic governance now touches credit, policing, healthcare, and hiring, thin alignment produces systems that do what their builders want while systematically failing the people most affected. Thick alignment demands that those people have genuine standing to shape outcomes—not the comment period nobody reads, not the advisory board with no power, but input that can change what gets built before it is deployed. Nelson knows the standard is rarely met. She holds to it because the alternative—a technological future designed for the public without the public—is not a future a democracy can accept while remaining one.

Origin

Nelson developed the concept by noting that the word “alignment” had done remarkable work in the AI field while smuggling in a hidden assumption: that the values to which a system should be aligned are already given, obvious, or safely determined by experts. In its standard technical usage, alignment means getting a system to pursue the objectives its designers specify, avoid the behaviors they forbid, and remain controllable in their hands. Stated this way it sounds unobjectionable. Nelson’s objection is not that this goal is wrong but that it is radically incomplete—and that the incompleteness is disguised by the reassuring sound of the word.

The concept draws on her earlier work tracing how the prestige of science has been used to make contestable claims appear as settled facts. In medicine, the language of objective science cloaked profoundly unequal treatment—some bodies cared for, others neglected or harmed, all under the banner of a single impartial science. Thick alignment is her attempt to prevent the same dynamic from recurring in AI: to insist that the value questions cannot be absorbed into the technical work and thereby hidden from view, but must be answered through a process that includes those who will live with the result.

Key Ideas

The specification is the problem. Thin alignment takes the values question as answered and focuses on the engineering question. But the specification of the objective is precisely what is not settled. An objective function encodes someone’s notion of what counts as success; a training dataset is a record of a particular society’s choices; a safety definition reflects whose harms the designers have learned to see and whose they have not. The engineering follows the specification; the specification follows power; and power follows who is in the room. Thick alignment moves the conversation upstream, to the room.

Affected communities as experts. Nelson’s most consequential argument is that on the question of values, technical expertise confers no special authority. A system’s designers may be world-class engineers; that expertise tells them nothing about whether a given distribution of benefits and harms is just. The person whose loan was denied by an algorithm knows something about that system that its designers do not—what it is like to be on the receiving end, what the error costs, what recourse was or was not available. That knowledge is a form of expertise, and a governance process that excludes it is systematically incomplete.

Legitimacy as infrastructure. A technology deployed without legitimate public standing is fragile—it generates the kind of resistance and backlash that ultimately undermine deployment. Democratic legitimacy, on this account, is not a luxury that successful technology can do without; it is part of the infrastructure of durable deployment, and skipping it stores up costs to be paid later. Thick alignment is therefore not only a moral demand but a practical one: systems that have been thickly aligned are more trustworthy and therefore more durable than those that have not.

Debates & Critiques

Critics from the technical side argue that thick alignment is practically unrealizable: the systems are too complex for meaningful public participation, the decisions too fast-moving, and the public too ill-equipped to evaluate tradeoffs. Nelson’s response is that this objection conflates the difficulty of the standard with its illegitimacy. A bridge must be structurally sound, but the decision about where to build it is not a structural question, and an engineer who claimed technical expertise settled that question would be overstepping. Thin alignment is the AI equivalent of that overstep. A second critique challenges thick alignment from the left: that even meaningful participation cannot overcome the structural inequalities that determine who gets to speak effectively in any deliberative process. Nelson takes this seriously and insists on the difference between processes that can change outcomes and those that ratify decisions already made. The open question is whether institutional design can produce the former reliably enough to matter, and whether the pace of AI deployment structurally forecloses the deliberation thick alignment requires.

In the [YOU] on AI Field Guide

Origin

Key Ideas

Debates & Critiques

Related Entries

Further Reading