The cycle asks why models are so fluent and so unreliable about fact at the same time. The language-entry/language-exit framework answers directly. Fluency comes from mastery of the intra-linguistic web—the system has learned the full distributional structure of language with extraordinary fidelity. Unreliability about fact comes from the absent entry transitions: the model cannot reliably tell you how things are because it has no independent access to the world that would allow it to check the patterns against reality. Its competence is competence about discourse, and its failures are exactly the failures you would expect from a mind that has read everything and seen nothing.
The embodied and multimodal turn in AI deserves honest attention within this framework. A system equipped with cameras and sensors and actuators, situated in an environment, does begin to acquire something like genuine language-entry transitions: it is prompted by the world it perceives and acts back upon it. By Sellars’s own lights this is the right direction—the missing ends of the language game being restored. But two cautions follow: first, perception even for an embodied system is a conceptual achievement, not a bare given (the Myth of the Given is a trap the roboticist can fall into as easily as the empiricist); second, language-entry in Sellars’s full sense is entry into the space of reasons—a perceptual report one can be entitled to and held to—not mere causal sensitivity to the environment.
The three-part decomposition of language appeared across several of Sellars’s texts, most explicitly in the context of his critique of meaning as naming and his positive account of meaning as role. The language-entry/exit framework drew on behaviorist vocabulary—stimulus-response—but gave it an entirely different philosophical significance: the point was not to reduce meaning to behavior but to show that a full account of meaning required both the behavioral periphery (entry and exit) and the inferential interior, and that the interior alone was insufficient. This is why Sellars was not a behaviorist despite using behavioral vocabulary. He was arguing against the behaviorist reduction of meaning to stimulus-response by showing that even on a behavioral account, meaning requires the full three-part structure, including the rich inferential web that behaviorism itself tended to ignore.
The concept connects to what Sellars called the “semantic dimension” of language in opposition to the “syntactic dimension”: syntax is the structure of valid inference among symbols, which can in principle be specified formally; semantics is the entry and exit transitions that connect those symbols to a world and to action, which cannot be specified by the inferential rules alone and which require the learner to be embedded in a world and a community.
Language-entry as world-contact. The entry transition is where a perceiver, embedded in the world, is prompted by the world to make an utterance. This is not a passive registration (which would be the Myth of the Given) but an active, conceptually structured response to what is perceived. The model’s training data is the frozen residue of countless human entry transitions. The model has inherited the outputs of language-entry without ever performing the act of it.
Language-exit as world-consequence. The exit transition is where an utterance issues in action in the world the speaker shares with others. Saying “I promise” creates an obligation. Saying “I’ll move it” moves something. The speaker is answerable for the consequences of her utterance in a way that has nothing to do with the truth-value of the sentence and everything to do with her participation in a shared world where words bear consequences. The model’s utterances bear no consequences for the model.
Grounding as a necessary but not sufficient condition. The embodied multimodal turn in AI aims to restore language-entry by giving models sensory access to the world. By Sellars’s framework this is necessary but not sufficient for full semantic competence. Necessary because without genuine world-contact, the inferential web floats free. Not sufficient because genuine language-entry requires the entry to be a move in the space of reasons—a perceptual report one is entitled to and answerable for—not merely a causal response to a sensory input.
The primary debate concerns whether the lack of genuine language-entry and exit is a contingent technological limitation (addressable by embodiment, multimodality, and richer interaction with the world) or a structural feature of systems built from text corpora (which would make the limitation permanent regardless of architectural additions). Sellars’s framework does not settle this: it specifies what is required for full meaning (all three kinds of transition) without claiming that artificial systems cannot in principle have it. What it does settle is the diagnosis of the current situation: the model has the intra-linguistic competence and lacks the perceptual and agentive grounding, and no amount of fluency in the first substitutes for the second. A second debate concerns whether genuine language-exit—utterances that bear world-consequences for the system—can be achieved by an AI system at all, or whether it requires mortality, bodily vulnerability, and the kind of stakes that come only with having a life at risk. This connects the semantic question to the existential question William Barrett poses from a different angle.