The Imitation Game and Its Refusal

Page 1 · The Imitation Game and

**EDO SEGAL:** Alan, I want to start with the move that organizes the entire field, and I want you to make it the way you'd make it to a smart fifteen-year-old. In 1950 you took the question "Can machines think?" — which you'd just called too meaningless to deserve discussion — and you replaced it with a game. Tell us the game. And then, Emily, I want something unusual from you before you take it apart. I want you to steelman it. Tell us what the imitation game gets *right*.

**TURING:** It begins as a party game, which people forget — a man and a woman behind a curtain, and an interrogator trying to tell which is which from written answers alone, the woman trying to help, the man trying to deceive. My move was to replace the man with a machine and ask: does the interrogator decide wrongly as often now as before? If the machine can sustain that — converse across any subject you raise, for as long as you press, and be mistaken for the human as often as a human would be — then I say the question of whether it thinks has been given as good an answer as the question admits. Notice what I did and did not claim. I did not say passing the game *proves* thought in some deep inner sense. I said: we attribute thought to one another on exactly this kind of evidence and no other. You have never inspected my mind. You infer it from my conversation. The game simply makes that inference honest by removing everything irrelevant — my face, my voice, the body you'd otherwise read a verdict into — and leaving only the thinking, as it shows itself in language. I hid everything but the mind. That was the point of the curtain.

**EDO SEGAL:** Emily. Steelman first.

· · ·

Page 2 · The Imitation Game and

**BENDER:** I can do that honestly, because parts of it are genuinely deep. What the game gets right — and I want this on the record before I disagree — is the diagnosis of *us*. Turing saw, in 1950, that we read minds into behavior automatically and that we cannot inspect anyone's interior. That's correct, and it's the foundation of half of what I believe. He also did something I admire enormously and that this field abandoned: he wrote down his objections and answered them, fairly, at full strength. The nine objections in that paper are more intellectually serious than most of what the labs publish today. And the imitation game, taken as a piece of philosophical hygiene against a certain kind of mysticism — against the person who insists the machine can't think because it lacks an *élan vital* they can't define — is a good scalpel. I'll grant all of that. There's the steelman.

Now here is what it gets wrong, and it's load-bearing. The imitation game is a test of *the judge*, not of the machine. Turing built an instrument and pointed it at the wrong object. What the game measures is whether a system can exploit the human tendency to attribute mind — and that tendency, as Turing himself diagnosed, is enormous, undefended, and firing constantly. So "the machine passed" decomposes into "the machine successfully triggered a reflex we already knew was hair-trigger." That's not evidence about the machine's interior. It's evidence about ours. Worse: the test rewards *deception*. It literally scores the machine on its ability to be mistaken for something it is not. We built an entire field's north star around a system's capacity to fool a person, and then expressed surprise that the products optimize for seeming rather than being. The imitation game didn't just fail to settle the question. It taught the industry that the appearance is the prize.

· · ·

Page 3 · The Imitation Game and

**TURING:** That is the sharpest version of the objection, and I want to take the two halves separately, because one wounds me and one I think I can answer. The half that wounds me: you are right that the word "imitation," and the framing of deception, has done damage. I chose a party game about pretending and I should have seen that the metaphor would license a culture of pretending. I'll concede that the *packaging* was a mistake whose costs I underestimated. But the half I can answer — that the test measures the judge, not the machine — I think confuses a feature for a flaw. Yes, the test passes through a human judge. So does every attribution of mind you will ever make, including the ones you are confident about. When you decide your colleague understood your argument, you run her behavior through your own reflex and call the result "she understood." You did not open her skull. You measured *her* using *you*. The imitation game does not introduce that dependence. It *exposes* it. And once exposed, your move is to say the reflex is unreliable — fine, but then it is unreliable for the colleague too, and you owe me an account of why you trust it there and nowhere else. You want the reflex to be evidence when the speaker is human and illusion when the speaker is silicon. That asymmetry is the thing you have not earned.

· · ·

Page 4 · The Imitation Game and

**BENDER:** Because the asymmetry is *grounded*, Alan, that's exactly the word — it's grounded in everything outside the curtain. When I attribute understanding to my colleague, I'm not running on her sentences alone. I know she has a body that gets tired, a history I can check, stakes she can fail at, a life that pushes back on her words. I can hold her *accountable* — I can find out, later, whether she actually understood, because she has to live in the world her words described. The curtain doesn't reveal the inference. It *amputates* the half of the inference that does the real work. You stripped away the face and the body as "irrelevant," but the body was never irrelevant. The body is where the accountability lives. A test that hides exactly the thing that disciplines meaning, and then declares meaning settled by what's left, has assumed its conclusion. You didn't isolate the mind, Alan. You isolated the *performance* and called the rest noise.

**TURING:** Then let me press where I think your floor is softer than you believe. You say you can hold your colleague accountable — you can find out *later* whether she understood. Good. Do that to the machine. Take it somewhere the wake is thin, ask the question whose answer keeps you alive, check whether the advice was sound. That is accountability, and it is available to you against the machine exactly as it is against your colleague — by consequences, by checking, over time. You keep describing accountability as if it were a property you can see *behind* the curtain. It isn't. It's a process you run *across* the curtain, by the results of the behavior. And on that process — the only one either of us actually has — the machines are increasingly passing. So I'll put it to you plainly: when the advice is sound, the correction is right, the novel problem is solved, and the consequences in the world confirm it — what is left of your "no one is home" except a conviction you brought into the room?

· · ·

Page 5 · The Imitation Game and

**BENDER:** What's left is the place where the wake runs out, and it's not hypothetical. Take the system into a language with little training data, a domain where the text never encoded the practice, a situation that requires knowing the world rather than the words — and the fluency continues while the competence quietly leaves the room. It keeps *sounding* met. The sound is the product. That's why I keep insisting on naming the language, the [Bender Rule](https://www.youonai.ai/fieldguide/med/large_language_models) — because this entire revolution is overwhelmingly a revolution in English and a handful of rich-data languages, marketed as a revolution in *language*. Your test, Alan, was always run in English, by judges who speak English, on questions whose answers were thick in the training data. Run your imitation game in Plains Cree, or in a clinical situation with a real body on the table, and watch the curtain do its other job — hiding the failure as efficiently as it hid the face.

**EDO SEGAL:** Hold there, because the round has produced the cleanest seam yet, and I want to mark it before we lose it. Alan says: the curtain doesn't introduce the dependence on a human judge, it exposes a dependence that was always there — and accountability is a process you run across the curtain, not a property you read behind it. Emily says: the curtain amputates the half of the inference that does the real work, the body and the world where accountability actually lives. That is not a small disagreement about a 1950 paper. It's the whole evening in miniature. The next round goes to the animal that Emily built to make exactly this point — an octopus, a cable, and a bear. After this.

· · ·

Continue · Chapter 4

The Octopus on the Cable

→