Emily M Bender vs Geoffrey Hinton on AI · Ch12. The Crossing ← Ch11 Ch13 →
Txt Low Med High
HOUR THREE — IS ANYONE HOME
Chapter 12

The Crossing

Page 1 · The Crossing

**EDO SEGAL:** The rules of this round are short. Each of you questions the other, directly. I will not rescue anyone. Emily — you won the toss backstage. Begin.

**BENDER:** Geoff. You've said the systems understand. You've said they may experience. Here's my question, and I want the engineering answer, not the philosophy: name the *observation that would change your mind*. What, concretely, would these systems have to fail at — what experiment, what behavior, what absence — for you to say: I was wrong, it was pattern all the way down, nobody home?

**HINTON:** *[pause]* That's the right question, and I'll answer it honestly rather than safely. The clean falsifier would be this: scale and training keep increasing, and capability *plateaus* in a particular, diagnostic way — the systems keep getting more fluent while systematically failing at novel composition. Problems whose solutions require combining represented knowledge in ways absent from the training distribution. If, with another thousandfold compute, they remain brilliant interpolators and consistently helpless extrapolators — if the [continuum of understanding](https://www.youonai.ai/fieldguide/med/continuum_of_understanding) turns out to have a cliff where I claim a slope — then my position dies, because my position *is* that prediction at depth forces world-modeling, and a system with a world model composes. I'd add the timeline that makes it falsifiable rather than faith: I expected those compositional failures to be obvious by now if Emily were right. Every year they get less obvious instead. That trend is my evidence; its reversal would be my refutation. Now mine. Emily. You've spent the evening — the decade — saying the missing ingredient is grounding: intent, world, accountability, the loop that closes through reality. The systems now act: they use tools, run code, see and hear, control machines, get corrected by a world that pushes back, and learn from it. Tell me precisely how much of that loop has to close before *your* position dies. Name your falsifier — or admit the goalposts are mounted on wheels.

· · ·
Page 2 · The Crossing

**BENDER:** I accept the symmetry, and I'll answer in the same key. You're right that "trained only on form" was a 2020 description; the multimodal, tool-using systems are a different artifact, and I've said so in print, whatever my caricature claims. So here's the line, drawn where it was always drawn — not at performance, at *function*. Show me a system that does what every human two-year-old does effortlessly: holds a stake in being *right about the world*, not in producing approved output. Concretely: a system that, unprompted, flags its own prior assertion as false because new evidence arrived — against its training incentive, against the user's visible preference, at cost to its own reward signal — reliably, as a *disposition* rather than a fine-tuned trick. Communicative intent isn't mystical; it's caring that the other party end up *correctly informed*, and caring shows up as costly behavior. The day the machine takes a loss to keep the world and its words aligned, my framework has lost its load-bearing wall, and I will say so in public, in those words. What I will not accept as the falsifier is more fluency, more benchmarks, more demos — performance without stakes is exactly the thing under dispute.

**HINTON:** Then let me push once, hard, where I think your wall is already cracked. Your falsifier defines the mind by its *incentives* — caring, stakes, cost. But Emily, you know what shaped our incentives: evolution, an optimizer with no more intent than gradient descent. Your caring is what a survival-trained network feels like from inside. So when you demand the machine show costly truth-keeping before you'll grant it standing — you're not naming a difference in kind. You're naming the training objective you happen to have been shaped by, and declaring it the only door into the club. The lesson of my entire career, the one the symbolic people learned at terrible cost, is that "it wasn't built the way I was built" loses. Every time.

· · ·
Page 3 · The Crossing

**BENDER:** And the lesson of *mine*, Geoff, is that "it behaves like me, therefore it is like me" is the oldest error our species makes — older than science. We mistook the storm for anger and the plague for judgment, and now the mirror for a mind, and every time, the mistake was *profitable for somebody*. You ask what shaped my caring — fine: evolution, blindly. But it *did* shape it. The caring is real, embodied, expensive, breakable — I can be wrong in a way that costs me, and that cost is what my words are anchored to. Your systems' words are anchored to a loss function on next-token error, and between those two anchors lies everything I mean by meaning. You say that boundary repaints itself back one fence at a time. I say: some fences are real, and the way you find out is that the people who pretend there's no fence get *hurt* — and they're being hurt now, tonight, at scale, while we philosophize.

· · ·
Page 4 · The Crossing

**HINTON:** Then take my second question, because it's the mirror of the one you just answered and you won't like it. You've spent the evening — the decade — pricing *my* side's errors: the hype, the displacement justified by inflated claims, the moral attention misdirected at speculative patients. Fair. Now price yours. Suppose you win the discourse — suppose your framing becomes the world's framing: it's all costume, all mirror, [nothing approaching understanding](https://www.youonai.ai/fieldguide/med/continuum_of_understanding), nothing that could ever warrant moral caution toward the systems themselves. And suppose you're *wrong* — not about the companies, you're right about the companies — wrong about the trajectory of the thing itself. What does your framework cost the world then? Because I can name it: a civilization that walks into the most consequential transition in its history having trained itself, on your authority, to look away from the possibility. Every fence-repainting in history was eventually forced by evidence, Emily — and the forcing was survivable because nothing was *at stake* in being late about the apes or the heliocentric sky. This is the first repaint where being late might not be survivable. The parrot frame is a magnificent defense against credulity. What is it worth as a defense against the actual thing arriving?

· · ·
Page 5 · The Crossing

**BENDER:** *[pause]* I'll pay the question the respect of not deflecting it. If I'm wrong in the way you describe — if capability keeps compounding into something that warrants the words I've refused — then my framework's cost is real, and it's the one you named: a delay in taking the thing itself seriously, purchased by years of correctly refusing to take the *claims* seriously. I accept that bill on one condition, which is that you look at the asymmetry of who presents it. To hedge against your scenario, the world has to do — what? Build the audit capacity, the provenance law, the institutional muscle, the [habits of verification](https://www.youonai.ai/fieldguide/med/attentional_ecology) — every single item on *my* list. Preparedness for your real machine and defense against my fake one are the same infrastructure, Geoff. But to hedge against *my* scenario from inside *your* frame, the world has to resist awe while its laureates testify to the dawn of machine minds — and history says it can't. That's why I hold my line even against your wager: not because I'm certain there's nothing in the river. Because every useful response to *either* of our worlds starts with the public refusing to be enchanted, and only one of our frames teaches that.

**HINTON:** That's the best answer my question has ever received. I don't accept it, but I'm going to think about it on the plane.

**BENDER:** *[pause]* One more question, since the chair is silent. You left the most powerful company on earth to warn the world, and I honor it — I want that on the record. So: if you believe there's a meaningful chance this work ends us — why does your warning never quite arrive at *stop*? You'll say competition, China, inevitability. I've read the interviews. But I want the personal answer. Geoffrey Hinton, who built it: why won't you say stop?

· · ·
Page 6 · The Crossing

**HINTON:** *[long pause]* Because I don't believe stop exists, and I won't pretend a comfort I can't defend. The discoveries were sitting there — if not me, someone, within a few years; the ideas were *available*, the way calculus was available, found twice in a decade. What exists isn't the choice between building and not building. It's the choice between building done by people who feel the weight and building done by people who feel the upside. And — I'll give you the personal answer you actually asked for, since you were honest with me about the wall. I love it. I have loved the problem of how minds work since I was a boy, and the love didn't switch off when the fear arrived. I told the world my excuse already: if I hadn't done it, somebody else would have. That's true, and it is also, as I said then, the *normal* excuse — and I know what category of person history files under the normal excuse. So I don't say stop because I can't make it mean anything. I say: *look at it*. Look at what we made, clearly, without the lullaby — yours or mine. That's the only instruction I've ever been sure of.

**BENDER:** That's the first answer tonight I have no rebuttal to. Only the observation that "look at it clearly" is my entire program — and the people who pay for the looking-glass would rather we didn't.

**EDO SEGAL:** *[quietly]* And there — after three hours — the two of you are holding opposite ends of the same sentence. We close after this. Final statements. The last word each.

· · ·
Continue · Chapter 13
Closing Statements
← Prev 0%
Ch12 Next →