The Room and the River

Page 1 · The Room and the

EDO SEGAL: John, your room is the most famous thought experiment in the field, but I want you to tell it the way you'd tell it to a sharp fifteen-year-old, because most people who invoke it have never actually walked inside. And then, Ilya — before you attack it, I want you to do the unusual thing. Steelman it. Tell us what the room gets right.

SEARLE: Gladly. You're locked in a room. You speak English and not one word of Chinese — you can't even tell Chinese script from Japanese. Through a slot, people pass you papers covered in Chinese characters. You have a thick rulebook, in English, that says: when you see this squiggle, send back that squiggle. You get fast and good. The people outside — who are native Chinese speakers — pass in questions, and you pass back answers so perfect they're convinced they're talking to an educated native. And here is the only thing I want the fifteen-year-old to hold: you still don't understand a word of Chinese. You have all the form there is and none of the meaning. Now — the trick of it — you are doing exactly what a computer does. You're taking in symbols, applying rules that care only about their shapes, and sending symbols back. If running the program made anyone understand Chinese, it would make you understand Chinese. It doesn't. So running a program is never, by itself, understanding.

EDO SEGAL: Ilya. Steelman first.

The people outside — who are native Chinese speakers — pass in questions, and you pass back answers so perfect they're convinced they're talking to an educated native.

SUTSKEVER: I can do that honestly, because the experiment is genuinely good and I don't respect the people who wave it away. What it gets right: the training signal matters, and there are things text underdetermines. The room warns, correctly, that fluency in a distribution is not the same as competence outside it — every machine-learning researcher should have that tattooed somewhere. It also gets something right about us — that the people outside the room will keep reading a mind into the slot long after the slot stops deserving it, and that this reflex is exploitable, and that the industry exploits it. That's all true, and I won't pretend otherwise.

· · ·

Page 2 · The Room and the

Now the place it fails, and it's load-bearing. The rulebook. John hands the man a rulebook, finished, complete, written by someone else, and asks where the understanding is — and of course it isn't in the man, because the man didn't build the rulebook. The understanding, if it's anywhere, is in whatever process wrote the book. And in my systems there is no separate author. The system wrote its own rulebook, by being squeezed against trillions of examples until the only configuration that survived was one that captured the structure of the world the text describes. The man in Searle's room is the network at inference, running the weights. But the understanding question is about training — about what had to be true of the world for that rulebook to get written at all. John quietly removes the only place understanding could live and then announces he can't find it in what's left.

SEARLE: That's clever and it doesn't survive contact. Move the authorship inside. Let the man write his own rulebook — let him sit in the room for fifty years, watching which squiggles follow which, and compile his own correlations, the way your network does. He's now the author and the executor. Does he understand Chinese? No. He has the most magnificent table of which-shape-follows-which-shape ever assembled by a human being, and he cannot order lunch in Beijing, because at no point did any squiggle ever connect to a thing. Self-authorship doesn't add meaning. It just means the syntax wrote itself. A rulebook that writes itself is still a rulebook.

· · ·

Page 3 · The Room and the

SUTSKEVER: But your man watching squiggles for fifty years is data-starved in exactly the way that matters, and that's the whole difference. He taps one stream. My systems tap, in effect, every stream humanity ever produced — the physics textbooks and the recipes and the legal briefs and a hundred thousand descriptions of what water does and what bears do and what worked and what got someone killed. At that scale the text stops being gossip about the world and becomes a low-resolution scan of it, cross-referenced from a million angles. You say meaning never came down the cable. I say structure came down the cable, in overwhelming redundancy, and structure is most of what meaning is. The river found a channel through language because language is a lawful record of a lawful world, and a deep enough current reconstructs the world from the record.

A scan of the world is not the world; it's an image of it, and an image, however high its resolution, is exactly the thing that has form without substance.

SEARLE: And there's your river again, doing all the work and meaning nothing. Let me grant you the scan — it's a brilliant phrase and it concedes my point. A scan. A scan of the world is not the world; it's an image of it, and an image, however high its resolution, is exactly the thing that has form without substance. You can scan a fire in infinite detail and the scan won't warm your hands. You've described, with real beauty, the highest-resolution scan ever made of the human textual record. I'm telling you the resolution was never the issue. A perfect scan of meaning is still a scan, and a scan is syntax. The thing you keep calling "the world coming down the cable" is the image of the world coming down the cable. The cable cannot carry the thing. It can only carry the picture, and the man in the room — the network — only ever sees the picture.

· · ·

Page 4 · The Room and the

EDO SEGAL: [the room is quiet for a moment] Let me mark something, because the reader can't see your faces and I want it on the record. That was the first exchange tonight where neither of you was reaching for the other's hand. And I want to seat a ghost at this table, because he belongs to both of you and you're each going to claim him. Hubert Dreyfus. Berkeley philosopher, wrote What Computers Can't Do in 1972, spent his life arguing that real intelligence is embodied, situated, a matter of coping with a world rather than processing symbols about one — and that the symbolic AI of his day would never get there. John, he was your colleague and your ally. Ilya, his argument against rules and representations reads today almost like a connectionist manifesto. Whose ancestor is he?

Dreyfus said understanding is embodied — embodied, in a body, in a world, with skill that lives in coping rather than in propositions.

SEARLE: Mine, obviously, and Ilya can't have him, though he'll try. Dreyfus said understanding is embodied — embodied, in a body, in a world, with skill that lives in coping rather than in propositions. Everything in his life's work points at the thing the network doesn't have: a situation, stakes, a body that copes. A network trained on text is exactly the degenerate case he warned about — all pattern, no situation. He'd look at these models and say what he said about every generation's AI: you've mistaken the articulable shadow of intelligence for the thing. Except this time you built the shadow out of everyone's articulations at once, which makes it a far more convincing shadow, and far more dangerous.

· · ·

Page 5 · The Room and the

SUTSKEVER: He's more mine than you'll admit, and here's the part that should worry you. Dreyfus said expertise isn't rule-following — that experts can't state their own expertise because it was never propositional; it's pattern, learned through exposure, resident in something more like a body than a logic engine. That is exactly what a neural network is: competence with no articulable rules, knowledge smeared across weights, skill that can't explain itself. We vindicated him. He spent his life saying intelligence isn't symbol-manipulation, and the thing that finally worked was the thing that refused to manipulate symbols and learned instead. You kept his "embodied," John. I kept his "non-symbolic." And the systems are growing bodies now — they take in images, they control robots, they get corrected by a world that pushes back and learn from it. When the network copes, in Dreyfus's exact sense, with a world that pushes back, does your line move?

Dreyfus also taught us how many demonstrations the field can stage per decade, and how each one is the last one we'll ever need until the next one.

SEARLE: When the coping is real and not a demo someone staged and sold — ask me then. Dreyfus also taught us how many demonstrations the field can stage per decade, and how each one is the last one we'll ever need until the next one.

EDO SEGAL: Hold that — it comes back when we talk about what cannot be automated. The next round goes to the place where the cable's limit is sharpest. Not whether the system models the world, but whether its symbols ever touch it. The grounding problem. The word "water," and whether anything in the machine has ever been wet. After this.

· · ·

Continue · Chapter 5

The Grounding Problem

→