The Parrot and the Prophet

Page 1 · The Parrot and the

EDO SEGAL: Emily, in 2021 you and Timnit Gebru, Angelina McMillan-Major, and Margaret Mitchell published the paper that cost two of your co-authors their jobs at Google and gave the culture its handle on this whole era: On the Dangers of Stochastic Parrots. People who have never read a word of it use the phrase. I'd argue most of them use it wrong — they think it's an insult, and it's actually an engineering description. Give us the description. Then I want to ask you both about what the paper was really about, because it wasn't mostly about parrots.

BENDER: It's precisely an engineering description. A language model is a system for haphazardly stitching together sequences of linguistic forms it has observed in its vast training data, according to probabilistic information about how they combine — but without any reference to meaning. Stochastic: random in a patterned, probability-governed way. Parrot: the mimicry is real and the comprehension is absent. We chose it because it was accurate, and because we could see the vocabulary war coming — the industry was already saying understands, reasons, thinks, and every borrowed word was doing unearned persuasive work in funding rounds and policy rooms.

But you're right, Edo, and I wish more people who wield the phrase knew it: the parrot was one section. The paper was a cost accounting. The environmental footprint of training runs the size of small nations' grids. Training data scraped at a scale that defeats documentation — so the biases of whoever was loudest on the internet get laundered into an authoritative voice and deployed on everyone, including the people the data erased. The opportunity cost of an entire field stampeding down one path because that's where the compute is. And the harm we put last because it summarized the rest: that fluent, ungrounded text at scale would pollute the information ecosystem. We wrote that before the flood. I take no pleasure in how it aged.

· · ·

Page 2 · The Parrot and the

EDO SEGAL: Geoff, you watched that paper detonate inside the company you worked for. And I want to put the uncomfortable symmetry on the table. Emily's school says: the industry inflates what these systems are, and the inflation transfers money and power. But Emily — your school says something parallel about Geoff. That when the most decorated researcher in the field stands up and says this technology might end humanity, he's not puncturing the hype. He's completing it. Nothing sells "this is the most powerful technology in history" like the prophet warning it might be too powerful. Geoff, you first: is your warning load-bearing for the valuations?

HINTON: It's a fair question, and I've sat with it, because the people making that argument include people I respect. Three things. First, the timeline is wrong for the cynical reading: I left Google to warn — I gave up the paycheck; the honest cynic keeps the salary and does the doom on weekends. Second, I notice the argument proves too much. If warning is hype and reassurance is hype, then no statement a knowledgeable person makes can be evidence of anything, and we should all go home. Third — and this is the part Emily and I genuinely share — I agree the companies exploit my warnings. They metabolize everything. I've said the risk of extinction is real, maybe ten, twenty percent; I've watched that quote raise money. That tells you something rotten about the incentive structure. It tells you nothing about whether the quote is true. Smoke detectors don't become wrong because arsonists sell insurance.

· · ·

Page 3 · The Parrot and the

BENDER: But you didn't just bring a smoke detector, Geoff. You brought a story — and it's the industry's favorite story: the machine as ascending mind, so capable it might transcend us. Every time you say "it might end us," a policy maker hears "it can do anything," and the procurement contract follows the awe. Meanwhile the harms that are not speculative — the benefits systems and the hiring filters, the synthetic sludge, the stolen work product, the people displaced by an inflated story about what the machine can replace — get triaged behind your science fiction. The AI Con's whole argument is that doomerism and boosterism are the same pitch deck. Both need the machine to be more than it is.

HINTON: And here is where I'll be sharp, because this matters more than our comfort. Emily's frame requires that the capability be fake — that under the costume there's nothing that warrants the concern. I spent a decade inside the place building these things. I watched capabilities arrive ahead of the roadmaps, surprising the people training the models — surprising me, and I'd bet my reputation on networks for fifty years. The scaling pattern is the most reliable empirical regularity this field has ever produced. When Emily says "more wake is more wake," she's making a bet about where that curve goes. I've made the opposite bet all my life, and I have the unusual experience of having been right against an entire field's certainty once already. I'm not asking anyone to trust my fear. I'm asking them to look at the derivative.

BENDER: The derivative of what, measured how? That's not rhetoric — it's my entire objection in four words. Benchmarks saturated with their own training data. Capabilities defined by demos. "Surprise" as evidence — Geoff, surprise is a fact about your expectations, not about the system. The whole epistemics of your field runs on vibes at scale, and you of all people, the man who insists on mechanism, should be the one demanding better measurement instead of dressing your shudder in percentages. Ten to twenty percent — derived from what model, what data? You've said yourself it's a feeling with a number on it.

· · ·

Page 4 · The Parrot and the

HINTON: An honest expression of uncertainty about an unprecedented situation. You'd prefer I round it to zero for hygiene?

It's that it's expensive, and the price list was published in December 2020.

BENDER: I'd prefer the public knew the number was a mood. Because they make laws out of your moods now.

EDO SEGAL: Since we're standing in the uncomfortable place, let's stand here properly. Emily — the parrot paper has a history your critics never mention and your admirers never stop mentioning: Google demanded the paper be retracted, and within months your co-authors Timnit Gebru and Margaret Mitchell were out of the company. Geoff — you were at Google when it happened, senior enough that your word carried weight, and the record of senior researchers defending them publicly is thin. I'm not asking either of you to relitigate personnel files. I'm asking the structural question: what does that episode tell us about whether this field can produce honest knowledge about itself?

BENDER: It tells us the field's knowledge runs through a funnel with a corporate valve on it, and everyone working in it knows where the valve is. Understand what the paper was: a survey. A cost accounting, with citations, of risks the company's own products carried. For that — not for fraud, not for error, no one has ever identified the error — the leading corporate lab in the world pushed out two of its most prominent researchers, both women, both of whom had built its ethical credibility. Every researcher in the field watched it happen and updated accordingly. That's the mechanism, Geoff — not censorship with a stamp, just a demonstration, once, in public, of what happens to careers that itemize costs. The chill does the rest for free. And then the same companies fund the conferences, license the compute, and hire the graduates, and we're asked to treat the resulting literature as the disinterested record of what these systems are. The reason my critique sounds lonely is not that it's rare. It's that it's expensive, and the price list was published in December 2020.

· · ·

Page 5 · The Parrot and the

HINTON: I'll answer the part aimed at me, because dodging it would prove her point. Could I have said more, then, publicly? Yes. I told myself what senior people always tell themselves — that I didn't know the details, that it was a management matter, that my lane was research. I notice those sentences more clearly now than I did then; leaving changed my eyesight, which is itself evidence for Emily's structural claim and I'll hand it to her freely. Where I'll push back is the inference that the valve falsifies the science. The capabilities literature isn't hype because corporations fund it — replication exists, open-weight models exist, academics adversarially probe these systems every day, and the systems keep passing the adversarial probes. Emily's argument explains why we should audit the record. It cannot do the work she sometimes wants it to do, which is to discount every inconvenient result as purchased. Sociology tells you where to point the skepticism. It doesn't get to be the skepticism.

Because there's a sixty-year-old piece of evidence about credulity, and it's a chatbot from 1966 that couldn't do anything at all.

BENDER: On that — agreed, and I've never argued otherwise. Audit everything, including me; The AI Con has footnotes for a reason. I'll take the concession about the eyesight, Geoff. It's more than most of your colleagues have offered.

EDO SEGAL: I want to name what just happened, because the reader can't see your faces. That was the first exchange of the night where neither of you was smiling. And notice the strange topology we've wandered into: Emily, the critic, and Geoff, the prophet, agree that the companies are not to be trusted, agree the public discourse is corrupted, agree the stakes are enormous — and disagree about whether the engine of the danger is the machine's power or our credulity about it. That fork in the river is the next round. Because there's a sixty-year-old piece of evidence about credulity, and it's a chatbot from 1966 that couldn't do anything at all. The mirror — after this.

· · ·

Continue · Chapter 6

The Mirror That Learned to Mirror

→