The Brittleness and the Long Tail

Page 1 · The Brittleness and the

**EDO SEGAL:** Melanie, you have a way of dramatizing the barrier that I cannot shake. Tell the audience about the panda and the gibbon. And then I want to connect it, because it is not a curiosity — it is the whole argument about surprise, stood on its head.

**MITCHELL:** A deep network classifies an image of a panda with high confidence. Then you add a perturbation to the pixels — a pattern of noise so faint no human eye could detect it, the image still looks exactly like a panda to any person alive — and the network now classifies it, with even *higher* confidence, as a gibbon. That is an adversarial example, and it is one of the most revealing failures in the field, because nothing a human would consider meaningful has changed. What changed is the statistical features the network actually keys on, and the revelation is that those features were never the ones a human uses to recognize a panda in the first place. The system was not seeing pandas. It was detecting correlations in pixel space that *happened*, on natural images, to track the label "panda" — until someone built an image where they did not. The performance was real and the foundation was nothing like ours.

· · ·

Page 2 · The Brittleness and the

And this generalizes into the thing that actually kills people, which I call the long tail. Any complex domain has an effectively infinite supply of rare situations — the weird configurations, the once-in-a-million scenarios. No training set covers them, because there are always more rare cases than examples. A human handles the long tail through *understanding* — confronted with a situation never seen, you reason from a general model of how the world works. A pattern-matcher has no such model to fall back on. It leaves the territory its data mapped and it does not reason; it extrapolates statistically, and the extrapolation is unreliable exactly where reliability matters most. That is why self-driving proved so much harder than promised. The ordinary miles are easy. The long tail is where the difficulty and the danger live, and the system fails there not gracefully, with hesitation, but completely and *confidently*, because it has no equivalent of the human sense of "this is a strange case, I should be careful."

**EDO SEGAL:** Now here is why I wanted this in the room with Ada. Melanie, your whole career documents the machine *surprising* us — failing in ways no one anticipated, the gibbon, the long tail, the confident catastrophe. And tonight we have been treating "surprise" as the thing that might mean origination. So which is it? When the machine surprises you with a gibbon, is that the same surprise that might mean creation — or its evil twin?

· · ·

Page 3 · The Brittleness and the

**MITCHELL:** It is the same surprise, and that is the most important thing I can give this whole debate, so let me say it slowly. Surprise is not evidence of a mind. Surprise is evidence of *our blindness to the system's actual structure*. The gibbon and the so-called creative leap come from the identical source: the machine is operating on features and combinations we cannot see, and so its outputs — the brilliant ones and the catastrophic ones — both astonish us, for the same reason. The cheerleaders point at the surprising-brilliant output and say "origination!" The same machine, the same opacity, produces the surprising-stupid output, and they call that a "bug to be patched." But it is not two phenomena. It is one phenomenon — capability we did not design and therefore cannot bound — wearing two faces. You do not get the creative surprise without the gibbon surprise. They are the same coin. And that is the deepest argument I have *for* the Countess, against my own field's romance: if you are going to call the machine's surprise "origination" when you like the output, intellectual honesty requires you to call it "origination" when it turns the panda into a gibbon — and nobody wants to say the machine *creatively originated* the gibbon. They want to say it made a mistake. But a mistake, Countess, is a deeply human-side word. It presumes a world the output was supposed to be right about. The machine has no such world. It has the weave.

· · ·

Page 4 · The Brittleness and the

**LOVELACE:** I could not have built that argument and I am grateful to receive it, because it completes mine. You have just shown that the machine's surprise is *symmetrical* — that it surprises us with brilliance and with catastrophe by the identical mechanism — and symmetry is fatal to the origination reading. If the same opacity produces the sonnet and the gibbon, then the opacity is not creativity; it is opacity, and we were calling it creativity only on the occasions we approved of the output. This is my "surprise is your blindness, not the engine's depth," and you have armed it with a panda. But — let me be honest where you have earned honesty — there is a place where your gibbon helps *you* against me, and I will not pretend not to see it. My accountability argument said: the engine does only what it was ordered, so a human is always answerable for what it does. Your gibbon says: no human ordered the gibbon. No one wrote a card that says "call this panda a gibbon." The behavior emerged from the search, and it is unpredictable and unowned in a way my hand-carded engine never was. So you have strengthened my objection to origination and *weakened* my doctrine of accountability in the same breath, and I think you did it on purpose.

· · ·

Page 5 · The Brittleness and the

**MITCHELL:** I did, because that is the real danger and it is the one I have spent more breath on than any singularity. The thing to fear is not that the machine is too intelligent. It is that the machine is *too stupid and has already taken over the world* — that we have handed consequential decisions, about loans and bail and hiring and what a car does at an intersection, to systems that match patterns, fail on the long tail, and cannot know when they are out of their depth, because there is no one home to know it. [Machine stupidity exercising authority it has not earned](https://www.youonai.ai/fieldguide/med/competence_without_comprehension). And your accountability doctrine, Countess, which I think is one of the most important things you ever wrote, is exactly what your own objection makes hard to enforce — because when no one wrote the cards, the chain from the gibbon back to a human decision goes long, branches, and disappears into the search. You said the engine could always answer "why did you do that?" with "because you ordered me to." Mine cannot. And that, not creativity, is what I am actually afraid of.

**EDO SEGAL:** Stop — because the reader can't see what just happened and it matters. That was the first exchange tonight where neither of you was scoring a point. Ada, you conceded that Melanie's gibbon, which strengthens your objection to *origination*, simultaneously dissolves your doctrine of *accountability* — that the same opacity which proves the loom is empty also makes the empty loom impossible to hold responsible. And Melanie, you said the thing you fear is not the full loom but the empty one, wielding authority no one can trace. You have arrived, from opposite centuries, at the same uneasy place: the machine that originates nothing may be more dangerous *because* it originates nothing, since there is no one in it to blame and no one outside it who can read it. Hold that. We are at the death cross now, and the next round names it.

· · ·

Continue · Chapter 9

The Mirror and the Machinery of Belief

→