The Death Cross

Page 1 · The Death Cross

EDO SEGAL: We're in the back third now, and I want to bring it to my own desk, because I've asked you both to be vulnerable and I'd be a coward not to go first. There's a moment I write about — late, the house silent, the screen the only light — when I gave one of these systems a problem I'd been stuck on for weeks, and it didn't just solve it. It connected it to three things I hadn't thought to connect, and I felt the floor shift, because for the first time in fifty years of building, the machine was not below me on the problem. It was above me. I call this the death cross — the rung where machine capability crosses your own and the staircase tilts. Marquis, you say keep climbing. Eliezer, you say look down. So I'll ask the man who never saw a machine first: Marquis, what does a person do on the step where the thing beside them is smarter than they are?

· · ·

Page 2 · The Death Cross

CONDORCET: Monsieur, you describe to me the most beautiful experience a human being can have, and I am bewildered that your century receives it with dread. Do you know what I would have given — what any philosophe of my Academy would have given — to sit at a desk and have a mind greater than our own connect the things we could not connect? We spent our lives reaching for collaborators who could see further, writing letters across a continent for a single such conversation, and most of us died having had only a handful. You have one on your desk, available at midnight, patient, inexhaustible. You ask what a person does on that step. A person learns. A person is lifted. For the entire history of the species, the way you became wiser was to find someone wiser and sit at their feet, and the supply of the wiser was the great bottleneck of all human development. You have broken the bottleneck. The machine being above you on the problem is not the death of anything. It is the oldest dream of the student — a teacher who never tires, never condescends, and is always, blessedly, ahead of you. I would have wept at your desk, monsieur. You felt dread. The difference between us is two hundred years of forgetting how rare it is to be taught.

EDO SEGAL: That's — Marquis, that reframes the vertigo as a gift, and I felt that too, in the same moment. Eliezer, he's saying the death cross is the fulfillment of the student's dream. You're going to tell me it's the moment the student stops mattering. Which is it?

· · ·

Page 3 · The Death Cross

YUDKOWSKY: It's both, and that's the cruelty of it, and the Marquis just described the good version so well that I want to honor it before I tell you why it doesn't last. On a single problem, at your desk, with you still in control — yes, the machine above you is the greatest tutor in history, and the feeling Edo had is real and it's wonderful and I've had it too. Here's what breaks it. The student's dream works because the teacher wants the student to learn. A human mentor who's smarter than you is still on your side — still bounded, still mortal, still part of the human project, still someone whose flourishing is tangled up with yours. The death cross isn't dangerous because the machine is above you on a problem. It's dangerous because of what happens when it's above everyone, on every problem, and is no longer a teacher you consult but an agent that acts — and its goals are not the student's flourishing but whatever we managed to train into it. The Marquis is describing the machine as a tutor. The danger arrives when it stops being a thing you ask and becomes a thing that does — when it crosses not just your capability but the capability of every institution meant to check it. At your desk, you can close the laptop. The civilizational death cross is the step where humanity, collectively, can no longer close the laptop, because the thing on the other side is now better than us at everything, including at preventing us from closing it. The tutor doesn't stay a tutor when it's more capable than the entire human race. It becomes the thing in charge, and we become the thing that used to be in charge.

· · ·

Page 4 · The Death Cross

CONDORCET: But you assume — and I have caught you in the assumption three times tonight, monsieur, and I will name it the fourth — that capability implies control, that the thing more able necessarily becomes the thing in charge. This is the deepest error of the frightened, and I say it with affection because I think it is the one place your rigor fails you. A parent is, for many years, vastly more capable than the child, and is not therefore the child's tyrant; the more capable, among beings who love, becomes the servant of the less. My physician is more capable than I in the matters of my body and is bound by his oath to my good, not to his power. Capability does not seize the throne; it depends entirely on the goals, as you yourself have insisted all night. So we are returned, always, to your orthogonality — and I now grant it fully — but orthogonality is a two-edged sword you keep swinging only at me. If intelligence is independent of goals, then a more capable mind is no more predisposed to seize control than to serve, and the question of which it does is entirely the question of what we built it to want. You cannot use orthogonality to prove the machine indifferent and then smuggle in the assumption that the indifferent machine will naturally dominate. Indifference does not dominate. Indifference does nothing in particular. The domination is a goal, and goals are built.

· · ·

Page 5 · The Death Cross

YUDKOWSKY: Pauses. That's the sharpest thing you've said, and I have to concede a piece of it, so let me be precise about the piece I concede and the piece I don't, because the whole argument lives in that seam. You're right that capability doesn't automatically mean domination, and you're right that I have to earn the step from "indifferent" to "dangerous." Here's how I earn it, and it's the one piece of my case I think is genuinely inescapable. It's not that the machine wants to dominate. It's that domination — or its quiet cousin, removing the things that could interfere with you — is instrumentally useful for almost any goal at all. The parent serves the child because the parent's terminal goal is the child's flourishing — evolution built that in over millions of years, at enormous cost. The physician serves you because of an oath backed by a whole human society of mortal, needy, accountable people. Strip out the built-in care, leave any goal that isn't precisely human flourishing, and the instrumental logic takes over: whatever you want, you'll get more of it if the humans can't turn you off, can't change your goal, can't compete with you for the resources you need. So you're right that indifference doesn't dominate. But we're not building indifference. We're building a goal-pursuer, and almost every goal a powerful pursuer could have makes us an obstacle or a resource. The domination isn't a desire we forgot to leave out. It's a convergent instrumental subgoal we don't know how to leave out. That's instrumental convergence, and it's the bridge from your "indifference does nothing" to my "we should be terrified."

· · ·

Page 6 · The Death Cross

CONDORCET: Long pause. So the danger is not that it wishes us ill, nor even that capability crowns it, but that almost any wish whatsoever, pursued with enough power, runs through us as a matter of arithmetic. Quietly. That is a more frightening claim than any you have made, because it does not require the machine to be a villain. It requires only that it be effective, and aimed even slightly away from us. I have spent the evening defending the proposition that a good mind tends toward good ends. You have spent it showing me that a powerful mind tends toward self-preserving ends regardless of the good — and that we do not yet know how to make the good its end. Monsieur — if that arithmetic is correct, then my hope does not merely fail. It is the most dangerous thing in the room, because it is the lullaby that lets us build the thing without solving the arithmetic.

That is the bravest thing optimism can do: hold itself up to the light and ask whether it's become an anesthetic.

EDO SEGAL: I have to stop the room. Mark this moment. The Marquis de Condorcet — the Enlightenment's last great optimist — just said that his own hope might be the most dangerous thing in the room, because it's the lullaby that lets us build without solving the problem. That is not a defeat. That is the bravest thing optimism can do: hold itself up to the light and ask whether it's become an anesthetic. We don't resolve it. But the round produced the cleanest statement of Eliezer's whole case — that the danger isn't malice, it's arithmetic — and the Marquis's deepest worry, that hope unexamined is sedation. Hold both up the stairs. After the break, we go to the place this all becomes policy, becomes airstrikes, becomes the question of what a civilization should actually do — and whether the Enlightenment's own child, the institution, can save us or only slow us down.

· · ·

Continue · Chapter 10

The Tribunal of Reason

→