The cycle that began with [YOU] on AI describes the vertigo a skilled builder feels when an AI system produces work that is simultaneously impressive and untrustworthy: fluent, fast, frequently correct, and then confidently, catastrophically wrong in a way no calibrated human expert would be. The oscillation between reliance and suspicion is exhausting, and it is exactly the oscillation that Fermi estimation is built to manage. The Fermi temperament does not trust the machine’s estimates, but it does not dismiss them either; it treats each output as a hypothesis, checks it against an independent line of reasoning, and holds confidence in proportion to the agreement between them.
The cycle holds Fermi estimation alongside other cognitive disciplines the AI age demands. Pearl’s ladder of causation asks which rung of intelligence a given output occupies. Fermi estimation asks whether the confidence the output expresses is calibrated to the evidence behind it. Together they constitute the auditor’s toolkit for working alongside AI systems that supply the feeling of an informed decision more reliably than they supply its substance.
Fermi developed his estimation practice as a pedagogical tool at the University of Chicago in the late 1940s, teaching physics students to navigate the gap between what a problem states and what it requires. The exercises he set—how many piano tuners in Chicago, how many golf balls fit in a school bus, what is the weight of the moon in pounds of butter—became known as Fermi problems and spread through physics education worldwide after his death in 1954.
The method has antecedents in the order-of-magnitude reasoning that physicists had practiced since at least the late nineteenth century, but Fermi raised it to a systematic discipline and, crucially, insisted on calibration as its test. An estimate is not a Fermi estimate unless the estimator holds their confidence in proportion to their accuracy—unless they know, before the answer is checked, how much to trust it. This metacognitive dimension distinguishes the discipline from confident guessing and is precisely what makes it teachable.
Decomposition. The method’s first move is to replace the unanswerable question with a chain of difficult but estimable ones. Each factor in the chain can be guessed to within an order of magnitude from general knowledge; the chain assembles those guesses into an answer. The power comes from independence: if the errors in adjacent factors are not correlated, they tend to cancel, so the product of several rough guesses can land closer to the truth than any single guess deserves.
Calibration, not confidence. The defining feature of Fermi estimation is not the production of an answer but the metacognitive assessment of how much to trust it. A well-calibrated estimator who says “I am eighty percent confident” is right about eighty percent of the time. Current AI systems are notably miscalibrated in their reasoning outputs: they will state a wrong answer with the same linguistic confidence as a right one, and will revise a correct estimate under pressure not because they have found an error but because the pressure pattern-matches to “you should reconsider.” The machine has confidence and it has accuracy, but the two are not reliably coupled, and an estimate whose confidence is uncoupled from its accuracy is not a Fermi estimate. It is a guess wearing the costume of one.
A model of the world, not retrieval from it. Fermi’s estimates worked because they drew on a coherent physical model of the world—a picture from which estimates could be drawn on dimensions the picture was never explicitly given. A large language model’s “knowledge” is a vast statistical association of tokens, capable of asserting two contradictory facts in consecutive sentences without registering the contradiction, because nothing in its structure requires the picture to hang together. The Fermi estimate is a probe that reveals the difference: it requires not retrieval but interrogation of a world-model, and it fails gracefully when the world-model is absent.
The discipline as a survival skill. Fermi taught estimation the way a sergeant teaches field-stripping a rifle: not as a trick but as a survival skill. The world constantly demands decisions before the data arrives. In situations where thresholds are being approached and the consequences of misjudgment are irreversible, the ability to reason under uncertainty without either false certainty or paralysis is the precise cognitive tool that working with AI systems most requires—because the machine will not supply it and the builder must.