The specific failure in the Thiokol charts was a failure of comparison. The thirteen previous launches had produced thirteen data points on O-ring damage and launch temperature. The correlation was clear if the data points were plotted on a single graph: damage rose as temperature fell. The charts Thiokol produced showed the data in a format that made direct comparison nearly impossible. Individual incidents were presented on separate pages. The temperature axis was not consistently labeled. The thermal-damage data was scattered across multiple graphics with different scales.
This is a failure of small-multiples design. The engineers had exactly the data a small-multiples presentation would have made unmistakable — thirteen instances of the same system under varying conditions. A single well-designed display would have placed all thirteen data points on one scatterplot with temperature on one axis and damage severity on the other. The pattern would have been visible in two seconds. The Thiokol charts took the same information and scattered it across thirteen displays, requiring viewers to hold each data point in memory while moving to the next — a cognitive task the human working memory cannot perform reliably under any conditions, and certainly not in a teleconference at 11 PM the night before a launch.
Tufte's analysis appeared most fully in Visual Explanations (1997) and has been reprinted, refined, and extended in subsequent work. The core claim is unambiguous: had the Thiokol engineers produced one honest scatterplot of damage against temperature, the launch would have been delayed. The data-ink ratio of the actual charts was low. The chartjunk was high. The critical pattern was buried beneath the format. Seven people died because a correct signal was presented in a display that hid it.
The case applies to AI with uncomfortable precision. When Claude produces an analysis that sounds authoritative but is wrong — the Deleuze failure Edo Segal describes in You On AI — the wrongness is invisible because the surface signals that normally correlate with reliability are all present. The builder who accepts the output on its surface quality is in the position of the NASA decision-makers who accepted the Thiokol recommendation: trusting a display whose format has concealed a critical pattern the underlying evidence would have revealed.
The Challenger disaster occurred on January 28, 1986, killing astronauts Francis Scobee, Michael J. Smith, Judith Resnik, Ellison Onizuka, Ronald McNair, Gregory Jarvis, and Christa McAuliffe. The Rogers Commission investigation documented the engineering and managerial failures leading to the decision. Tufte's reanalysis, focused specifically on the information-design failures, first appeared in Visual Explanations (1997) and has been extended in The Cognitive Style of PowerPoint (2003), which analyzed the similarly poor information design in the Columbia foam-damage analysis that preceded the 2003 Columbia disaster.
The data was there. The critical evidence — correlation between temperature and O-ring damage — was physically present in the materials presented to decision-makers. The design of the charts made it invisible.
The failure was a failure of comparison. Thirteen data points scattered across thirteen displays cannot be compared; the same thirteen points on one scatterplot would have revealed the pattern immediately.
The consequences were human. This was not an abstract failure of visualization aesthetics. Seven people died because a clear signal was presented in a display that hid it.
The failure repeats. The 2003 Columbia disaster involved a structurally similar failure in the PowerPoint analysis of foam-debris risk — the same architectural problem at the same organization seventeen years later.
The application to AI is direct. AI-generated output whose surface polish exceeds its substantive accuracy creates the same kind of display: evidence that appears to support a conclusion while actually concealing the pattern that would contradict it.