August von Wassermann developed his complement-fixation test for syphilis in 1906. Fleck's reconstruction of its history is one of the most devastating case studies in the philosophy of science. The Wassermann reaction did not work in the way a modern reader would assume. It was not a clean detector of the presence or absence of Treponema pallidum. It produced ambiguous results, required considerable interpretive skill, and generated false positives and false negatives at rates that would be considered unacceptable by contemporary standards. It became the standard diagnostic test not because it was objectively reliable — Fleck demonstrated it was not — but because a thought collective formed around it, developed interpretive protocols for its use, invested institutional resources in its infrastructure, and gradually standardized the reading of its results until the ambiguities were resolved not by improving the test but by stabilizing the framework within which its results were read.
Fleck's analysis of the Wassermann reaction is the clearest historical example of pathological stabilization of journal knowledge into handbook knowledge. The test migrated from experimental laboratories into clinical handbooks without the migration being warranted by technical reliability. What warranted the migration was institutional investment: training programs, diagnostic protocols, insurance frameworks, legal proceedings that presupposed the test's validity. The investment itself became the argument — so much had been built on the Wassermann reaction that questioning it meant questioning the entire clinical edifice.
The dynamic Fleck identified is not a story of fraud or incompetence. The researchers and clinicians were acting in good faith. They encountered ambiguous results and developed reasonable-looking ways of handling the ambiguity — interpretive skill, sample preparation protocols, reader calibration. Each individual accommodation was defensible. The aggregate effect was that the test's unreliability was managed rather than confronted, and the normalization of deviance that Diane Vaughan later identified in the Challenger disaster was already operating in serological laboratories seventy years earlier.
Applied to AI, the Wassermann reaction is the cautionary precedent. The current understanding of AI's impact is being stabilized through institutional investment — corporate restructuring, educational reform, regulatory frameworks, investment theses — at a speed that outpaces the collective negotiation that would refine it. The institutions that have committed resources to specific understandings of AI develop vested interests in maintaining those understandings, and the vested interests shape subsequent evaluation of evidence in ways that favor confirmation over revision. This is not because the actors are dishonest. It is because the institutional dynamics Fleck documented in 1935 operate with the same efficiency in 2026.
The Wassermann reaction was eventually replaced by better tests — but only after decades of institutional transition costs that fell disproportionately on patients misdiagnosed, physicians trained in obsolete protocols, and funding that could have been directed toward more accurate technologies. The AI analog is not hypothetical. The costs of premature stabilization are being accumulated now, and they will fall on workers, students, and citizens whose lives are being reshaped on the basis of claims that have not yet earned the handbook status they are being accorded.
Fleck's reconstruction of the Wassermann reaction's history forms the central empirical case of Genesis and Development of a Scientific Fact. He drew on his own laboratory experience with the test — he had performed thousands — and on extensive archival research into the protocols, training manuals, and debates that surrounded its adoption across European clinical medicine between 1906 and 1935.
Pathological stabilization. The canonical case of journal knowledge migrating into handbook form through institutional investment rather than technical maturation.
Accommodations as stabilizers. Interpretive protocols, training programs, and legal frameworks that handled ambiguity also stabilized the test against revision.
Investment as argument. The institutional stake itself became the reason for maintaining the test, since questioning it meant questioning everything built on it.
Good-faith actors. The dynamic operates through honest practitioners making reasonable individual accommodations.
AI parallel. Contemporary AI understanding is being stabilized through institutional investment at a speed that risks reproducing the Wassermann pattern.
Historians of medicine have debated whether Fleck's account overstates the unreliability of the Wassermann reaction, pointing to its actual clinical value as a screening tool. Fleck's defenders respond that his point was not that the test was useless but that its migration into handbook status was epistemically premature — and that the pattern he identified generalizes beyond the specific case whose details can be argued.