WORK

The Grammar of Science

Karl Pearson’s 1892 positivist manifesto declaring that science describes the regularities of sense-impressions and must remain silent about causes—the philosophical foundation that every large language model embodies by construction, and the most important book in the intellectual history of AI that nobody in the field reads.

Karl Pearson published The Grammar of Science in 1892 as a philosophical manifesto, not a technical manual, and it made a claim so radical it has mostly been absorbed without being noticed: that science does not explain the world, it only describes it, and that asking for anything more is a relic of metaphysics that grown-up minds should outgrow. The field of science, Pearson declared, is unlimited; its material is endless; every phenomenon is data. And the method is one: the classification of facts, the recognition of their sequences and relative significance. A scientific law is nothing more than “a brief description in mental shorthand of as wide a range as possible of the sequences of our sense-impressions.” Cause, for Pearson, was just a very tight correlation—the name we give to an antecedent stage in a routine of perceptions, nothing more occult than that. This is the philosophy that the causal-inference movement, led by Judea Pearl, now diagnoses as a hundred-year wrong turn. It is also—and this is the uncomfortable recognition that the book forces—the philosophy that contemporary machine learning has adopted wholesale, without quite deciding to. A Pearsonian intelligence describes, compresses, predicts, and acts on regularities. It does not explain. It cannot intervene meaningfully. It mistakes the richness of the description for the adequacy of the understanding. The Grammar of Science is the book that explains why, and the book that the [YOU] on AI cycle reads as both a founding document and a warning.

In the [YOU] on AI Field Guide

The cycle’s central challenge is to see the river of intelligence clearly, and one of the most important things to see is what kind of intelligence the river carries. The Grammar of Science identifies it: it is descriptive intelligence, organized around the compression of regularities, operating without any account of cause. This is not an incidental feature of current AI systems that will be engineered away; it is the philosophical doctrine their training embodies. Pearson argued that this was the whole of science. Pearl argues that this is the floor of intelligence, not the ceiling. The debate between them is now an empirical question, and the answer will determine what AI systems can and cannot do when deployed in consequential contexts.

The book also supplies the intellectual genealogy of what the cycle calls the lab coat of objectivity: the appearance of value-neutrality that quantitative methods acquire when their underlying value choices are invisible. Pearson’s positivism presented itself as the most rigorous and least metaphysical possible account of knowledge. The value choices embedded in deciding which facts to classify, which sequences to recognize as significant, which regularities to act on—these were not visible as value choices, because the method presented itself as merely describing what was there. This is exactly the appearance that algorithmic decision-making systems inherit: “the data shows” carries the same authority, and conceals the same buried assumptions, that “the science shows” carried in Pearson’s era.

The young Einstein recommended The Grammar of Science to his Olympia Academy reading circle in the years before special relativity, finding in its skepticism about absolute space and time an anticipation of the spirit that would reshape physics. The book’s influence on the development of quantum mechanics and operationalism in the philosophy of physics was real and substantial. This is part of what makes it so uncomfortable: it is not a crank text but a serious and influential philosophical work, and the discomfort of inheriting from it is the discomfort that is productive rather than dismissable.

Origin

Pearson wrote The Grammar of Science in 1892, before his major statistical contributions. It preceded the chi-square test, the formalization of the correlation coefficient, the development of the Pearson distributions. The philosophy came first, and the statistics followed from it: if description is the whole of knowledge, then the mathematical tools for precise description are the tools of science. The book went through two revised editions in Pearson’s lifetime (1900 and 1911) and remained in print throughout the early decades of the twentieth century.

The philosophical influences were primarily Ernst Mach, whose The Science of Mechanics (1883) had argued that physical concepts are economical summaries of experiential regularities rather than representations of hidden realities, and Wilhelm Clifford, whose posthumously published mathematical philosophy influenced Pearson’s account of the structure of scientific concepts. Pearson’s positivism was, in its time, a progressive and even radical position: it aligned with the socialist and secularist commitments of his early career, presenting science as a democratizing force against the authority of religious and metaphysical traditions.

The book’s reception history tracks the vicissitudes of positivism itself: celebrated in the early twentieth century, influential in the Vienna Circle and on the logical positivists, challenged by the growth of causal inference and the philosophy of science in the mid-century, and now returned to relevance by the AI systems that instantiate its philosophy at a scale Pearson could not have imagined.

Key Ideas

The unity of method. “The unity of all science consists alone in its method, not in its material.” Every domain is data; the one method is classification and the recognition of sequence. This is—precisely—the operating assumption of a transformer trained on tokens from protein sequences, legal documents, chess games, and English prose simultaneously. The method is indeed one. Whether the unity of method produces unified understanding is the question the book cannot ask.

The law as compressed description. A scientific law, for Pearson, is “a brief description in mental shorthand” of regularities in sense-impressions. The neural network that compresses the statistical regularities of human language into a set of parameters is performing, at enormous scale, exactly this operation. The grammar of description Pearson articulated has been turned into an engineering specification. His philosophy is running on servers.

The banishment of cause. Pearson held that “causation is nothing more than the shorthand description of sequences.” This move, Pearl argues, locked statistics to the first rung of the ladder of causation and produced a hundred years of confusion between description and understanding. The systems trained on Pearson’s philosophy—explicitly or by construction—cannot answer the interventional questions that consequential action requires, precisely because they have no causal model of the world whose regularities they so fluently describe.

The limits of the measurable. Pearson believed that whatever could not be measured was not the proper business of knowledge. AI is the most powerful measurement instrument ever built, and by extending the reach of the measurable so far, it shows us with new clarity what still lies outside it: meaning, value, purpose, the difference between a statement that is fluent and one that is true. The Grammar of Science is most useful today not as a doctrine to follow but as a precise map of those limits—a description of the philosophy that powerful systems embody, which makes its failures predictable rather than mysterious.

Explore more

Browse the full You On AI Field Guide — over 8,500 entries