WORK
The Winograd Schema Challenge
Hector Levesque's 2012 test designed to require 'thinking in the full-bodied sense'—sentence pairs whose pronoun resolution demands causal understanding, named for Winograd's original work on reference.
The Winograd Schema Challenge, proposed by Hector Levesque in 2012 and named in honor of
Terry Winograd's pioneering natural language work, consists of sentence pairs differing by a single word that flips pronoun reference: 'The city councilmen refused the demonstrators a permit because they feared violence' (they = councilmen) versus 'because they advocated violence' (they = demonstrators). Resolving the reference requires understanding the relationship
between fearing violence and denying permits, between advocating violence and being refused—what Levesque called 'thinking in the full-bodied sense,' not surface pattern-matching. The challenge was designed as a successor to the
Turing Test, targeting the specific gap between statistical competence and genuine comprehension.
In The You On AI Field Guide
By 2023, large language models were passing the Winograd Schema Challenge with accuracy above ninety percent. The original authors conceded, with intellectual honesty mirroring Winograd's own, that the test had been 'soundly defeated.' But the concession came with a puzzle: the models passing the test still appeared, by every