The Mozilla Foundation, collaborating with scholars at the Ostrom Workshop at Indiana University, has developed a practical framework for applying Ostrom's design principles to data commons governance. The framework identifies the specific institutional features that data governance arrangements require: clear definitions of the data commons' boundaries (what data is included, who has contributed it, who may access it), congruent rules for data use (different rules for different types of data and different contexts of use), collective-choice mechanisms (processes through which contributors participate in governance decisions), and monitoring systems (methods for tracking how data is used and whether use conforms to governance rules).
The framework responds to the training data question by demonstrating that neither privatization nor state regulation exhausts the available institutional options. The contributors to the training-data commons did not produce their contributions as property, and retroactively imposing property frameworks on data generated under different institutional assumptions creates distortions that may exceed the problem they address. But they did produce their contributions under implicit governance arrangements — norms of the open internet, terms of service, licensing frameworks — that can be examined and, where appropriate, updated through explicit community processes.
The practical challenges are significant. Contributors number in the millions and are distributed across every jurisdiction on earth. They have no pre-existing organizational structure, no shared identity, no institutional infrastructure through which to exercise collective governance. The data has already been incorporated into models that cannot easily disaggregate individual contributions. The corporations that hold the data have both the legal position and the economic incentive to resist governance arrangements that would constrain their use of it.
These are real obstacles but not unprecedented in Ostrom's empirical record. Communities with no pre-existing organizational structure, no shared identity, and no institutional infrastructure have developed effective governance arrangements in response to resource crises. The formation of self-governing institutions is itself an institutional achievement — requiring entrepreneurial individuals, catalytic events, and the recognition among potential community members that their individual interests are better served by collective action than by continued isolation.
Whether conditions for this kind of institutional formation exist in the training-data context is an empirical question that cannot be answered by theory alone. The framework's contribution is to identify what the institutional work would look like, not to prejudge whether it can be successfully undertaken.
The Mozilla-Ostrom Workshop collaboration emerged from parallel recognitions at both institutions that AI training data required governance frameworks the dominant discourse was not providing. Mozilla's public-interest technology orientation combined with the Ostrom Workshop's institutional analysis expertise produced the framework as a concrete operational resource for communities, policymakers, and researchers engaging with the data commons question.
Four institutional features. Boundaries, congruent rules, collective choice, and monitoring structure effective data commons governance.
Beyond the false binary. Neither privatization nor state regulation alone addresses the core question of who makes the governance arrangements.
Practical but difficult. Scale, jurisdictional diversity, and power asymmetries make this harder than any commons Ostrom studied.
Institutional formation required. The framework identifies what must be built, not whether it can be built — an empirical question.