OpenAI and other large AI companies are lobbying for regulation in the US that would create barriers preserving their competitive advantage. Open source models, however, are becoming increasingly competitive through techniques such as training on smaller, specialized datasets, low-rank parameterization, and quantization. Progress in AI will be driven less by model size than by the curation and management of specialized, minimal, modular datasets for training and evaluation, which creates an opportunity for the data management community. Curation will determine success because it enables specialized models trained on trusted data to produce correct, verifiable results.