Cultural heritage institutions, like the Library of Congress, are evolving to integrate big data within their collections, making them accessible both as collections and as datasets. Examples include the National Digital Newspaper Program, web archives, and congressional information, which provide valuable resources for researchers seeking to analyze historical trends and datasets. As more researchers seek to mine and utilize these collections independently, the institutions face challenges related to data management, preservation, and the development of infrastructure to support expanded access and services.