This document gives an overview of a PhD research project that aims to develop an automatic system for extracting structured information from a corpus of unstructured scholarly texts in Classics, in order to improve information retrieval. The project involves building a corpus from open access Classics journal papers, applying natural language processing techniques to identify mentions of people, places, works, and other entities in the texts, and using structured data from existing databases to disambiguate those mentions and automatically generate new indices that link the texts. The expected results are multiple meaningful access points to the information in the corpus and a demonstration that the approach scales.
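
The three steps described above (finding entity mentions, disambiguating them against structured data, and generating an index) can be illustrated with a minimal sketch. It is not the project's actual implementation: the `AUTHORITY` table is a hypothetical stand-in for an existing database, the identifiers and keywords in it are invented for illustration, and simple dictionary lookup with keyword overlap is substituted here for whatever natural language processing and disambiguation techniques the project adopts.

```python
import re
from collections import defaultdict

# Hypothetical authority records standing in for an existing database.
# Each surface form maps to candidates: (identifier, type, context keywords).
AUTHORITY = {
    "Homer": [("person:homer", "person", {"poet", "epic", "iliad"})],
    "Iliad": [("work:iliad", "work", {"homer", "epic", "book"})],
    "Troy": [("place:troy", "place", {"city", "siege", "walls"})],
    "Ajax": [
        ("person:ajax", "person", {"hero", "telamon", "shield"}),
        ("work:sophocles-ajax", "work", {"sophocles", "tragedy", "play"}),
    ],
}


def find_mentions(text):
    """Return (surface form, character offset) pairs for known names in the text."""
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, AUTHORITY)) + r")\b")
    return [(m.group(1), m.start()) for m in pattern.finditer(text)]


def disambiguate(surface, text):
    """Pick the candidate whose keywords overlap most with the words in the text."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return max(AUTHORITY[surface], key=lambda c: len(c[2] & words))


def build_index(docs):
    """Map each entity identifier to the sorted list of documents mentioning it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for surface, _offset in find_mentions(text):
            entity_id, _type, _keywords = disambiguate(surface, text)
            index[entity_id].add(doc_id)
    return {entity: sorted(ids) for entity, ids in index.items()}


# Two invented sentences in which "Ajax" refers to different entities.
docs = {
    "paper-1": "Homer describes the shield of Ajax, son of Telamon, before the walls of Troy.",
    "paper-2": "In the Ajax, Sophocles turns the hero of epic into the subject of a tragedy.",
}
index = build_index(docs)
```

In this toy example the same surface form, "Ajax", is indexed under the person in one document and under the tragedy in the other, which is the kind of distinction that disambiguation against structured data is meant to provide. Each resulting index entry is one access point into the corpus.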