1) The document describes a project to develop an automatic system to extract semantic information from unstructured scholarly texts in classics, focusing on named entities and references.
2) The goal is to build knowledge bases integrating information from multiple sources to improve information retrieval over a classics corpus.
3) The project involves building corpora from online archives, processing texts to extract entities and references, and developing techniques to recognize canonical and bibliographic references.