This document discusses the definition and key aspects of a corpus in linguistics. It defines a corpus as a large collection of text samples that are selected and organized according to linguistic criteria. The corpus aims to represent a given language, dialect, or subset of language. It should contain a diverse range of authentic texts and be large enough to characterize different varieties and uses of the language. Important qualities of a corpus include quantity, quality, representativeness, simplicity, equality, retrievability, verifiability, augmentation, documentation, and management.