This document discusses corpus linguistics, the study of language through large collections of electronic texts known as corpora. Corpus linguistics collects and analyzes samples of real-world text in order to derive abstract rules that govern a natural language or to explore how languages relate to one another. Corpora were originally compiled by hand but are now largely compiled automatically. Analysis consists of applying an annotation scheme to the texts and mapping the terms of that scheme onto a theoretical model. Working this way lets linguists run experiments on datasets and explore perspectives other than those of the original compilers.
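
The step of applying a scheme to texts and then mapping its terms to a model can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two-sentence corpus is invented, and the tag lexicon, the tag-to-class mapping, and the function names `annotate`, `abstract`, and `class_counts` are hypothetical rather than taken from any real tagset or library.

```python
from collections import Counter

# Toy corpus: invented sentences, not real data.
CORPUS = [
    "the cat sat on the mat",
    "a dog chased the cat",
]

# Hypothetical annotation scheme: each known word gets a fine-grained tag.
TAG_LEXICON = {
    "the": "DET", "a": "DET",
    "cat": "NOUN", "dog": "NOUN", "mat": "NOUN",
    "sat": "VERB", "chased": "VERB",
    "on": "PREP",
}

# Hypothetical mapping from scheme tags to a coarser theoretical model.
TAG_TO_CLASS = {
    "DET": "function", "PREP": "function",
    "NOUN": "content", "VERB": "content",
}


def annotate(sentence):
    """Apply the scheme: pair each word with its tag ("UNK" if unlisted)."""
    return [(word, TAG_LEXICON.get(word, "UNK")) for word in sentence.split()]


def abstract(tagged):
    """Map each scheme tag onto a class of the coarser model."""
    return [(word, TAG_TO_CLASS.get(tag, "unknown")) for word, tag in tagged]


def class_counts(corpus):
    """Count how often each model class occurs across the whole corpus."""
    counts = Counter()
    for sentence in corpus:
        for _, word_class in abstract(annotate(sentence)):
            counts[word_class] += 1
    return counts


if __name__ == "__main__":
    print(class_counts(CORPUS))
```

Keeping the scheme and the model mapping as separate tables means a later analyst could swap in a different mapping over the same annotated texts, which is one way to explore a perspective other than that of the original compilers.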