The document discusses the development of corpora at the University of Nottingham, covering both mono-modal corpora, which contain a single type of data (text-based), and multi-modal corpora, which combine several data types (text, video, audio). It describes the Nottingham Multi-Modal Corpus and the Nottingham Learner Corpora as examples. The Nottingham eLanguage Corpus aims to collect diverse types of digital language data from individuals, including SMS, email, social media, web browsing history, and location data. One challenge is modeling how language varies with dynamic contextual factors. As a case study, the document outlines the Thrill corpus, which contains synchronized audio, video, and sensor data from fairground rides, to examine linguistic patterns across different phases.