The document discusses detecting patterns in news media content through automated analysis. It describes building a corpus of over 30 million news items from more than 1,300 global media sources and applying techniques like machine learning, natural language processing and statistical machine translation to analyze the data at scale. The goal is to answer previously intractable questions about the global mediasphere and media systems.