In this presentation, I first give a brief overview of descriptive corpus linguistics and the exploitation of descriptive corpora for pedagogic purposes within the framework of data-driven learning. Following a critical evaluation from a cognitive discourse perspective, I then argue for a pedagogic corpus approach in which corpus conceptualization and design are vitally shaped by the pedagogic orientation itself. This concerns in particular pedagogic motivation of corpus content development, including corpus annotation with pedagogically relevant thematic and linguistic categories, content enrichment to facilitate authentication, as well as online searches beyond the familiar concordances. The approch is illustrated with reference to copora available from the EU project “BACKBONE – Corpora for Content and Language Integrated Learning”. The BACKBONE corpora contain video interviews in English, German, French, Polish, Spanish and Turkish as well as in European manifestations of English as a Lingua Franca (ELF); they are freely available for non-commercial educational use. Moodle-based pilot courses are deployed to demonstrate how BACKBONE search results can be used to facilitate individual and collaborative learning activities in blended learning settings.