This presentation covers text processing for procedural question answering, focusing on the identification of instructional compounds and titles. It outlines the global architecture of the processing system, which includes HTML cleaning and title identification based on a combination of linguistic and visual clues. The main issues addressed are filtering out the noise found in web pages and refining the hierarchy between tasks and their sub-tasks.
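
To make the architecture concrete, here is a minimal sketch of how HTML cleaning and clue-based title identification could fit together. It is not the system described in the presentation: the tag sets, the cue phrases, the weights, the threshold, and the names `BlockExtractor`, `title_score`, and `find_title_candidates` are all assumptions chosen for illustration, and it relies only on Python's standard `html.parser` module.

```python
from html.parser import HTMLParser

# All clue sets and weights below are hypothetical; the source does not list them.
HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}    # visual clue: heading markup
EMPHASIS_TAGS = {"b", "strong"}                        # visual clue: bold text
NOISE_TAGS = {"script", "style", "nav", "footer"}      # treated as web-page noise
BLOCK_TAGS = HEADING_TAGS | {"p", "li", "div"}
LINGUISTIC_CUES = ("how to", "steps to", "instructions for")


class BlockExtractor(HTMLParser):
    """Collect text blocks, skipping noise elements and recording visual clues."""

    def __init__(self):
        super().__init__()
        self.blocks = []          # list of (text, set of tags seen in the block)
        self._text = []
        self._tags = set()
        self._noise_depth = 0

    def flush(self):
        # Normalise whitespace and store the block if it contains any text.
        text = " ".join("".join(self._text).split())
        if text:
            self.blocks.append((text, self._tags))
        self._text, self._tags = [], set()

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self._noise_depth += 1
        elif tag in BLOCK_TAGS:
            self.flush()
            self._tags.add(tag)
        elif tag in EMPHASIS_TAGS:
            self._tags.add(tag)

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS:
            self._noise_depth = max(0, self._noise_depth - 1)
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_data(self, data):
        # Text inside a noise element is dropped (the "HTML cleaning" step).
        if self._noise_depth == 0:
            self._text.append(data)


def title_score(text, tags):
    """Combine visual and linguistic clues into a single (assumed) score."""
    score = 0
    if tags & HEADING_TAGS:
        score += 2
    if tags & EMPHASIS_TAGS:
        score += 1
    if text.lower().startswith(LINGUISTIC_CUES):
        score += 2
    if not text.endswith("."):    # titles rarely end with a full stop
        score += 1
    return score


def find_title_candidates(html, threshold=3):
    parser = BlockExtractor()
    parser.feed(html)
    parser.close()
    parser.flush()                # keep any trailing text after the last block
    return [text for text, tags in parser.blocks
            if title_score(text, tags) >= threshold]
```

Calling `find_title_candidates` on a page returns the blocks whose combined score reaches the threshold. A real system would need richer clues than these, and would still have to build the task and sub-task hierarchy from the titles it finds, which this sketch does not attempt.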