This document discusses research on multi-modal scientific communication and presentations. It describes three scenarios for studying audience reception and attention allocation during presentations, using eye-tracking and think-aloud methods. Scenario I examines live presentations across various fields and finds that the focus of audience attention differed by discipline. Scenarios II and III were to be conducted in a lab setting. The conclusions propose hypotheses about how meaning is composed from multiple communication elements and how coherence is achieved in multi-modal presentations through design and signaling. The overall goal is to better understand knowledge transfer through different presentation modes.