This document discusses research on detecting deception in real-time audio and video streams. It outlines the challenges of synchronizing, capturing, indexing, and analyzing multiple streams, and proposes using MPEG-7 semantic annotations to generate knowledge bases for analysis. The research tests an infrastructure for capturing, storing, and retrieving segmented streams in SQL Server 2008, and demonstrates a prototype avatar animation controlled by Python scripts. Further study is needed on the visual concept models and the detection analysis engine.
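
As a rough illustration of storing and retrieving segmented streams, the sketch below indexes segments from several streams by time interval and returns those that overlap a query window. It is only a minimal sketch under stated assumptions: it uses Python's built-in sqlite3 module as a stand-in for SQL Server 2008, and the table layout, the function names (open_store, store_segment, segments_overlapping), and the annotation labels are hypothetical, not taken from the research described here.

```python
import sqlite3


def open_store():
    """Create an in-memory store (SQLite stand-in for SQL Server 2008)."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: one row per captured segment of one stream.
    conn.execute(
        """CREATE TABLE segment (
               id         INTEGER PRIMARY KEY,
               stream     TEXT    NOT NULL,  -- e.g. 'audio' or 'video'
               start_ms   INTEGER NOT NULL,  -- segment start on a shared clock
               end_ms     INTEGER NOT NULL,  -- segment end on a shared clock
               annotation TEXT,              -- free-text semantic label
               payload    BLOB               -- encoded media bytes
           )"""
    )
    conn.execute("CREATE INDEX idx_segment_time ON segment (start_ms, end_ms)")
    return conn


def store_segment(conn, stream, start_ms, end_ms, annotation, payload=b""):
    """Insert one segment; timestamps are milliseconds on the shared clock."""
    conn.execute(
        "INSERT INTO segment (stream, start_ms, end_ms, annotation, payload) "
        "VALUES (?, ?, ?, ?, ?)",
        (stream, start_ms, end_ms, annotation, payload),
    )


def segments_overlapping(conn, start_ms, end_ms):
    """Return segments from any stream that overlap [start_ms, end_ms)."""
    rows = conn.execute(
        "SELECT stream, start_ms, end_ms, annotation FROM segment "
        "WHERE start_ms < ? AND end_ms > ? "
        "ORDER BY start_ms, stream",
        (end_ms, start_ms),
    )
    return rows.fetchall()


conn = open_store()
# Illustrative data only; these labels are invented for the example.
store_segment(conn, "video", 0, 2000, "subject seated")
store_segment(conn, "audio", 500, 1500, "speech")
store_segment(conn, "video", 2000, 4000, "gaze aversion")

for row in segments_overlapping(conn, 1000, 2500):
    print(row)
```

Keying every stream to one shared clock lets a single interval query pull back the audio and video segments that cover the same moment. That is one simple way to approach the synchronization and indexing challenges mentioned above.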