The document discusses evaluation methods for unsegmented speech retrieval and proposes modifications to the mean generalized average precision (mGAP) measure. It summarizes a study of user behavior in a simulated retrieval task, in which participants navigated audio recordings to identify relevant passages. The study found that users prefer playback points that lie before the true starting point of a passage and tolerate playback points up to 1 to 2 minutes away from it. Based on these results, the document proposes modifying the penalty function used in mGAP so that it gives a higher reward to points that precede a passage and maintains a reward for points within 1 to 2 minutes of the true starting point. In a comparison, scores computed with the modified penalty function correlate highly with scores from the original measure.
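To make the idea of an asymmetric penalty function concrete, the following is a minimal Python sketch. It is an illustration only: the function names, the linear (triangular) shape, and the window widths of 150 and 300 seconds are assumptions chosen for the example, not the values or the exact function proposed in the document.

```python
def triangular_penalty(offset_s, width_s=150.0):
    """Symmetric reward in [0, 1] for a retrieved playback point.

    offset_s is the signed distance in seconds between the retrieved point
    and the true starting point of a relevant passage. The reward is 1 at
    the true start and falls linearly to 0 at |offset_s| >= width_s.
    The width of 150 s is an assumed, illustrative value.
    """
    return max(0.0, 1.0 - abs(offset_s) / width_s)


def asymmetric_penalty(offset_s, width_before_s=300.0, width_after_s=150.0):
    """Hypothetical asymmetric variant of the reward above.

    A negative offset_s means the playback point lies BEFORE the true
    start. Using a wider window on that side gives such points a higher
    reward than points equally far AFTER the start. Both widths are
    assumptions made for this sketch.
    """
    width = width_before_s if offset_s < 0 else width_after_s
    return max(0.0, 1.0 - abs(offset_s) / width)


if __name__ == "__main__":
    # Compare the two functions at points 75 s before and after the start.
    for offset in (-75.0, 0.0, 75.0):
        print(offset, triangular_penalty(offset), asymmetric_penalty(offset))
```

With these assumed widths, a point 75 seconds before the true start keeps a larger share of the reward than a point 75 seconds after it, which mirrors the preference reported in the user study. In a GAP-style measure, such a reward would stand in for the binary relevance of each retrieved point.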