This document discusses various methods for evaluating the usability of systems, including both analytic methods conducted by experts and empirical methods involving observations of and surveys with users. Empirical evaluations aim to draw valid conclusions about real-world usage but can be challenging due to issues with the representativeness of test users, the realism of test contexts and tasks, and whether collected data truly reflects real impacts. Field studies observe users in realistic contexts but are time-consuming, while lab studies allow more control but also reduce realism. Interviews rely on subjective user memory and perspective. Statistics like t-tests and ANOVAs can be used to analyze empirical data and determine statistical significance.