This document discusses the evaluation of search systems for digital cultural heritage collections. It argues that evaluation must consider more than search results alone and examine the entire system and user experience from multiple perspectives. Effective evaluation combines system-oriented and user-oriented methods, and it accounts for differences among users and for the contexts of their information needs and goals. The document outlines the challenges of evaluating complex systems that go beyond basic search to include features such as recommendations, visualizations, and taxonomies. As an example, it presents the evaluation framework used for the PATHS system, which supported exploratory search and sense-making; the framework covered various system components and user tests.