This document discusses methods for detecting off-topic pages in web archives. It begins with examples of how pages go off-topic over time, for instance through database errors, financial problems, hacking, or domain expiration. It then examines the behavior of "timemaps", which list the archived versions of a page over time. The document outlines several detection methods: comparing textual content using cosine similarity or Jaccard similarity, comparing page semantics using a search engine kernel function, and measuring structural changes such as word counts. It evaluates these methods on manually labeled archive collections and finds that combining three methods provides the best results. Finally, it describes a publicly available tool for detecting off-topic pages in archives and applies
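To make the textual and structural measures named above concrete, here is a minimal sketch of cosine similarity, Jaccard similarity, and relative word-count change between two captures of a page. It is an illustration under stated assumptions, not the document's tool: the function names, the simple regex tokenizer, the use of raw term frequencies, and the idea of comparing a later capture against a reference capture are all hypothetical choices made for this example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens.

    Assumption: a simple regex tokenizer; real pipelines usually also
    strip HTML boilerplate and stop words first.
    """
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between raw term-frequency vectors."""
    tf_a, tf_b = Counter(tokenize(text_a)), Counter(tokenize(text_b))
    # Only terms present in both texts contribute to the dot product.
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty capture shares nothing with the other one
    return dot / (norm_a * norm_b)


def jaccard_similarity(text_a, text_b):
    """Size of the shared vocabulary divided by the combined vocabulary."""
    set_a, set_b = set(tokenize(text_a)), set(tokenize(text_b))
    if not set_a and not set_b:
        return 1.0  # two empty captures are treated as identical
    return len(set_a & set_b) / len(set_a | set_b)


def word_count_change(reference, candidate):
    """Relative change in word count from the reference capture.

    Negative values mean the candidate capture is shorter.
    """
    ref_n = len(tokenize(reference))
    cand_n = len(tokenize(candidate))
    if ref_n == 0:
        return 0.0
    return (cand_n - ref_n) / ref_n


if __name__ == "__main__":
    # Hypothetical captures: an on-topic page and an expired-domain page.
    reference = "election news and candidate coverage for the election"
    candidate = "this domain has expired and is for sale"
    print(cosine_similarity(reference, candidate))
    print(jaccard_similarity(reference, candidate))
    print(word_count_change(reference, candidate))
```

Low similarity scores or a sharp drop in word count would suggest that a capture has drifted off-topic. Any thresholds for flagging a capture would have to be tuned on labeled data and are deliberately left out of this sketch.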