This document summarizes a survey on parallel and distributed search engines. It discusses how web search tasks like crawling billions of documents, indexing terabytes of data, and responding to thousands of queries simultaneously require a parallel or distributed approach. It then provides examples of distributed search engines and technologies like MapReduce, and discusses challenges in distributed search like resource representation, selection, and result merging. Finally, it surveys parallel implementations of clustering algorithms and challenges in parallelizing hierarchical agglomerative clustering with MapReduce.