This document gives an overview of web mining and summarizes its key concepts. It begins by defining data mining and web mining, then discusses the three categories of web mining: web content mining, web usage mining, and web structure mining. It also introduces the matrix representations used for web data, including document-keyword co-occurrence matrices, adjacency matrices, and usage matrices. Finally, it outlines two common similarity functions: the Pearson correlation coefficient and cosine similarity.
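
As a rough illustration of the two similarity functions named above, the sketch below computes each for a pair of rows taken from a small usage matrix. The function names, the sample matrix, and the convention of returning 0.0 for a zero-length or constant vector are assumptions made for this example, not definitions from the document.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Assumed convention: similarity with an all-zero vector is 0.0.
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def pearson_correlation(u, v):
    """Pearson correlation, computed as the cosine of the mean-centered vectors."""
    mean_u = sum(u) / len(u)
    mean_v = sum(v) / len(v)
    centered_u = [a - mean_u for a in u]
    centered_v = [b - mean_v for b in v]
    # A constant vector centers to all zeros, so this returns 0.0 (assumed convention).
    return cosine_similarity(centered_u, centered_v)


# Hypothetical usage matrix: one row per user session, one column per page,
# each entry a visit count.
usage_matrix = [
    [3, 0, 1, 2],
    [6, 0, 2, 4],
    [0, 5, 0, 1],
]

sim_cos = cosine_similarity(usage_matrix[0], usage_matrix[1])
sim_pearson = pearson_correlation(usage_matrix[0], usage_matrix[2])
```

The first two rows are proportional, so their cosine similarity is close to 1, while the first and third rows visit mostly different pages and score much lower. Pearson correlation differs from cosine similarity only in subtracting each vector's mean first, which makes it insensitive to a session's overall activity level.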