This document provides an overview of link analysis and summarizes several common approaches:
1. Early approaches scored a site's popularity by counting its incoming and outgoing links, but treated every link as equally important regardless of its source. HITS introduced the concepts of hubs and authorities: an authority is a page that receives links from good hubs, and a hub is a page that links to good authorities.
2. PageRank assigns each page a weight derived from the ranks of its parent pages (the pages that link to it), with each parent's contribution divided by that parent's outdegree. The resulting score models the probability that a random web surfer is on the page.
3. Link analysis approaches like HITS and PageRank have limitations, such as failing to account for dangling nodes, which are pages with no outgoing links. A common modification gives the random surfer a small probability of restarting from a randomly chosen page.
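
The approaches summarized above can be illustrated with a minimal sketch in plain Python. It is not taken from the document: the function names `pagerank` and `hits`, the dictionary-based graph format, the damping value of 0.85, and the choice to spread the rank of dangling pages evenly over all pages are assumptions made for illustration.

```python
from math import sqrt


def _all_nodes(links):
    """Collect every page that appears as a source or as a link target."""
    return sorted(set(links) | {v for targets in links.values() for v in targets})


def pagerank(links, damping=0.85, iterations=100, tol=1e-10):
    """Power iteration for PageRank with random restarts.

    links maps each page to the list of pages it links to (hypothetical format).
    With probability (1 - damping) the surfer restarts from a random page.
    """
    nodes = _all_nodes(links)
    n = len(nodes)
    rank = {u: 1.0 / n for u in nodes}
    for _ in range(iterations):
        # Assumption: rank held by dangling pages (no outgoing links)
        # is redistributed evenly over all pages.
        dangling = sum(rank[u] for u in nodes if not links.get(u))
        new = {u: (1.0 - damping) / n + damping * dangling / n for u in nodes}
        for u in nodes:
            targets = links.get(u, [])
            for v in targets:
                # Each parent passes on its rank divided by its outdegree.
                new[v] += damping * rank[u] / len(targets)
        delta = sum(abs(new[u] - rank[u]) for u in nodes)
        rank = new
        if delta < tol:
            break
    return rank


def hits(links, iterations=50):
    """Iterative hub and authority scores, normalized to unit length."""
    nodes = _all_nodes(links)
    hub = {u: 1.0 for u in nodes}
    for _ in range(iterations):
        # Authority score: sum of the hub scores of pages linking in.
        auth = {u: 0.0 for u in nodes}
        for u in nodes:
            for v in links.get(u, []):
                auth[v] += hub[u]
        # Hub score: sum of the authority scores of pages linked to.
        hub = {u: sum(auth[v] for v in links.get(u, [])) for u in nodes}
        for scores in (auth, hub):
            norm = sqrt(sum(x * x for x in scores.values())) or 1.0
            for u in scores:
                scores[u] /= norm
    return hub, auth


# Small example graph; page "E" is a dangling node.
graph = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"], "E": []}
ranks = pagerank(graph)
hub_scores, authority_scores = hits(graph)
```

In this example graph, page `C` is linked to by `A`, `B`, and `D`, so it receives the highest PageRank and authority score, while `A` links to two well-cited pages and scores highest as a hub. The dangling page `E` still receives a nonzero PageRank because of the restart term.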