This document discusses machine learning challenges posed by hypertext and the web. It presents two examples of applying machine learning to hypertext documents: 1) semi-supervised learning to classify topics of hypertext documents using both text and hyperlinks, and 2) classifying interconnected entities by labeling graphs with many classes. The author proposes models that combine text and link information to better learn from hypertext documents and address issues like "topic drift".