This document introduces Jsoup, an open source Java library for working with real-world HTML. It provides convenient methods for scraping and parsing HTML content from URLs, files or strings. Jsoup allows selecting elements using DOM traversal or CSS selectors, extracting and manipulating data from elements, and cleaning user-submitted HTML to prevent XSS attacks. Examples demonstrate how to use Jsoup to parse HTML, select elements, extract attributes and text, and work with URLs.