Beautiful Soup is a Python library that allows users to parse HTML and XML documents, extract data from them, and modify the parse tree. It transforms complex documents into a tree of Python objects representing the document structure, including tags, strings, comments, and the document itself. The main objects are tags, strings, BeautifulSoup, and comments.