XHTML is the next generation of HTML that combines HTML and XML. It aims to replace HTML by being a stricter, cleaner version that conforms to XML standards. Key differences from HTML include elements must be properly nested, documents must be well-formed, tag names must be lowercase, and all elements must be closed. There are three document type definitions for XHTML: Strict, Transitional, and Frameset.