SGML is a standard for specifying markup languages. It describes how to define document structure separately from presentation. XML is a simplified version of SGML used to store and transport data. Key differences between XML and HTML include XML focusing on data rather than presentation, being case sensitive, requiring closing tags, and preserving whitespace.