XML is a markup language that is used to define and store data in a structured format. It allows data to be separated from its presentation and is extensible to add new tags. An XML document must have a root element and follow syntax rules to be well-formed. It can also be validated against a DTD or schema to check that the elements and structure match the definitions.