Sensible testing of hypervisor behaviour is a complicated task. Checking whether guest OSes boot and install properly is certainly useful, but this only covers a fraction of the guest/hypervisor interfaces. x86 in particular has large quantities of architecture which isn't used by any modern OS.
Unit testing on the other hand would be a great, if unit testing a kernel were a plausible task in general. XTF takes an alternative approach, and allows for component level testing from a unikernel-like perspective.
It is amazing what you find from this viewpoint.