The increasing deployment of distributed systems to solve
large data and computational problems has not seen a concomitant
increase in tools and techniques to test these systems.
In this paper, we propose a data-driven approach to
testing. We translate our intuitions and expectations about
how the system should behave into invariants, the truth of
which can be verified from data emitted by the system. Our
particular implementation of the invariants uses Q, a high-performance
analytical database, programmed with a vector
language.
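
To make the idea of an invariant concrete, the following is a minimal sketch in Python rather than Q. It is an illustration only, not the implementation described in this paper. The event layout (timestamp, node, kind, message id) and the function name `recv_without_prior_send` are hypothetical. The invariant checked is one plausible expectation about a message-passing system: every received message must have been sent at an earlier time.

```python
# Hypothetical event records emitted by a distributed system:
# (timestamp, node, kind, msg_id), where kind is "send" or "recv".
events = [
    (1, "a", "send", "m1"),
    (2, "b", "recv", "m1"),
    (3, "a", "send", "m2"),
    (5, "b", "recv", "m3"),  # never sent: violates the invariant
    (4, "b", "recv", "m2"),
]


def recv_without_prior_send(events):
    """Return msg_ids of receives that have no send with an earlier timestamp."""
    # Earliest send time observed for each message id.
    first_send = {}
    for ts, _node, kind, msg_id in events:
        if kind == "send":
            first_send[msg_id] = min(ts, first_send.get(msg_id, ts))

    # A receive is valid only if some send of the same id precedes it.
    violations = []
    for ts, _node, kind, msg_id in events:
        if kind == "recv":
            sent_at = first_send.get(msg_id)
            if sent_at is None or sent_at >= ts:
                violations.append(msg_id)
    return violations


violations = recv_without_prior_send(events)
print(violations)  # prints ['m3']
```

An empty result means the invariant holds for the data collected. A non-empty result identifies the specific records to investigate, which is the sense in which the testing is driven by data the system emits.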
