The document discusses the challenges of testing big data applications and proposes the test pyramid as an approach. It gives an overview of big data concepts such as Hadoop, HDFS, MapReduce, and HBase. Testing big data is hard because of long feedback cycles and data dependencies. The test pyramid advocates automating basic tests such as unit tests, and relies on techniques such as sampling and test automation to counter long test cycles. The document also gives a high-level example of using a Hive query validator as part of the testing approach.
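
The sampling idea mentioned above can be illustrated with a minimal sketch. It is not taken from the document: the function names (`in_sample`, `sample_records`, `count_events_per_user`), the record layout, and the choice of hash-based sampling are all assumptions made for illustration. The sketch samples by key rather than by row, so every record belonging to a selected key is kept. That is one plausible way to shorten a test run while preserving the dependencies between related records.

```python
import hashlib


def in_sample(key: str, percent: int) -> bool:
    """Decide deterministically whether `key` belongs to the sample.

    Hashing the key (instead of drawing a random number) means the same
    keys are selected on every run, so test results are reproducible.
    """
    digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
    bucket = int(digest, 16) % 100  # map the key to a bucket in 0..99
    return bucket < percent


def sample_records(records, key_field, percent):
    """Keep all records whose key falls in the sample (hypothetical helper)."""
    return [r for r in records if in_sample(str(r[key_field]), percent)]


def count_events_per_user(records):
    """Toy aggregation standing in for the job under test (hypothetical)."""
    counts = {}
    for r in records:
        counts[r["user_id"]] = counts.get(r["user_id"], 0) + 1
    return counts


# Made-up input: 1000 events spread evenly over 50 users (20 events each).
records = [{"user_id": f"user{i % 50}", "event": "click"} for i in range(1000)]

# Sample roughly 20% of the users; each sampled user keeps all of their events.
sample = sample_records(records, "user_id", 20)
counts = count_events_per_user(sample)
```

Because whole keys are kept or dropped together, a per-user aggregate computed on the sample equals the aggregate for that user on the full data set, so a fast test on the sample still checks the real logic. The actual number of users selected depends on the hash values and is only approximately the requested percentage.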