-Kudu is a new storage layer for the Hadoop ecosystem that enables fast analytics on fast data; it splits the difference between the fast read/write of HBase and the fast scans of HDFS...while compromising minimally on performance. It can pair with Spark, Impala, or MapReduce.
-In the past, a lambda architecture was needed to run analytics on real-time data – that is, a complex architecture that created separate a “speed layer” for rapid availability/query/updates, and a “batch layer” for running analytics scans. This was complicated and took lots of tuning.
-With Kudu, the Apache ecosystem now has a simplified storage solution for analytic scans on rapidly updating data, eliminating the need for the aforementioned hybrid lambda architectures.