This document discusses integrating Apache Hadoop with Apache Cassandra. It gives an overview of each technology, describing Hadoop as a framework for distributed processing of large datasets and Cassandra as a distributed database. It then describes a system built from four Cassandra nodes and a Hadoop cluster with Hive and Pig, in which sample data is loaded into Cassandra using Pig scripts and then analyzed using MapReduce or Pig. The document notes that this open-source approach is now available commercially as DataStax Enterprise, which combines Cassandra, Hadoop, and Solr into a unified big data platform.