This document provides an overview of Cassandra. Key points:
- Cassandra is a distributed database designed to handle large amounts of structured data spread across many commodity servers. It provides high availability with no single point of failure.
- Data is replicated across multiple nodes for fault tolerance. Nodes communicate using a gossip protocol to detect failures.
- The data model differs from that of relational databases: instead of tables, data is organized into keyspaces, which contain column families, which in turn contain rows. Columns are dynamic rather than fixed, so each row can carry its own set of columns.
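
To make the data model in the last point concrete, here is a minimal in-memory sketch in Python. It is a toy illustration only and does not use the Cassandra API or any driver: the class `ToyKeyspace`, its `insert` and `get` methods, and the sample names (`demo`, `users`, `alice`, `bob`) are all hypothetical. It only shows the nesting of keyspace, column family, row, and columns, and that two rows in the same column family can hold different columns.

```python
from collections import defaultdict


class ToyKeyspace:
    """Toy model (not Cassandra itself):
    keyspace -> column family -> row key -> {column name: value}."""

    def __init__(self, name):
        self.name = name
        # Column families and rows are created on first write.
        self.column_families = defaultdict(lambda: defaultdict(dict))

    def insert(self, column_family, row_key, columns):
        """Add or overwrite columns on a row; rows need not share columns."""
        self.column_families[column_family][row_key].update(columns)

    def get(self, column_family, row_key):
        """Return a copy of the row's columns, or {} if the row is absent."""
        cf = self.column_families.get(column_family, {})
        return dict(cf.get(row_key, {}))


ks = ToyKeyspace("demo")
ks.insert("users", "alice", {"email": "alice@example.com"})
ks.insert("users", "bob", {"email": "bob@example.com", "city": "Oslo"})
# Columns are dynamic: a new column is added to one row only.
ks.insert("users", "alice", {"last_login": "2024-01-01"})

print(sorted(ks.get("users", "alice")))  # ['email', 'last_login']
print(sorted(ks.get("users", "bob")))    # ['city', 'email']
```

The nested dictionaries stand in for the hierarchy only; replication, gossip, and persistence are deliberately left out of this sketch.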