Distributed Storage: CAP Theorem & NoSQL Systems

CAP theorem fundamentals, the NoSQL landscape, and an architectural comparison of Bigtable (CP, wide-column) and Dynamo (AP, key-value).

Distributed storage systems must make explicit trade-offs between consistency, availability, and partition tolerance — as formalized by Brewer's CAP theorem. This section covers the two most studied systems in the DSA-AS exam: Google Bigtable (a CP wide-column store) and Amazon Dynamo (an AP key-value store).

💡 Why do we need distributed storage?

A single machine's disk is limited in capacity, speed, and fault tolerance. Distributed storage spreads data across many nodes, enabling: (1) horizontal scalability beyond single-machine limits, (2) fault tolerance via replication, and (3) geographic distribution for low-latency access.

CAP Theorem

Brewer's CAP theorem (2000, formally proved by Gilbert & Lynch 2002) states that a distributed system can guarantee at most two of the three properties: Consistency, Availability, and Partition Tolerance. Since partitions are inevitable in real networks, every distributed system must choose between CP or AP.

CAP Theorem — Interactive Triangle

Click a zone (CP, AP, or CA) to see which systems live there and why.

Click a zone (CP, AP, CA) or the colored triangle sectors to explore

# CAP Theorem (Brewer 2000, formally proved 2002)

In any distributed system, during a network partition (P),

you must choose between Consistency (C) and Availability (A).

Note: Partitions are inevitable in real networks →

every distributed system is either CP or AP (not CA).

Consistency (C)

Every read receives the most recent write (or an error). All nodes see the same data at the same time. Equivalent to linearizability.

Availability (A)

Every request receives a response (not an error), though the response may be stale. Every non-failing node must return a response.

Partition Tolerance (P)

The system continues operating despite arbitrary network partitions (messages lost or delayed between nodes). Required by all distributed systems.

Exam tip: PACELC model

CAP only addresses behavior during partitions. The PACELC model extends it: even without partitions (E), systems must trade Latency (L) for Consistency (C). Dynamo is PA/EL (available during partition, low-latency otherwise). Bigtable is PC/EC (consistent during partition, consistent otherwise).

NoSQL System Classification

NoSQL systems are classified by their data model. Each type optimizes for a different query pattern and makes different trade-offs.

🔑Key-Value Store

Simplest model: opaque key → opaque value. Extremely fast, no schema, no range queries.

DynamoDBRedisRiakVoldemort

No structured queries, no secondary indexes, client handles data format.

📊Wide-Column Store

Row key + dynamic columns. Efficient for sparse data, supports range scans on row key.

BigtableHBaseCassandraScylla

Column families must be declared; queries limited to row key range scans.

📄Document Store

JSON/BSON documents with nested structures. Supports secondary indexes and rich queries.

MongoDBCouchDBFirestoreCouchbase

Write amplification for indexed fields; consistency tradeoffs for distribution.

🕸️Graph Store

Nodes and edges with properties. Optimized for relationship traversal queries.

Neo4jAmazon NeptuneTigerGraphJanusGraph

Poor horizontal scalability; niche use cases; hard to shard graph relationships.

Storage Layout: Row-Oriented vs Column-Oriented

How data is physically stored on disk dramatically affects query performance. Row-oriented stores are optimized for OLTP (transactional) workloads; column-oriented (columnar) stores for OLAP (analytical) workloads.

Row-oriented vs. column-oriented storage layout comparison
Property	Row-Oriented	Column-Oriented
Physical layout	All columns of a row stored together	All values of a column stored together
Examples	MySQL, PostgreSQL, Bigtable (per-tablet)	BigQuery, Redshift, Parquet, ORC
Read pattern	Efficient for SELECT * (all columns of few rows)	Efficient for SELECT avg(col) (one column, all rows)
Write pattern	Fast single-row writes (one seek)	Slow writes (update N column files)
Compression	Limited (heterogeneous row data)	Excellent (column values are homogeneous, similar values together)
OLTP suitability	Excellent	Poor
OLAP suitability	Poor	Excellent
Projection pushdown	Not applicable	Skip unneeded columns entirely (no I/O)

💡 Bigtable's column families

Bigtable is technically a wide-column store but stores data row-by-row within a tablet. However, Bigtable can be configured to compress data within a column family together, giving some columnar benefits. This is different from true columnar stores like BigQuery which decompose tables into per-column files.

Deep Dives

Explore each storage system in detail with interactive diagrams and exam questions.

Bigtable

CP · Wide-Column · Range-Partitioned

Google's distributed storage system. 3-level tablet hierarchy, SSTable + memtable internals, Chubby integration, and the webtable data model.

3-Level HierarchySSTableBloom FiltersCompactionChubby

9 exam questionsOpen deep dive →

Amazon Dynamo

AP · Key-Value · Consistent Hashing

Amazon's always-available shopping cart storage. Consistent hashing ring, vector clocks, sloppy quorum, hinted handoff, and Merkle tree anti-entropy.

Consistent HashingVector ClocksSloppy QuorumHinted HandoffMerkle Trees

7 exam questionsOpen deep dive →