Appendix E — Glossary
Concise definitions; cross-link to first use in the chapters.
- SSTable: Immutable on-disk table used by Cassandra to store partitions and rows.
- TOC (`TOC.txt`): Text file listing the SSTable's components (Data, Index, Summary, etc.).
- `Data.db`: Primary data file containing serialized partitions, rows, and cells.
- `Index.db`: Partition index mapping partition keys to `Data.db` offsets.
- `Summary.db`: Sampled partition index (index summary) for faster navigation of `Index.db`.
- `Filter.db`: Bloom filter over partition keys, used to rule out SSTables that cannot contain a requested key (negative lookups).
- `Statistics.db`: SSTable-level metadata, including histograms and timestamps.
- `CompressionInfo.db`: Compression parameters, chunk length, and chunk offsets for compressed `Data.db`; per-chunk checksums are stored inline with the chunks in `Data.db`.
- Digest (`Digest.crc32`): CRC32 checksum of `Data.db`, used for integrity verification.
- VInt: Variable-length integer encoding used for lengths and counters.
- ZigZag: Encoding that maps signed integers to unsigned ones so that values of small magnitude stay small (0, -1, 1, -2 become 0, 1, 2, 3).
- UDT: User-defined type in CQL.
- SAI: Storage-Attached Index; secondary indexing subsystem (incl. vector).
- BTI: Big Trie-Indexed SSTable format family in Cassandra 5.x. See `17-bti-formats.md`.
- TrieMemtable: An opt-in memtable in Cassandra 5.0 that uses a byte-ordered prefix trie (`InMemoryTrie`) for efficient memory usage and reduced GC pressure. Stores partitions using `ByteComparable` keys with prefix sharing; sharded across CPU cores for write concurrency. The default memtable in 5.0 is `SkipListMemtable` (`MemtableParams.java:99`); opt in via `memtable: { class: TrieMemtable }` in the table schema or `cassandra.yaml`. See `04-from-cql-to-disk.md` and `TrieMemtable.java`.
- big format: Legacy SSTable format family (pre-BTI) with the classic `BigTableReader`/`BigTableWriter`. See `02-anatomy-of-an-sstable.md` and `17-bti-formats.md` for contrasts.
- LCS/STCS/TWCS: Leveled, Size-Tiered, and Time-Window compaction strategies. See `15-compaction-strategies.md`.
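
The ZigZag entry above can be made concrete with a few lines of Java. This is a minimal sketch of the generic 64-bit ZigZag mapping, not a copy of Cassandra's code: the class name `ZigZagSketch` and its method names are hypothetical and chosen only for illustration.

```java
// Hypothetical helper for illustration only; not part of Cassandra's API.
final class ZigZagSketch {
    private ZigZagSketch() {}

    /** Maps a signed long so that values of small magnitude get small codes. */
    static long encode(long n) {
        // Shift left by one to free the low bit, then XOR with the sign mask
        // (all ones for negative inputs, zero otherwise).
        return (n << 1) ^ (n >> 63);
    }

    /** Inverse of encode: recovers the original signed value. */
    static long decode(long z) {
        // The low bit carries the sign; the remaining bits carry the value.
        return (z >>> 1) ^ -(z & 1);
    }

    public static void main(String[] args) {
        for (long n : new long[] {0, -1, 1, -2, 2}) {
            long z = encode(n);
            System.out.println(n + " -> " + z + " -> " + decode(z));
        }
    }
}
```

Because the encoded value is small whenever the magnitude of the input is small, a signed number can then be written with an unsigned variable-length encoding such as VInt without negative inputs always taking the maximum width.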

References:
- Cassandra 5.0 source packages: see `references/source-map.md`.