Skip to content

Use Cases

These pages explain when and why to reach for CQLite, not just how to use its APIs. Each page covers a concrete pattern, shows what is working today, and is honest about what is still in progress.

PageWhat it covers
Lakehouse projections with the Cassandra SidecarSSTable-to-Parquet pipeline architecture, the delta-semantics caveat, the high-fidelity type mapping, and embedding the Parquet writer
Python: data science and ETLOffline analytics on snapshots and backups without a cluster; pandas and notebook workflows
Node.js: services and toolingData inspection services, ops dashboards, executeNative + streaming patterns
Operational scenariosMigration validation, test fixtures from production data, backup and snapshot inspection

All four use cases share the same core property: no cluster dependency in the read path. CQLite reads Cassandra 5.0 SSTables directly from the filesystem. That opens up workflows that would otherwise require a live Cassandra cluster — offline analytics, CI fixtures, DR validation, lakehouse projection — while keeping the toolchain simple.

  • Not a real-time replication solution. Reads are per-SSTable batch operations.
  • Not a query engine for the data lake. It produces files and rows; Spark, Trino, DuckDB, or pandas consume them.
  • Not a replacement for the Cassandra query path for production traffic.

See GitHub Issues for the roadmap.