Resource hub


Engineering

  • What is columnar storage?

    Al Brown • Last updated: May 8, 2026

    Columnar storage is an on-disk data layout where each column is stored contiguously. It cuts I/O for analytical queries, which read only the columns they reference, and enables 5–10× compression because similar values sit next to each other.

  • Why columnar databases are fast

    Al Brown • Last updated: May 8, 2026

    Columnar databases are 10–1000× faster than row stores on analytical queries because two principles compound: efficient execution (storage layout, vectorisation) and smart pruning (data skipping, late materialisation).

  • How Apache Parquet, ORC, and Arrow lay out columnar data, and why they coexist instead of converging on one format.