Resource hub
Engineering
View all ->->What is columnar storage?
Al Brown • Last updated: May 8, 2026
Columnar storage is an on-disk data layout where each column is stored contiguously. It cuts I/O for analytical queries and enables 5–10× compression.
Why columnar databases are fast
Al Brown • Last updated: May 8, 2026
Columnar databases are 10-1000× faster than row stores on analytical queries because two principles compound: efficient execution (storage layout, vectorisation) and smart pruning (data skipping, late materialisation).
Columnar storage formats: Parquet, ORC, and Arrow explained
Al Brown • Last updated: May 8, 2026
How Apache Parquet, ORC, and Arrow lay out columnar data, and why they coexist instead of converging on one format.