Data lakehouses (in under 3 minutes)
Mark Needham
What is a data lakehouse, and why is everyone talking about it? This quick explainer covers everything you need to know.
We'll start by discussing data lakes and their common challenges—schema evolution, data integrity, query performance, data discovery, and access control—and then show you how the four-layer lakehouse architecture solves these problems.
You'll learn about open table formats like Iceberg, Delta Lake, and Hudi, plus how data catalogs like AWS Glue and Unity Catalog complete the picture. By the end, you'll understand why lakehouses are becoming the go-to solution for modern data architecture.

Scaling ClickHouse to petabytes of logs at OpenAI

How ClickHouse helps Anthropic scale observability

How Capital One Slingshot cut infrastructure costs by 50%
Engineering leaders at Capital One Slingshot share how they cut infrastructure costs by 50% and reduced average dashboard load time from 5+ seconds to under 500 ms with ClickHouse Cloud.