Writing to Apache Iceberg from ClickHouse
Mark Needham
This video explores ClickHouse's experimental support for Apache Iceberg writes, introduced in version 25.7. We'll walk you through the process of creating, querying, and writing to Iceberg tables using both PySpark and ClickHouse. This practical demonstration showcases the seamless integration between these powerful data technologies.
Key points covered:
- Setting up an Iceberg table with PySpark and importing New York taxi data
- Querying Iceberg tables in ClickHouse using the iceberg_local engine
- Performing inserts from ClickHouse into Iceberg tables
- Exploring Iceberg's snapshot history and time travel queries
- Demonstrating the interoperability between PySpark and ClickHouse when working with Iceberg

Open House
Scaling ClickHouse to petabytes of logs at OpenAI

Open House
How ClickHouse helps Anthropic scale observability

Open House, User stories
How Capital One Slingshot cut infrastructure costs by 50%
Engineering leaders at Capital One Slingshot share how they cut infrastructure costs by 50% and reduced average dashboard load time from 5+ to under 500ms with ClickHouse Cloud.