Tag: Apache Iceberg
All the articles with the tag "Apache Iceberg".
Partition and Organize Data for Performance
Published: at 03:00 PM
A table with 500 million rows takes 45 seconds to query. After partitioning it by date, the same query, filtering on a single day, returns in 2 seconds. Th...
Schema Evolution Without Breaking Consumers
Published: at 01:00 PM
A source team renames a column from `user_id` to `customer_id`. Twelve hours later, five dashboards show blank values, two ML pipelines fail, and the data en...
Data Modeling for the Lakehouse: What Changes
Published: at 12:00 PM
Traditional data modeling assumed you controlled the database. You defined schemas up front, enforced foreign keys at write time, and optimized with indexes....