This is a collection of notes, implementations and examples for Apache Iceberg table format.
- Data Modification Strategies - Deep dive into Copy-on-Write (CoW) vs Merge-on-Read (MoR) strategies, performance characteristics, and hybrid approaches
- GDPR Compliance - Implementation guide for "right to be forgotten" compliance using Apache Iceberg's ACID guarantees and retention policies
- Iceberg Query Lifecycle - Comprehensive walkthrough of table creation, data insertion, MERGE operations, and metadata inspection with practical SQL examples
copy_on_write.png- Visual representation of Copy-on-Write strategymerge_on_read.png- Visual representation of Merge-on-Read strategy
docker-compose.yml- Complete containerized environment setup
To run the notebooks and examples, you need Docker CLI and Docker Compose.
Start the environment:
docker-compose upThis provides a complete Spark environment with Iceberg tables using MinIO as the storage backend.