Skip to content

jemusni07/apache-iceberg-notes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Apache Iceberg Handbook

This is a collection of notes, implementations and examples for Apache Iceberg table format.

Repository Contents

📚 Learning Resources (/learn/)

  • Data Modification Strategies - Deep dive into Copy-on-Write (CoW) vs Merge-on-Read (MoR) strategies, performance characteristics, and hybrid approaches
  • GDPR Compliance - Implementation guide for "right to be forgotten" compliance using Apache Iceberg's ACID guarantees and retention policies

📓 Interactive Notebooks (/notebooks/)

  • Iceberg Query Lifecycle - Comprehensive walkthrough of table creation, data insertion, MERGE operations, and metadata inspection with practical SQL examples

🖼️ Visual Resources (/images/)

  • copy_on_write.png - Visual representation of Copy-on-Write strategy
  • merge_on_read.png - Visual representation of Merge-on-Read strategy

🐳 Docker Environment

  • docker-compose.yml - Complete containerized environment setup

Getting Started

To run the notebooks and examples, you need Docker CLI and Docker Compose.

Start the environment:

docker-compose up

This provides a complete Spark environment with Iceberg tables using MinIO as the storage backend.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors