I Built a Columnar File Format in Pure Python — a tiny, readable Parquet

📰 Dev.to · Haji Rufai

I rebuilt the core ideas behind Parquet — encodings, compression, row groups, projection & predicate pushdown — in ~3,000 lines of pure Python. 91% smaller than CSV, 98% fewer bytes per query.

Published 8 Jun 2026
Read full article → ← Back to Reads