This Simple Folder Beats Vector Databases. Here's Why.

The AI How · Beginner ·🔍 RAG & Vector Search ·5d ago

About this lesson

For two years, the entire AI industry agreed on how to give a model memory - chop documents into chunks, turn them into vectors, store them in a database. Then Andrej Karpathy posted a single file to GitHub and said: just use a folder of text files. It worked better. Now Google has turned that folder into an official standard. This is the clearest breakdown of the Open Knowledge Format (OKF), why it beats RAG, and the three catches nobody is talking about. ⏱ CHAPTERS 0:00 The industry just changed its mind 0:29 Why RAG existed in the first place 0:49 How RAG works — and why it never remembers 1:21 Karpathy's LLM Wiki (the idea that started it) 1:45 How the wiki works — AI writes it, not you 2:16 Google's Open Knowledge Format spec 2:48 The forgiven spec — damage control with a nicer name 3:16 Three reasons the folder beats RAG 4:11 Two quick mixups (MCP vs OKF, SEO myths) 4:29 Catch 1 — who actually tends the garden? 4:54 Catch 2 — AI breaks its own format 5:16 Catch 3 — the type field is free text 5:38 The invisible moat (this part is underrated) 6:23 Will OKF stick, or is it the next Google graveyard? 🔗 LINKS ► Andrej Karpathy's LLM Wiki gist (April 4, 2026): https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f ► OKF Spec (GitHub): https://github.com/GoogleCloudPlatform/knowledge-catalog/blob/main/okf/SPEC.md ► OKF Repo (GoogleCloudPlatform/knowledge-catalog): https://github.com/GoogleCloudPlatform/knowledge-catalog/tree/main/okf ► Google Cloud blog — OKF announcement: https://cloud.google.com/blog/products/data-analytics/how-the-open-knowledge-format-can-improve-data-sharing 📬 NEWSLETTER Weekly breakdowns for AI builders — theaihow.com 🔔 Subscribe for takes like this every week. --- #OKF #OpenKnowledgeFormat #AIEngineering #LLMWiki #Karpathy #RAG #VectorDatabase #AIAgents #GoogleCloud #ContextEngineering #AIBuilder #LLM #AIMemory #AIArchitecture

Original Description

For two years, the entire AI industry agreed on how to give a model memory - chop documents into chunks, turn them into vectors, store them in a database. Then Andrej Karpathy posted a single file to GitHub and said: just use a folder of text files. It worked better. Now Google has turned that folder into an official standard. This is the clearest breakdown of the Open Knowledge Format (OKF), why it beats RAG, and the three catches nobody is talking about. ⏱ CHAPTERS 0:00 The industry just changed its mind 0:29 Why RAG existed in the first place 0:49 How RAG works — and why it never remembers 1:21 Karpathy's LLM Wiki (the idea that started it) 1:45 How the wiki works — AI writes it, not you 2:16 Google's Open Knowledge Format spec 2:48 The forgiven spec — damage control with a nicer name 3:16 Three reasons the folder beats RAG 4:11 Two quick mixups (MCP vs OKF, SEO myths) 4:29 Catch 1 — who actually tends the garden? 4:54 Catch 2 — AI breaks its own format 5:16 Catch 3 — the type field is free text 5:38 The invisible moat (this part is underrated) 6:23 Will OKF stick, or is it the next Google graveyard? 🔗 LINKS ► Andrej Karpathy's LLM Wiki gist (April 4, 2026): https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f ► OKF Spec (GitHub): https://github.com/GoogleCloudPlatform/knowledge-catalog/blob/main/okf/SPEC.md ► OKF Repo (GoogleCloudPlatform/knowledge-catalog): https://github.com/GoogleCloudPlatform/knowledge-catalog/tree/main/okf ► Google Cloud blog — OKF announcement: https://cloud.google.com/blog/products/data-analytics/how-the-open-knowledge-format-can-improve-data-sharing 📬 NEWSLETTER Weekly breakdowns for AI builders — theaihow.com 🔔 Subscribe for takes like this every week. --- #OKF #OpenKnowledgeFormat #AIEngineering #LLMWiki #Karpathy #RAG #VectorDatabase #AIAgents #GoogleCloud #ContextEngineering #AIBuilder #LLM #AIMemory #AIArchitecture
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Why you shouldn’t search your documents directly with AI
Learn why directly searching documents with AI can be inefficient and how retrieval-augmented systems can improve the process
Medium · Programming
Your AI Keeps Making Things Up. RAG Is How You Make It Use Real Facts Instead.
Learn how to use RAG to make your AI provide accurate answers based on real facts instead of making things up
Medium · RAG
Evaluation Metrics for RAG: Measure Retrieval, Generation, and End-to-End Quality With Numbers That…
Learn to evaluate RAG models using metrics that measure retrieval, generation, and end-to-end quality
Medium · AI
Evaluation Metrics for RAG: Measure Retrieval, Generation, and End-to-End Quality With Numbers That…
Learn to evaluate RAG models using metrics that measure retrieval, generation, and end-to-end quality
Medium · Data Science

Chapters (14)

The industry just changed its mind
0:29 Why RAG existed in the first place
0:49 How RAG works — and why it never remembers
1:21 Karpathy's LLM Wiki (the idea that started it)
1:45 How the wiki works — AI writes it, not you
2:16 Google's Open Knowledge Format spec
2:48 The forgiven spec — damage control with a nicer name
3:16 Three reasons the folder beats RAG
4:11 Two quick mixups (MCP vs OKF, SEO myths)
4:29 Catch 1 — who actually tends the garden?
4:54 Catch 2 — AI breaks its own format
5:16 Catch 3 — the type field is free text
5:38 The invisible moat (this part is underrated)
6:23 Will OKF stick, or is it the next Google graveyard?
Up next
RRF vs DBSF with Qdrant: Hybrid Retrieval Fusion for RAG in Python
Professor Py: AI Engineering
Watch →