Knowledge Capsules: Structured Nonparametric Memory Units for LLMs

📰 ArXiv cs.AI

arXiv:2604.20487v2 (announce type: cross)

Abstract: Large language models (LLMs) encode knowledge in parametric weights, which makes that knowledge costly to update or extend without retraining. Retrieval-augmented generation (RAG) mitigates this limitation by appending retrieved text to the input, but it operates purely through context expansion, where external knowledge competes as tokens within the attention mechanism. As a result, its influence is indirect and often unstable, particularly in long-context and mult

Published 23 Apr 2026