CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling

📰 ArXiv cs.AI

arXiv:2602.01766v2 Announce Type: replace-cross Abstract: The quadratic complexity and indefinitely growing key-value (KV) cache of standard Transformers pose a major barrier to long-context processing. To overcome this, we introduce the Collaborative Memory Transformer (CoMeT), a novel architecture that enables LLMs to handle arbitrarily long sequences with constant memory usage and linear time complexity. Designed as an efficient, plug-in module, CoMeT can be integrated into pre-trained models

Published 20 Apr 2026

Read full paper → ← Back to Reads