Less Is More: Elevating RAG via Performance-Driven Context Compression

📰 ArXiv cs.AI

arXiv:2508.19282v4 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) has emerged as a promising paradigm for improving the timeliness of knowledge updates and the factual accuracy of large language models. However, incorporating a large volume of retrieved documents significantly increases input length, leading to prohibitive computational costs. Existing compression approaches often compromise task performance, primarily due to their reliance on predefined heuristics.

Published 29 May 2026

Read full paper → ← Back to Reads