📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours

Escaping the Context Bottleneck: Active Context Curation for LLM Agents via Reinforcement Learning

arXiv:2604.11462v1 Announce Type: new Abstract: Large Language Models (LLMs) struggle with long-horizon tasks due to the "context bottleneck" and the "lost-in-t

ArXiv cs.AI 📄 Paper 1w ago

Three Roles, One Model: Role Orchestration at Inference Time to Close the Performance Gap Between Small and Large Agents

arXiv:2604.11465v1 Announce Type: new Abstract: Large language model (LLM) agents show promise on realistic tool-use tasks, but deploying capable agents on mode

ArXiv cs.AI 📄 Paper 1w ago

From Attribution to Action: A Human-Centered Application of Activation Steering

arXiv:2604.11467v1 Announce Type: new Abstract: Explainable AI (XAI) methods reveal which features influence model predictions, yet provide limited means for pr

ArXiv cs.AI 📄 Paper 1w ago

OOM-RL: Out-of-Money Reinforcement Learning Market-Driven Alignment for LLM-Based Multi-Agent Systems

arXiv:2604.11477v1 Announce Type: new Abstract: The alignment of Multi-Agent Systems (MAS) for autonomous software engineering is constrained by evaluator epist

ArXiv cs.AI 📄 Paper 1w ago

On the Complexity of the Discussion-based Semantics in Abstraction Argumentation

arXiv:2604.11480v1 Announce Type: new Abstract: We show that deciding whether an argument a is stronger than an argument b with respect to the discussion-based

ArXiv cs.AI 📄 Paper 1w ago

Anthropogenic Regional Adaptation in Multimodal Vision-Language Model

arXiv:2604.11490v1 Announce Type: new Abstract: While the field of vision-language (VL) has achieved remarkable success in integrating visual and textual inform

ArXiv cs.AI 📄 Paper 1w ago

Lectures on AI for Mathematics

arXiv:2604.11504v1 Announce Type: new Abstract: This book provides a comprehensive and accessible introduction to the emerging field of AI for mathematics. It c

ArXiv cs.AI 📄 Paper 1w ago

PAC-BENCH: Evaluating Multi-Agent Collaboration under Privacy Constraints

arXiv:2604.11523v1 Announce Type: new Abstract: We are entering an era in which individuals and organizations increasingly deploy dedicated AI agents that inter

ArXiv cs.AI 📄 Paper 1w ago

Limited Perfect Monotonical Surrogates constructed using low-cost recursive linkage discovery with guaranteed output

arXiv:2604.11524v1 Announce Type: new Abstract: Surrogates provide a cheap solution evaluation and offer significant leverage for optimizing computationally exp

ArXiv cs.AI 📄 Paper 1w ago

Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems

arXiv:2604.11535v1 Announce Type: new Abstract: Solving an NP-hard optimization problem often requires reformulating it for a specific solver -- quantum hardwar

ArXiv cs.AI 📄 Paper 1w ago

A collaborative agent with two lightweight synergistic models for autonomous crystal materials research

arXiv:2604.11540v1 Announce Type: new Abstract: Current large language models require hundreds of billions of parameters yet struggle with domain-specific reaso

ArXiv cs.AI 📄 Paper 1w ago

SemaClaw: A Step Towards General-Purpose Personal AI Agents through Harness Engineering

arXiv:2604.11548v1 Announce Type: new Abstract: The rise of OpenClaw in early 2026 marks the moment when millions of users began deploying personal AI agents in

ArXiv cs.AI 📄 Paper 1w ago

UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents

arXiv:2604.11557v1 Announce Type: new Abstract: Tool-use capability is a fundamental component of LLM agents, enabling them to interact with external systems th

ArXiv cs.AI 📄 Paper 1w ago

Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models

arXiv:2604.11609v1 Announce Type: new Abstract: Large language models exhibit sycophantic tendencies--validating incorrect user beliefs to appear agreeable. We

ArXiv cs.AI 📄 Paper 1w ago

Context Kubernetes: Declarative Orchestration of Enterprise Knowledge for Agentic AI Systems

arXiv:2604.11623v1 Announce Type: new Abstract: We introduce Context Kubernetes, an architecture for orchestrating enterprise knowledge in agentic AI systems, w

ArXiv cs.AI 📄 Paper 1w ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

arXiv:2604.11626v1 Announce Type: new Abstract: Most reward models for visual generation reduce rich human judgments to a single unexplained score, discarding t

ArXiv cs.AI 📄 Paper 1w ago

Why Do Large Language Models Generate Harmful Content?

arXiv:2604.11663v1 Announce Type: new Abstract: Large Language Models (LLMs) have been shown to generate harmful content. However, the underlying causes of such

ArXiv cs.AI 📄 Paper 1w ago

DreamKG: A KG-Augmented Conversational System for People Experiencing Homelessness

arXiv:2604.11703v1 Announce Type: new Abstract: People experiencing homelessness (PEH) face substantial barriers to accessing timely, accurate information about

ArXiv cs.AI 📄 Paper 1w ago

Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems

arXiv:2604.11705v1 Announce Type: new Abstract: Foundation models, including large language models (LLMs), are increasingly used for human-in-the-loop (HITL) cy

ArXiv cs.AI 📄 Paper 1w ago

A Mamba-Based Multimodal Network for Multiscale Blast-Induced Rapid Structural Damage Assessment

arXiv:2604.11709v1 Announce Type: new Abstract: Accurate and rapid structural damage assessment (SDA) is crucial for post-disaster management, helping responder

ArXiv cs.AI 📄 Paper 1w ago

SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context

arXiv:2604.11716v1 Announce Type: new Abstract: Prior representative ReAct-style approaches in autonomous Software Engineering (SWE) typically lack the explicit

ArXiv cs.AI 📄 Paper 1w ago

Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games

arXiv:2604.11741v1 Announce Type: new Abstract: Vision-language models (VLMs) have shown impressive capabilities in perceptual tasks, yet they degrade in comple

ArXiv cs.AI 📄 Paper 1w ago

Retrieval Is Not Enough: Why Organizational AI Needs Epistemic Infrastructure

arXiv:2604.11759v1 Announce Type: new Abstract: Organizational knowledge used by AI agents typically lacks epistemic structure: retrieval systems surface semant

ArXiv cs.AI 📄 Paper 1w ago

GenTac: Generative Modeling and Forecasting of Soccer Tactics

arXiv:2604.11786v1 Announce Type: new Abstract: Modeling open-play soccer tactics is a formidable challenge due to the stochastic, multi-agent nature of the gam