📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (16420)
ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI
📄 Paper
1w ago
Escaping the Context Bottleneck: Active Context Curation for LLM Agents via Reinforcement Learning
arXiv:2604.11462v1 Announce Type: new Abstract: Large Language Models (LLMs) struggle with long-horizon tasks due to the "context bottleneck" and the "lost-in-t
ArXiv cs.AI
📄 Paper
1w ago
Three Roles, One Model: Role Orchestration at Inference Time to Close the Performance Gap Between Small and Large Agents
arXiv:2604.11465v1 Announce Type: new Abstract: Large language model (LLM) agents show promise on realistic tool-use tasks, but deploying capable agents on mode
ArXiv cs.AI
📄 Paper
1w ago
From Attribution to Action: A Human-Centered Application of Activation Steering
arXiv:2604.11467v1 Announce Type: new Abstract: Explainable AI (XAI) methods reveal which features influence model predictions, yet provide limited means for pr
ArXiv cs.AI
📄 Paper
1w ago
OOM-RL: Out-of-Money Reinforcement Learning Market-Driven Alignment for LLM-Based Multi-Agent Systems
arXiv:2604.11477v1 Announce Type: new Abstract: The alignment of Multi-Agent Systems (MAS) for autonomous software engineering is constrained by evaluator epist
ArXiv cs.AI
📄 Paper
1w ago
On the Complexity of the Discussion-based Semantics in Abstraction Argumentation
arXiv:2604.11480v1 Announce Type: new Abstract: We show that deciding whether an argument a is stronger than an argument b with respect to the discussion-based
ArXiv cs.AI
📄 Paper
1w ago
Anthropogenic Regional Adaptation in Multimodal Vision-Language Model
arXiv:2604.11490v1 Announce Type: new Abstract: While the field of vision-language (VL) has achieved remarkable success in integrating visual and textual inform
ArXiv cs.AI
📄 Paper
1w ago
Lectures on AI for Mathematics
arXiv:2604.11504v1 Announce Type: new Abstract: This book provides a comprehensive and accessible introduction to the emerging field of AI for mathematics. It c
ArXiv cs.AI
📄 Paper
1w ago
PAC-BENCH: Evaluating Multi-Agent Collaboration under Privacy Constraints
arXiv:2604.11523v1 Announce Type: new Abstract: We are entering an era in which individuals and organizations increasingly deploy dedicated AI agents that inter
ArXiv cs.AI
📄 Paper
1w ago
Limited Perfect Monotonical Surrogates constructed using low-cost recursive linkage discovery with guaranteed output
arXiv:2604.11524v1 Announce Type: new Abstract: Surrogates provide a cheap solution evaluation and offer significant leverage for optimizing computationally exp
ArXiv cs.AI
📄 Paper
1w ago
Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems
arXiv:2604.11535v1 Announce Type: new Abstract: Solving an NP-hard optimization problem often requires reformulating it for a specific solver -- quantum hardwar
ArXiv cs.AI
📄 Paper
1w ago
A collaborative agent with two lightweight synergistic models for autonomous crystal materials research
arXiv:2604.11540v1 Announce Type: new Abstract: Current large language models require hundreds of billions of parameters yet struggle with domain-specific reaso
ArXiv cs.AI
📄 Paper
1w ago
SemaClaw: A Step Towards General-Purpose Personal AI Agents through Harness Engineering
arXiv:2604.11548v1 Announce Type: new Abstract: The rise of OpenClaw in early 2026 marks the moment when millions of users began deploying personal AI agents in
ArXiv cs.AI
📄 Paper
1w ago
UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
arXiv:2604.11557v1 Announce Type: new Abstract: Tool-use capability is a fundamental component of LLM agents, enabling them to interact with external systems th
ArXiv cs.AI
📄 Paper
1w ago
Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models
arXiv:2604.11609v1 Announce Type: new Abstract: Large language models exhibit sycophantic tendencies--validating incorrect user beliefs to appear agreeable. We
ArXiv cs.AI
📄 Paper
1w ago
Context Kubernetes: Declarative Orchestration of Enterprise Knowledge for Agentic AI Systems
arXiv:2604.11623v1 Announce Type: new Abstract: We introduce Context Kubernetes, an architecture for orchestrating enterprise knowledge in agentic AI systems, w
ArXiv cs.AI
📄 Paper
1w ago
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time
arXiv:2604.11626v1 Announce Type: new Abstract: Most reward models for visual generation reduce rich human judgments to a single unexplained score, discarding t
ArXiv cs.AI
📄 Paper
1w ago
Why Do Large Language Models Generate Harmful Content?
arXiv:2604.11663v1 Announce Type: new Abstract: Large Language Models (LLMs) have been shown to generate harmful content. However, the underlying causes of such
ArXiv cs.AI
📄 Paper
1w ago
DreamKG: A KG-Augmented Conversational System for People Experiencing Homelessness
arXiv:2604.11703v1 Announce Type: new Abstract: People experiencing homelessness (PEH) face substantial barriers to accessing timely, accurate information about
ArXiv cs.AI
📄 Paper
1w ago
Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems
arXiv:2604.11705v1 Announce Type: new Abstract: Foundation models, including large language models (LLMs), are increasingly used for human-in-the-loop (HITL) cy
ArXiv cs.AI
📄 Paper
1w ago
A Mamba-Based Multimodal Network for Multiscale Blast-Induced Rapid Structural Damage Assessment
arXiv:2604.11709v1 Announce Type: new Abstract: Accurate and rapid structural damage assessment (SDA) is crucial for post-disaster management, helping responder
ArXiv cs.AI
📄 Paper
1w ago
SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context
arXiv:2604.11716v1 Announce Type: new Abstract: Prior representative ReAct-style approaches in autonomous Software Engineering (SWE) typically lack the explicit
ArXiv cs.AI
📄 Paper
1w ago
Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games
arXiv:2604.11741v1 Announce Type: new Abstract: Vision-language models (VLMs) have shown impressive capabilities in perceptual tasks, yet they degrade in comple
ArXiv cs.AI
📄 Paper
1w ago
Retrieval Is Not Enough: Why Organizational AI Needs Epistemic Infrastructure
arXiv:2604.11759v1 Announce Type: new Abstract: Organizational knowledge used by AI agents typically lacks epistemic structure: retrieval systems surface semant
ArXiv cs.AI
📄 Paper
1w ago
GenTac: Generative Modeling and Forecasting of Soccer Tactics
arXiv:2604.11786v1 Announce Type: new Abstract: Modeling open-play soccer tactics is a formidable challenge due to the stochastic, multi-agent nature of the gam
DeepCamp AI