📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 4,742 articles · Updated every 3 hours · View all reads

arXiv:2604.13151v1 Announce Type: new Abstract: Language Model (LM) agents are increasingly used in complex open-ended decision-making tasks, from AI coding to

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

SciFi: A Safe, Lightweight, User-Friendly, and Fully Autonomous Agentic AI Workflow for Scientific Applications

arXiv:2604.13180v1 Announce Type: new Abstract: Recent advances in agentic AI have enabled increasingly autonomous workflows, but existing systems still face su

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

Numerical Instability and Chaos: Quantifying the Unpredictability of Large Language Models

arXiv:2604.13206v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly integrated into agentic workflows, their unpredictability stemm

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

Optimizing Earth Observation Satellite Schedules under Unknown Operational Constraints: An Active Constraint Acquisition Approach

arXiv:2604.13283v1 Announce Type: new Abstract: Earth Observation (EO) satellite scheduling (deciding which imaging tasks to perform and when) is a well-studied

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

WebXSkill: Skill Learning for Autonomous Web Agents

arXiv:2604.13318v1 Announce Type: new Abstract: Autonomous web agents powered by large language models (LLMs) have shown promise in completing complex browser t

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

Listening Alone, Understanding Together: Collaborative Context Recovery for Privacy-Aware AI

arXiv:2604.13348v1 Announce Type: new Abstract: We introduce CONCORD, a privacy-aware asynchronous assistant-to-assistant (A2A) framework that leverages collabo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

ReSS: Learning Reasoning Models for Tabular Data Prediction via Symbolic Scaffold

arXiv:2604.13392v1 Announce Type: new Abstract: Tabular data remains prevalent in high-stakes domains such as healthcare and finance, where predictive models ar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

Quantifying and Understanding Uncertainty in Large Reasoning Models

arXiv:2604.13395v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) have recently demonstrated significant improvements in complex reasoning. While qu

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

Towards Scalable Lightweight GUI Agents via Multi-role Orchestration

arXiv:2604.13488v1 Announce Type: new Abstract: Autonomous Graphical User Interface (GUI) agents powered by Multimodal Large Language Models (MLLMs) enable digi

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

RiskWebWorld: A Realistic Interactive Benchmark for GUI Agents in E-commerce Risk Management

arXiv:2604.13531v1 Announce Type: new Abstract: Graphical User Interface (GUI) agents show strong capabilities for automating web tasks, but existing interactiv

ArXiv cs.AI 📄 Paper 4h ago

Weight Patching: Toward Source-Level Mechanistic Localization in LLMs

arXiv:2604.13694v1 Announce Type: new Abstract: Mechanistic interpretability seeks to localize model behavior to the internal components that causally realize i

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

Rethinking AI Hardware: A Three-Layer Cognitive Architecture for Autonomous Agents

arXiv:2604.13757v1 Announce Type: new Abstract: The next generation of autonomous AI systems will be constrained not only by model capability, but by how intell

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents

arXiv:2604.13759v1 Announce Type: new Abstract: Large language model (LLM) agents on multi-step tasks suffer reasoning degradation, looping, drift, stuck states

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4h ago

AlphaCNOT: Learning CNOT Minimization with Model-Based Planning

arXiv:2604.13812v1 Announce Type: new Abstract: Quantum circuit optimization is a central task in Quantum Computing, as current Noisy Intermediate Scale Quantum

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis

arXiv:2604.13888v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) into Geographic Information Systems (GIS) marks a paradigm shift

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot

arXiv:2604.13940v1 Announce Type: new Abstract: Scientific peer review faces mounting strain as submission volumes surge, making it increasingly difficult to su

ArXiv cs.AI 📄 Paper 4h ago

[Emerging Ideas] Artificial Tripartite Intelligence: A Bio-Inspired, Sensor-First Architecture for Physical AI

arXiv:2604.13959v1 Announce Type: new Abstract: As AI moves from data centers to robots and wearables, scaling ever-larger models becomes insufficient. Physical

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

Reward Design for Physical Reasoning in Vision-Language Models

arXiv:2604.13993v1 Announce Type: new Abstract: Physical reasoning over visual inputs demands tight integration of visual perception, domain knowledge, and mult

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4h ago

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

arXiv:2604.14004v1 Announce Type: new Abstract: Memory-based self-evolution has emerged as a promising paradigm for coding agents. However, existing approaches

ArXiv cs.AI 🎮 Reinforcement Learning 📄 Paper ⚡ AI Lesson 4h ago

Hierarchical Reinforcement Learning with Runtime Safety Shielding for Power Grid Operation

arXiv:2604.14032v1 Announce Type: new Abstract: Reinforcement learning has shown promise for automating power-grid operation tasks such as topology control and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration

arXiv:2604.14116v1 Announce Type: new Abstract: While Large Language Models (LLMs) have empowered AI research agents to perform isolated scientific tasks, autom

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation

arXiv:2604.11840v1 Announce Type: cross Abstract: Large language models are increasingly used as agents in social, economic, and policy simulations. A common as

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4h ago

OVT-MLCS: An Online Visual Tool for MLCS Mining from Long or Big Sequences

arXiv:2604.13037v1 Announce Type: cross Abstract: Mining multiple longest common subsequences (\textit{MLCS}) from a set of sequences of three or more over a fi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4h ago

TableNet A Large-Scale Table Dataset with LLM-Powered Autonomous

arXiv:2604.13041v1 Announce Type: cross Abstract: Table Structure Recognition (TSR) requires the logical reasoning ability of large language models (LLMs) to ha