📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 4,742 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (12131)
ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
Exploration and Exploitation Errors Are Measurable for Language Model Agents
arXiv:2604.13151v1 Announce Type: new Abstract: Language Model (LM) agents are increasingly used in complex open-ended decision-making tasks, from AI coding to
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
SciFi: A Safe, Lightweight, User-Friendly, and Fully Autonomous Agentic AI Workflow for Scientific Applications
arXiv:2604.13180v1 Announce Type: new Abstract: Recent advances in agentic AI have enabled increasingly autonomous workflows, but existing systems still face su
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
Numerical Instability and Chaos: Quantifying the Unpredictability of Large Language Models
arXiv:2604.13206v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly integrated into agentic workflows, their unpredictability stemm
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
Optimizing Earth Observation Satellite Schedules under Unknown Operational Constraints: An Active Constraint Acquisition Approach
arXiv:2604.13283v1 Announce Type: new Abstract: Earth Observation (EO) satellite scheduling (deciding which imaging tasks to perform and when) is a well-studied
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
WebXSkill: Skill Learning for Autonomous Web Agents
arXiv:2604.13318v1 Announce Type: new Abstract: Autonomous web agents powered by large language models (LLMs) have shown promise in completing complex browser t
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
Listening Alone, Understanding Together: Collaborative Context Recovery for Privacy-Aware AI
arXiv:2604.13348v1 Announce Type: new Abstract: We introduce CONCORD, a privacy-aware asynchronous assistant-to-assistant (A2A) framework that leverages collabo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
ReSS: Learning Reasoning Models for Tabular Data Prediction via Symbolic Scaffold
arXiv:2604.13392v1 Announce Type: new Abstract: Tabular data remains prevalent in high-stakes domains such as healthcare and finance, where predictive models ar
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
Quantifying and Understanding Uncertainty in Large Reasoning Models
arXiv:2604.13395v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) have recently demonstrated significant improvements in complex reasoning. While qu
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
Towards Scalable Lightweight GUI Agents via Multi-role Orchestration
arXiv:2604.13488v1 Announce Type: new Abstract: Autonomous Graphical User Interface (GUI) agents powered by Multimodal Large Language Models (MLLMs) enable digi
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
RiskWebWorld: A Realistic Interactive Benchmark for GUI Agents in E-commerce Risk Management
arXiv:2604.13531v1 Announce Type: new Abstract: Graphical User Interface (GUI) agents show strong capabilities for automating web tasks, but existing interactiv
ArXiv cs.AI
📄 Paper
4h ago
Weight Patching: Toward Source-Level Mechanistic Localization in LLMs
arXiv:2604.13694v1 Announce Type: new Abstract: Mechanistic interpretability seeks to localize model behavior to the internal components that causally realize i
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
Rethinking AI Hardware: A Three-Layer Cognitive Architecture for Autonomous Agents
arXiv:2604.13757v1 Announce Type: new Abstract: The next generation of autonomous AI systems will be constrained not only by model capability, but by how intell
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents
arXiv:2604.13759v1 Announce Type: new Abstract: Large language model (LLM) agents on multi-step tasks suffer reasoning degradation, looping, drift, stuck states
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
4h ago
AlphaCNOT: Learning CNOT Minimization with Model-Based Planning
arXiv:2604.13812v1 Announce Type: new Abstract: Quantum circuit optimization is a central task in Quantum Computing, as current Noisy Intermediate Scale Quantum
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis
arXiv:2604.13888v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) into Geographic Information Systems (GIS) marks a paradigm shift
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot
arXiv:2604.13940v1 Announce Type: new Abstract: Scientific peer review faces mounting strain as submission volumes surge, making it increasingly difficult to su
ArXiv cs.AI
📄 Paper
4h ago
[Emerging Ideas] Artificial Tripartite Intelligence: A Bio-Inspired, Sensor-First Architecture for Physical AI
arXiv:2604.13959v1 Announce Type: new Abstract: As AI moves from data centers to robots and wearables, scaling ever-larger models becomes insufficient. Physical
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
Reward Design for Physical Reasoning in Vision-Language Models
arXiv:2604.13993v1 Announce Type: new Abstract: Physical reasoning over visual inputs demands tight integration of visual perception, domain knowledge, and mult
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4h ago
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents
arXiv:2604.14004v1 Announce Type: new Abstract: Memory-based self-evolution has emerged as a promising paradigm for coding agents. However, existing approaches
ArXiv cs.AI
🎮 Reinforcement Learning
📄 Paper
⚡ AI Lesson
4h ago
Hierarchical Reinforcement Learning with Runtime Safety Shielding for Power Grid Operation
arXiv:2604.14032v1 Announce Type: new Abstract: Reinforcement learning has shown promise for automating power-grid operation tasks such as topology control and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration
arXiv:2604.14116v1 Announce Type: new Abstract: While Large Language Models (LLMs) have empowered AI research agents to perform isolated scientific tasks, autom
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation
arXiv:2604.11840v1 Announce Type: cross Abstract: Large language models are increasingly used as agents in social, economic, and policy simulations. A common as
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
4h ago
OVT-MLCS: An Online Visual Tool for MLCS Mining from Long or Big Sequences
arXiv:2604.13037v1 Announce Type: cross Abstract: Mining multiple longest common subsequences (\textit{MLCS}) from a set of sequences of three or more over a fi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4h ago
TableNet A Large-Scale Table Dataset with LLM-Powered Autonomous
arXiv:2604.13041v1 Announce Type: cross Abstract: Table Structure Recognition (TSR) requires the logical reasoning ability of large language models (LLMs) to ha
DeepCamp AI