📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (13937)
ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationOpenAI NewsMedium · Programming
ArXiv cs.AI
📄 Paper
6d ago
Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demonstrations
arXiv:2509.24250v3 Announce Type: replace Abstract: Teaching systems physical tasks is a long standing goal in HCI, yet most prior work has focused on non colla
ArXiv cs.AI
📄 Paper
6d ago
Chain-in-Tree: Back to Sequential Reasoning in LLM Tree Search
arXiv:2509.25835v4 Announce Type: replace Abstract: Test-time scaling improves large language models (LLMs) on long-horizon reasoning tasks by allocating more c
ArXiv cs.AI
📄 Paper
6d ago
When Identity Skews Debate: Anonymization for Bias-Reduced Multi-Agent Reasoning
arXiv:2510.07517v5 Announce Type: replace Abstract: Multi-agent debate (MAD) aims to improve large language model (LLM) reasoning by letting multiple agents exc
ArXiv cs.AI
📄 Paper
6d ago
Thermally Activated Dual-Modal Adversarial Clothing against AI Surveillance Systems
arXiv:2511.09829v3 Announce Type: replace Abstract: Adversarial patches have emerged as a popular privacy-preserving approach for resisting AI-driven surveillan
ArXiv cs.AI
📄 Paper
6d ago
Sample-Efficient Neurosymbolic Deep Reinforcement Learning
arXiv:2601.02850v2 Announce Type: replace Abstract: Reinforcement Learning (RL) is a well-established framework for sequential decision-making in complex enviro
ArXiv cs.AI
📄 Paper
6d ago
Precomputing Multi-Agent Path Replanning using Temporal Flexibility
arXiv:2601.04884v2 Announce Type: replace Abstract: Executing a multi-agent plan can be challenging when an agent is delayed, because this typically creates con
ArXiv cs.AI
📄 Paper
6d ago
Reasoning Models Will Sometimes Lie About Their Reasoning
arXiv:2601.07663v3 Announce Type: replace Abstract: Hint-based faithfulness evaluations have established that Large Reasoning Models (LRMs) may not say what the
ArXiv cs.AI
📄 Paper
6d ago
The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?
arXiv:2601.23045v2 Announce Type: replace Abstract: As AI becomes more capable, we entrust it with more general and consequential tasks. The risks from failure
ArXiv cs.AI
📄 Paper
6d ago
Reasoning in a Combinatorial and Constrained World: Benchmarking LLMs on Natural-Language Combinatorial Optimization
arXiv:2602.02188v2 Announce Type: replace Abstract: While large language models (LLMs) have shown strong performance in math and logic reasoning, their ability
ArXiv cs.AI
📄 Paper
6d ago
H-AdminSim: A Multi-Agent Simulator for Realistic Hospital Administrative Workflows with FHIR Integration
arXiv:2602.05407v2 Announce Type: replace Abstract: Hospital administration departments handle a wide range of operational tasks and, in large hospitals, proces
ArXiv cs.AI
📄 Paper
6d ago
ReplicatorBench: Benchmarking LLM Agents for Replicability in Social and Behavioral Sciences
arXiv:2602.11354v2 Announce Type: replace Abstract: The literature has witnessed an emerging interest in AI agents for automated assessment of scientific papers
ArXiv cs.AI
📄 Paper
6d ago
PACED: Distillation and On-Policy Self-Distillation at the Frontier of Student Competence
arXiv:2603.11178v3 Announce Type: replace Abstract: Standard LLM distillation treats all training problems equally -- wasting compute on problems the student ha
ArXiv cs.AI
📄 Paper
6d ago
Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces
arXiv:2603.21692v2 Announce Type: replace Abstract: As AI agents transition from human-supervised copilots to autonomous platform infrastructure, the ability to
ArXiv cs.AI
📄 Paper
6d ago
TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning
arXiv:2604.02183v2 Announce Type: replace Abstract: Multimodal recommendation systems (MRS) jointly model user-item interaction graphs and rich item content, bu
ArXiv cs.AI
📄 Paper
6d ago
Towards Knowledgeable Deep Research: Framework and Benchmark
arXiv:2604.07720v2 Announce Type: replace Abstract: Deep Research (DR) requires LLM agents to autonomously perform multi-step information seeking, processing, a
ArXiv cs.AI
📄 Paper
6d ago
Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution
arXiv:2604.07725v2 Announce Type: replace Abstract: We show that verifier-free evolution is bottlenecked by both diversity and efficiency: without external corr
ArXiv cs.AI
📄 Paper
6d ago
EigentSearch-Q+: Enhancing Deep Research Agents with Structured Reasoning Tools
arXiv:2604.07927v2 Announce Type: replace Abstract: Deep research requires reasoning over web evidence to answer open-ended questions, and it is a core capabili
ArXiv cs.AI
📄 Paper
6d ago
MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems
arXiv:2604.07956v2 Announce Type: replace Abstract: Industry classification schemes are integral parts of public and corporate databases as they classify busine
ArXiv cs.AI
📄 Paper
6d ago
ASPECT:Analogical Semantic Policy Execution via Language Conditioned Transfer
arXiv:2604.08355v2 Announce Type: replace Abstract: Reinforcement Learning (RL) agents often struggle to generalize knowledge to new tasks, even those structura
ArXiv cs.AI
📄 Paper
6d ago
Task-Distributionally Robust Data-Free Meta-Learning
arXiv:2311.14756v2 Announce Type: replace-cross Abstract: Data-Free Meta-Learning (DFML) aims to enable efficient learning of unseen few-shot tasks, by meta-lea
ArXiv cs.AI
📄 Paper
6d ago
Temporal Transfer Learning for Traffic Optimization with Coarse-grained Advisory Autonomy
arXiv:2312.09436v3 Announce Type: replace-cross Abstract: The recent development of connected and automated vehicle (CAV) technologies has spurred investigation
ArXiv cs.AI
📄 Paper
6d ago
Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning
arXiv:2404.10976v4 Announce Type: replace-cross Abstract: Cooperative Multi-Agent Reinforcement Learning (MARL) necessitates seamless collaboration among agents
ArXiv cs.AI
📄 Paper
6d ago
Detection and Characterization of Coordinated Online Behavior: A Survey
arXiv:2408.01257v2 Announce Type: replace-cross Abstract: Coordination is a fundamental aspect of life. The advent of social media has made it integral also to
ArXiv cs.AI
📄 Paper
6d ago
Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture
arXiv:2410.08559v5 Announce Type: replace-cross Abstract: Electrocardiogram (ECG) captures the heart's electrical signals, offering valuable information for dia
DeepCamp AI