5,060 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (13554) ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI 📄 Paper 4d ago
AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation
arXiv:2604.11674v1 Announce Type: cross Abstract: Simulation-based data generation has become a dominant paradigm for training robotic manipulation policies, ye
ArXiv cs.AI 📄 Paper 4d ago
Legal2LogicICL: Improving Generalization in Transforming Legal Cases to Logical Formulas via Diverse Few-Shot Learning
arXiv:2604.11699v1 Announce Type: cross Abstract: This work aims to improve the generalization of logic-based legal reasoning systems by integrating recent adva
ArXiv cs.AI 📄 Paper 4d ago
Fairness is Not Flat: Geometric Phase Transitions Against Shortcut Learning
arXiv:2604.11704v1 Announce Type: cross Abstract: Deep Neural Networks are highly susceptible to shortcut learning, frequently memorizing low-dimensional spurio
ArXiv cs.AI 📄 Paper 4d ago
On the Robustness of Watermarking for Autoregressive Image Generation
arXiv:2604.11720v1 Announce Type: cross Abstract: The proliferation of autoregressive (AR) image generators demands reliable detection and attribution of their
ArXiv cs.AI 📄 Paper 4d ago
Evaluating Cooperation in LLM Social Groups through Elected Leadership
arXiv:2604.11721v1 Announce Type: cross Abstract: Governing common-pool resources requires agents to develop enduring strategies through cooperation and self-go
ArXiv cs.AI 📄 Paper 4d ago
Endogenous Information in Routing Games: Memory-Constrained Equilibria, Recall Braess Paradoxes, and Memory Design
arXiv:2604.11733v1 Announce Type: cross Abstract: We study routing games in which travelers optimize over routes that are remembered or surfaced, rather than ov
ArXiv cs.AI 📄 Paper 4d ago
Multi-ORFT: Stable Online Reinforcement Fine-Tuning for Multi-Agent Diffusion Planning in Cooperative Driving
arXiv:2604.11734v1 Announce Type: cross Abstract: Closed-loop cooperative driving requires planners that generate realistic multimodal multi-agent trajectories
ArXiv cs.AI 📄 Paper 4d ago
Discourse Diversity in Multi-Turn Empathic Dialogue
arXiv:2604.11742v1 Announce Type: cross Abstract: Large language models (LLMs) produce responses rated as highly empathic in single-turn settings (Ayers et al.,
ArXiv cs.AI 📄 Paper 4d ago
Grounded World Model for Semantically Generalizable Planning
arXiv:2604.11751v1 Announce Type: cross Abstract: In Model Predictive Control (MPC), world models predict the future outcomes of various action proposals, which
ArXiv cs.AI 📄 Paper 4d ago
StarVLA-$\alpha$: Reducing Complexity in Vision-Language-Action Systems
arXiv:2604.11757v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models have recently emerged as a promising paradigm for building general-purpose
ArXiv cs.AI 📄 Paper 4d ago
Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation
arXiv:2604.11775v1 Announce Type: cross Abstract: Perturbation-based explainability methods such as KernelSHAP provide model-agnostic attributions but are typic
ArXiv cs.AI 📄 Paper 4d ago
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks
arXiv:2604.11778v1 Announce Type: cross Abstract: Contemporary large language models (LLMs) have demonstrated remarkable reasoning capabilities, particularly in
ArXiv cs.AI 📄 Paper 4d ago
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents
arXiv:2604.11784v1 Announce Type: cross Abstract: GUI agents drive applications through their visual interfaces instead of programmatic APIs, interacting with a
ArXiv cs.AI 📄 Paper 4d ago
ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection
arXiv:2604.11790v1 Announce Type: cross Abstract: Tool-augmented Large Language Model (LLM) agents have demonstrated impressive capabilities in automating compl
ArXiv cs.AI 📄 Paper 4d ago
A Mechanistic Analysis of Looped Reasoning Language Models
arXiv:2604.11791v1 Announce Type: cross Abstract: Reasoning has become a central capability in large language models. Recent research has shown that reasoning p
ArXiv cs.AI 📄 Paper 4d ago
C-ReD: A Comprehensive Chinese Benchmark for AI-Generated Text Detection Derived from Real-World Prompts
arXiv:2604.11796v1 Announce Type: cross Abstract: Recently, large language models (LLMs) are capable of generating highly fluent textual content. While they off
ArXiv cs.AI 📄 Paper 4d ago
Budget-Aware Uncertainty for Radiotherapy Segmentation QA Using nnU-Net
arXiv:2604.11798v1 Announce Type: cross Abstract: Accurate delineation of the Clinical Target Volume (CTV) is essential for radiotherapy planning, yet remains t
ArXiv cs.AI 📄 Paper 4d ago
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators
arXiv:2604.11805v1 Announce Type: cross Abstract: We have witnessed remarkable advances in LLM reasoning capabilities with the advent of DeepSeek-R1. However, m
ArXiv cs.AI 📄 Paper 4d ago
Physics-Informed State Space Models for Reliable Solar Irradiance Forecasting in Off-Grid Systems
arXiv:2604.11807v1 Announce Type: cross Abstract: The stable operation of autonomous off-grid photovoltaic systems dictates reliance on solar forecasting algori
ArXiv cs.AI 📄 Paper 4d ago
Can Large Language Models Infer Causal Relationships from Real-World Text?
arXiv:2505.18931v4 Announce Type: replace Abstract: Understanding and inferring causal relationships from texts is a core aspect of human cognition and is essen