📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours · View all reads

arXiv:2604.04804v1 Announce Type: cross Abstract: Learning from experience is critical for building capable large language model (LLM) agents, yet prevailing se

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago

Selecting Decision-Relevant Concepts in Reinforcement Learning

arXiv:2604.04808v1 Announce Type: cross Abstract: Training interpretable concept-based policies requires practitioners to manually select which human-understand

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

arXiv:2604.04815v1 Announce Type: cross Abstract: The rapid development of Large Language Models (LLMs) has transformed fake news detection and fact-checking ta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

arXiv:2604.04825v1 Announce Type: cross Abstract: Large language models achieve strong performance on many language tasks, yet it remains unclear whether they i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

InfBaGel: Human-Object-Scene Interaction Generation with Dynamic Perception and Iterative Refinement

arXiv:2604.04843v1 Announce Type: cross Abstract: Human-object-scene interactions (HOSI) generation has broad applications in embodied AI, simulation, and anima

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework

arXiv:2604.04852v1 Announce Type: cross Abstract: Chain-of-Thought (CoT) prompting has been used to enhance the reasoning capability of LLMs. However, its relia

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Noise Immunity in In-Context Tabular Learning: An Empirical Robustness Analysis of TabPFN's Attention Mechanisms

arXiv:2604.04868v1 Announce Type: cross Abstract: Tabular foundation models (TFMs) such as TabPFN (Tabular Prior-Data Fitted Network) are designed to generalize

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

DIRECT: Video Mashup Creation via Hierarchical Multi-Agent Planning and Intent-Guided Editing

arXiv:2604.04875v1 Announce Type: cross Abstract: Video mashup creation represents a complex video editing paradigm that recomposes existing footage to craft en

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago

Muon Dynamics as a Spectral Wasserstein Flow

arXiv:2604.04891v1 Announce Type: cross Abstract: Gradient normalization is central in deep-learning optimization because it stabilizes training and reduces sen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation

arXiv:2604.04894v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Agentic Federated Learning: The Future of Distributed Training Orchestration

arXiv:2604.04895v1 Announce Type: cross Abstract: Although Federated Learning (FL) promises privacy and distributed collaboration, its effectiveness in real-wor

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

arXiv:2604.04901v1 Announce Type: cross Abstract: Coworking AI agents operating within local file systems are rapidly emerging as a paradigm in human-AI interac

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

How AI Aggregation Affects Knowledge

arXiv:2604.04906v1 Announce Type: cross Abstract: Artificial intelligence (AI) changes social learning when aggregated outputs become training data for future p

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

Analyzing Symbolic Properties for DRL Agents in Systems and Networking

arXiv:2604.04914v1 Announce Type: cross Abstract: Deep reinforcement learning (DRL) has shown remarkable performance on complex control problems in systems and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Vero: An Open RL Recipe for General Visual Reasoning

arXiv:2604.04917v1 Announce Type: cross Abstract: What does it take to build a visual reasoner that works across charts, science, spatial understanding, and ope

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Your Pre-trained Diffusion Model Secretly Knows Restoration

arXiv:2604.04924v1 Announce Type: cross Abstract: Pre-trained diffusion models have enabled significant advancements in All-in-One Restoration (AiOR), offering

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Early Stopping for Large Reasoning Models via Confidence Dynamics

arXiv:2604.04930v1 Announce Type: cross Abstract: Large reasoning models rely on long chain-of-thought generation to solve complex problems, but extended reason

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

arXiv:2302.00797v4 Announce Type: replace Abstract: Opponent modeling methods typically involve two crucial steps: building a belief distribution over opponents

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

A Multi-Agent Reinforcement Learning Framework for Public Health Decision Analysis

arXiv:2311.00855v3 Announce Type: replace Abstract: Human immunodeficiency virus (HIV) is a major public health concern in the United States (U.S.), with about

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible

arXiv:2411.06498v2 Announce Type: replace Abstract: A recent paper (van Rooij et al. 2024) claims to have proved that achieving human-like intelligence using le

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3w ago

Representation learning to advance multi-institutional studies with electronic health record data from US and France

arXiv:2502.08547v2 Announce Type: replace Abstract: The widespread adoption of electronic health records has created new opportunities for translational clinica

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

arXiv:2502.13388v3 Announce Type: replace Abstract: StarCraft II is a complex and dynamic real-time strategy (RTS) game environment, which is very suitable for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

arXiv:2506.17585v3 Announce Type: replace Abstract: Trustworthy language models should provide both correct and verifiable answers. However, citations generated

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game

arXiv:2508.02900v2 Announce Type: replace Abstract: There is a broad consensus that the inability to form long-term plans is one of the key limitations of curre