📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (16647) ArXiv cs.AI Dev.to AI Dev.to · FORUM WEB Forbes Innovation Medium · Programming Medium · AI

Human-like Working Memory Interference in Large Language Models

arXiv:2604.09670v1 Announce Type: cross Abstract: Intelligent systems must maintain and manipulate task-relevant information online to adapt to dynamic environm

ArXiv cs.AI 📄 Paper 1w ago

Active Inference with a Self-Prior in the Mirror-Mark Task

arXiv:2604.09673v1 Announce Type: cross Abstract: The mirror self-recognition test evaluates whether a subject touches a mark on its own body that is visible on

ArXiv cs.AI 📄 Paper 1w ago

Real-Time Voicemail Detection in Telephony Audio Using Temporal Speech Activity Features

arXiv:2604.09675v1 Announce Type: cross Abstract: Outbound AI calling systems must distinguish voicemail greetings from live human answers in real time to avoid

ArXiv cs.AI 📄 Paper 1w ago

A Comparative Theoretical Analysis of Entropy Control Methods in Reinforcement Learning

arXiv:2604.09676v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become a key approach for enhancing reasoning in large language models (LLMs),

ArXiv cs.AI 📄 Paper 1w ago

NetAgentBench: A State-Centric Benchmark for Evaluating Agentic Network Configuration

arXiv:2604.09678v1 Announce Type: cross Abstract: As agentic network management gains popularity, there is a critical need for evaluation frameworks that transc

ArXiv cs.AI 📄 Paper 1w ago

Heterogeneous Consensus-Progressive Reasoning for Efficient Multi-Agent Debate

arXiv:2604.09679v1 Announce Type: cross Abstract: Multi-Agent Debate (MAD) is a collaborative framework in which multiple agents iteratively refine solutions th

ArXiv cs.AI 📄 Paper 1w ago

Decision-Theoretic Safety Assessment of Persona-Driven Multi-Agent Systems in O-RAN

arXiv:2604.09682v1 Announce Type: cross Abstract: Autonomous network management in Open Radio Access Networks requires intelligent decision making across confli

ArXiv cs.AI 📄 Paper 1w ago

Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models

arXiv:2604.09687v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) excel on many multimodal reasoning benchmarks, but these evaluations often do no

ArXiv cs.AI 📄 Paper 1w ago

Face Density as a Proxy for Data Complexity: Quantifying the Hardness of Instance Count

arXiv:2604.09689v1 Announce Type: cross Abstract: Machine learning progress has historically prioritized model-centric innovations, yet achievable performance i

ArXiv cs.AI 📄 Paper 1w ago

CAGE: Bridging the Accuracy-Aesthetics Gap in Educational Diagrams via Code-Anchored Generative Enhancement

arXiv:2604.09691v1 Announce Type: cross Abstract: Educational diagrams -- labeled illustrations of biological processes, chemical structures, physical systems,

ArXiv cs.AI 📄 Paper 1w ago

TaFall: Balance-Informed Fall Detection via Passive Thermal Sensing

arXiv:2604.09693v1 Announce Type: cross Abstract: Falls are a major cause of injury and mortality among older adults, yet most incidents occur in private indoor

ArXiv cs.AI 📄 Paper 1w ago

Assessing Privacy Preservation and Utility in Online Vision-Language Models

arXiv:2604.09695v1 Announce Type: cross Abstract: The increasing use of Online Vision Language Models (OVLMs) for processing images has introduced significant p

ArXiv cs.AI 📄 Paper 1w ago

I Can't Believe TTA Is Not Better: When Test-Time Augmentation Hurts Medical Image Classification

arXiv:2604.09697v1 Announce Type: cross Abstract: Test-time augmentation (TTA)--aggregating predictions over multiple augmented copies of a test input--is widel

ArXiv cs.AI 📄 Paper 1w ago

Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation

arXiv:2604.09698v1 Announce Type: cross Abstract: The growing ubiquity of Extended Reality (XR) is driving Conversational Recommendation Systems (CRS) toward vi

ArXiv cs.AI 📄 Paper 1w ago

Attention-Guided Flow-Matching for Sparse 3D Geological Generation

arXiv:2604.09700v1 Announce Type: cross Abstract: Constructing high-resolution 3D geological models from sparse 1D borehole and 2D surface data is a highly ill-

ArXiv cs.AI 📄 Paper 1w ago

Identity-Aware U-Net: Fine-grained Cell Segmentation via Identity-Aware Representation Learning

arXiv:2604.09702v1 Announce Type: cross Abstract: Precise segmentation of objects with highly similar shapes remains a challenging problem in dense prediction,

ArXiv cs.AI 📄 Paper 1w ago

The Deployment Gap in AI Media Detection: Platform-Aware and Visually Constrained Adversarial Evaluation

arXiv:2604.09706v1 Announce Type: cross Abstract: Recent AI media detectors report near-perfect performance under clean laboratory evaluation, yet their robustn

ArXiv cs.AI 📄 Paper 1w ago

Orthogonal Quadratic Complements for Vision Transformer Feed-Forward Networks

arXiv:2604.09709v1 Announce Type: cross Abstract: Recent bilinear feed-forward replacements for vision transformers can substantially improve accuracy, but they

ArXiv cs.AI 📄 Paper 1w ago

LAST: Leveraging Tools as Hints to Enhance Spatial Reasoning for Multimodal Large Language Models

arXiv:2604.09712v1 Announce Type: cross Abstract: Spatial reasoning is a cornerstone capability for intelligent systems to perceive and interact with the physic

ArXiv cs.AI 📄 Paper 1w ago

Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach

arXiv:2604.09716v1 Announce Type: cross Abstract: Deep visual recognition models are usually trained and evaluated using metrics such as loss and accuracy. Whil

ArXiv cs.AI 📄 Paper 1w ago

ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge--Cloud Speculative LLM Serving

arXiv:2604.09722v1 Announce Type: cross Abstract: Speculative decoding enables collaborative Large Language Model (LLM) inference across cloud and edge by separ

ArXiv cs.AI 📄 Paper 1w ago

LOLGORITHM: Funny Comment Generation Agent For Short Videos

arXiv:2604.09729v1 Announce Type: cross Abstract: Short-form video platforms have become central to multimedia information dissemination, where comments play a

ArXiv cs.AI 📄 Paper 1w ago

SMART: When is it Actually Worth Expanding a Speculative Tree?

arXiv:2604.09731v1 Announce Type: cross Abstract: Tree-based speculative decoding accelerates autoregressive generation by verifying a branching tree of draft t

ArXiv cs.AI 📄 Paper 1w ago

Multi-Frequency Local Plasticity for Visual Representation Learning

arXiv:2604.09734v1 Announce Type: cross Abstract: We study how far structured architectural bias can compensate for the absence of end-to-end gradient-based rep