6,347 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (16647) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 📄 Paper 1w ago
NetAgentBench: A State-Centric Benchmark for Evaluating Agentic Network Configuration
arXiv:2604.09678v1 Announce Type: cross Abstract: As agentic network management gains popularity, there is a critical need for evaluation frameworks that transc
ArXiv cs.AI 📄 Paper 1w ago
Heterogeneous Consensus-Progressive Reasoning for Efficient Multi-Agent Debate
arXiv:2604.09679v1 Announce Type: cross Abstract: Multi-Agent Debate (MAD) is a collaborative framework in which multiple agents iteratively refine solutions th
ArXiv cs.AI 📄 Paper 1w ago
Decision-Theoretic Safety Assessment of Persona-Driven Multi-Agent Systems in O-RAN
arXiv:2604.09682v1 Announce Type: cross Abstract: Autonomous network management in Open Radio Access Networks requires intelligent decision making across confli
ArXiv cs.AI 📄 Paper 1w ago
Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models
arXiv:2604.09687v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) excel on many multimodal reasoning benchmarks, but these evaluations often do no
ArXiv cs.AI 📄 Paper 1w ago
Face Density as a Proxy for Data Complexity: Quantifying the Hardness of Instance Count
arXiv:2604.09689v1 Announce Type: cross Abstract: Machine learning progress has historically prioritized model-centric innovations, yet achievable performance i
ArXiv cs.AI 📄 Paper 1w ago
CAGE: Bridging the Accuracy-Aesthetics Gap in Educational Diagrams via Code-Anchored Generative Enhancement
arXiv:2604.09691v1 Announce Type: cross Abstract: Educational diagrams -- labeled illustrations of biological processes, chemical structures, physical systems,
ArXiv cs.AI 📄 Paper 1w ago
TaFall: Balance-Informed Fall Detection via Passive Thermal Sensing
arXiv:2604.09693v1 Announce Type: cross Abstract: Falls are a major cause of injury and mortality among older adults, yet most incidents occur in private indoor
ArXiv cs.AI 📄 Paper 1w ago
Assessing Privacy Preservation and Utility in Online Vision-Language Models
arXiv:2604.09695v1 Announce Type: cross Abstract: The increasing use of Online Vision Language Models (OVLMs) for processing images has introduced significant p
ArXiv cs.AI 📄 Paper 1w ago
I Can't Believe TTA Is Not Better: When Test-Time Augmentation Hurts Medical Image Classification
arXiv:2604.09697v1 Announce Type: cross Abstract: Test-time augmentation (TTA)--aggregating predictions over multiple augmented copies of a test input--is widel
ArXiv cs.AI 📄 Paper 1w ago
Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation
arXiv:2604.09698v1 Announce Type: cross Abstract: The growing ubiquity of Extended Reality (XR) is driving Conversational Recommendation Systems (CRS) toward vi
ArXiv cs.AI 📄 Paper 1w ago
Attention-Guided Flow-Matching for Sparse 3D Geological Generation
arXiv:2604.09700v1 Announce Type: cross Abstract: Constructing high-resolution 3D geological models from sparse 1D borehole and 2D surface data is a highly ill-
ArXiv cs.AI 📄 Paper 1w ago
Identity-Aware U-Net: Fine-grained Cell Segmentation via Identity-Aware Representation Learning
arXiv:2604.09702v1 Announce Type: cross Abstract: Precise segmentation of objects with highly similar shapes remains a challenging problem in dense prediction,
ArXiv cs.AI 📄 Paper 1w ago
The Deployment Gap in AI Media Detection: Platform-Aware and Visually Constrained Adversarial Evaluation
arXiv:2604.09706v1 Announce Type: cross Abstract: Recent AI media detectors report near-perfect performance under clean laboratory evaluation, yet their robustn
ArXiv cs.AI 📄 Paper 1w ago
Orthogonal Quadratic Complements for Vision Transformer Feed-Forward Networks
arXiv:2604.09709v1 Announce Type: cross Abstract: Recent bilinear feed-forward replacements for vision transformers can substantially improve accuracy, but they
ArXiv cs.AI 📄 Paper 1w ago
LAST: Leveraging Tools as Hints to Enhance Spatial Reasoning for Multimodal Large Language Models
arXiv:2604.09712v1 Announce Type: cross Abstract: Spatial reasoning is a cornerstone capability for intelligent systems to perceive and interact with the physic
ArXiv cs.AI 📄 Paper 1w ago
Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach
arXiv:2604.09716v1 Announce Type: cross Abstract: Deep visual recognition models are usually trained and evaluated using metrics such as loss and accuracy. Whil
ArXiv cs.AI 📄 Paper 1w ago
ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge--Cloud Speculative LLM Serving
arXiv:2604.09722v1 Announce Type: cross Abstract: Speculative decoding enables collaborative Large Language Model (LLM) inference across cloud and edge by separ
ArXiv cs.AI 📄 Paper 1w ago
LOLGORITHM: Funny Comment Generation Agent For Short Videos
arXiv:2604.09729v1 Announce Type: cross Abstract: Short-form video platforms have become central to multimedia information dissemination, where comments play a
ArXiv cs.AI 📄 Paper 1w ago
SMART: When is it Actually Worth Expanding a Speculative Tree?
arXiv:2604.09731v1 Announce Type: cross Abstract: Tree-based speculative decoding accelerates autoregressive generation by verifying a branching tree of draft t
ArXiv cs.AI 📄 Paper 1w ago
Multi-Frequency Local Plasticity for Visual Representation Learning
arXiv:2604.09734v1 Announce Type: cross Abstract: We study how far structured architectural bias can compensate for the absence of end-to-end gradient-based rep