📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,344 articles · Updated every 3 hours · View all reads

arXiv:2512.09427v3 Announce Type: replace-cross Abstract: Existing memory management techniques severely hinder efficient Large Language Model serving on accele

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Physics-driven human-like working memory outperforms digital networks in dynamic vision

arXiv:2512.15829v3 Announce Type: replace-cross Abstract: While the unsustainable energy cost of artificial intelligence necessitates physics-driven computing,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Deep Neural Networks as Discrete Dynamical Systems: Implications for Physics-Informed Learning

arXiv:2601.00473v2 Announce Type: replace-cross Abstract: We revisit the analogy between feed-forward deep neural networks (DNNs) and discrete dynamical systems

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

Understanding Pure Textual Reasoning for Blind Image Quality Assessment

arXiv:2601.02441v2 Announce Type: replace-cross Abstract: Textual reasoning has recently been widely adopted in Blind Image Quality Assessment (BIQA). However,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection

arXiv:2601.09195v2 Announce Type: replace-cross Abstract: Supervised fine-tuning (SFT) is a fundamental post-training strategy to align Large Language Models (L

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

arXiv:2601.10305v3 Announce Type: replace-cross Abstract: Vision-Language Pre-training (VLP) models have achieved remarkable success by leveraging large-scale i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation

arXiv:2601.11702v2 Announce Type: replace-cross Abstract: AI compliance is becoming increasingly critical as AI systems grow more powerful and pervasive. Yet th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

HalluJudge: A Reference-Free Hallucination Detection for Context Misalignment in Code Review Automation

arXiv:2601.19072v2 Announce Type: replace-cross Abstract: Large Language models (LLMs) have shown strong capabilities in code review automation, such as review

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making

arXiv:2602.02378v2 Announce Type: replace-cross Abstract: As LLMs expand from assistance to decision support, a dangerous pattern emerges: fluent agreement with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SPARE: Self-distillation for PARameter-Efficient Removal

arXiv:2602.07058v2 Announce Type: replace-cross Abstract: Machine Unlearning aims to remove the influence of specific data or concepts from trained models while

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

On Randomness in Agentic Evals

arXiv:2602.07150v3 Announce Type: replace-cross Abstract: Agentic systems are evaluated on benchmarks where agents interact with environments to solve tasks. Mo

ArXiv cs.AI 🏭 MLOps & LLMOps 📄 Paper ⚡ AI Lesson 1mo ago

KRONE: Hierarchical and Modular Log Anomaly Detection

arXiv:2602.07303v2 Announce Type: replace-cross Abstract: Log anomaly detection is crucial for uncovering system failures and security risks. Although logs orig

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

arXiv:2602.07906v4 Announce Type: replace-cross Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimiza

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model

arXiv:2602.12304v3 Announce Type: replace-cross Abstract: Existing mainstream video customization methods focus on generating identity-consistent videos based o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling

arXiv:2602.16485v2 Announce Type: replace-cross Abstract: Existing Multi-Agent Systems (MAS) typically rely on homogeneous model configurations, failing to expl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Smooth Gate Functions for Soft Advantage Policy Optimization

arXiv:2602.19345v2 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) has significantly advanced the training of large language mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies

arXiv:2602.23811v3 Announce Type: replace-cross Abstract: We investigate the theoretical aspects of offline reinforcement learning (RL) under general function a

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security

arXiv:2603.08566v2 Announce Type: replace-cross Abstract: DARPA's AI Cyber Challenge (AIxCC) showed that cyber reasoning systems (CRSs) can go beyond vulnerabil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings

arXiv:2603.09643v3 Announce Type: replace-cross Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Exploring Collatz Dynamics with Human-LLM Collaboration

arXiv:2603.11066v3 Announce Type: replace-cross Abstract: We develop a structural and quantitative framework for analyzing the Collatz map through modular dynam

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies

arXiv:2603.12510v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have significant potential to enable general-purpose robotic syste

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents

arXiv:2603.12564v3 Announce Type: replace-cross Abstract: Tool-augmented LLM agents increasingly serve as multi-turn advisors in high-stakes domains, yet their

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Geometry-Guided Camera Motion Understanding in VideoLLMs

arXiv:2603.13119v2 Announce Type: replace-cross Abstract: Camera motion is a fundamental geometric signal that shapes visual perception and cinematic style, yet

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1mo ago

Pixel-level Scene Understanding in One Token: Visual States Need What-is-Where Composition

arXiv:2603.13904v2 Announce Type: replace-cross Abstract: For robotic agents operating in dynamic environments, learning visual state representations from strea