📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 1,754 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
SPARE: Self-distillation for PARameter-Efficient Removal
arXiv:2602.07058v2 Announce Type: replace-cross Abstract: Machine Unlearning aims to remove the influence of specific data or concepts from trained models while
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
On Randomness in Agentic Evals
arXiv:2602.07150v3 Announce Type: replace-cross Abstract: Agentic systems are evaluated on benchmarks where agents interact with environments to solve tasks. Mo
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
KRONE: Hierarchical and Modular Log Anomaly Detection
arXiv:2602.07303v2 Announce Type: replace-cross Abstract: Log anomaly detection is crucial for uncovering system failures and security risks. Although logs orig
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering
arXiv:2602.07906v4 Announce Type: replace-cross Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimiza
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model
arXiv:2602.12304v3 Announce Type: replace-cross Abstract: Existing mainstream video customization methods focus on generating identity-consistent videos based o
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
arXiv:2602.16485v2 Announce Type: replace-cross Abstract: Existing Multi-Agent Systems (MAS) typically rely on homogeneous model configurations, failing to expl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Smooth Gate Functions for Soft Advantage Policy Optimization
arXiv:2602.19345v2 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) has significantly advanced the training of large language mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies
arXiv:2602.23811v3 Announce Type: replace-cross Abstract: We investigate the theoretical aspects of offline reinforcement learning (RL) under general function a
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security
arXiv:2603.08566v2 Announce Type: replace-cross Abstract: DARPA's AI Cyber Challenge (AIxCC) showed that cyber reasoning systems (CRSs) can go beyond vulnerabil
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings
arXiv:2603.09643v3 Announce Type: replace-cross Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Exploring Collatz Dynamics with Human-LLM Collaboration
arXiv:2603.11066v3 Announce Type: replace-cross Abstract: We develop a structural and quantitative framework for analyzing the Collatz map through modular dynam
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies
arXiv:2603.12510v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have significant potential to enable general-purpose robotic syste
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents
arXiv:2603.12564v3 Announce Type: replace-cross Abstract: Tool-augmented LLM agents increasingly serve as multi-turn advisors in high-stakes domains, yet their
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Geometry-Guided Camera Motion Understanding in VideoLLMs
arXiv:2603.13119v2 Announce Type: replace-cross Abstract: Camera motion is a fundamental geometric signal that shapes visual perception and cinematic style, yet
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
6d ago
Pixel-level Scene Understanding in One Token: Visual States Need What-is-Where Composition
arXiv:2603.13904v2 Announce Type: replace-cross Abstract: For robotic agents operating in dynamic environments, learning visual state representations from strea
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
FedPBS: Proximal-Balanced Scaling Federated Learning Model for Robust Personalized Training for Non-IID Data
arXiv:2603.13909v2 Announce Type: replace-cross Abstract: Federated learning (FL) enables a set of distributed clients to jointly train machine learning models
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning
arXiv:2603.14867v2 Announce Type: replace-cross Abstract: Many strategic decision-making problems, such as environment design for warehouse robots, can be natur
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models
arXiv:2603.15970v3 Announce Type: replace-cross Abstract: Several data warehouse and database providers have recently introduced extensions to SQL called AI Que
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval
arXiv:2603.17872v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have achieved unprecedented fluency but remain susceptible to "hallucinat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Evolutionarily Stable Stackelberg Equilibrium
arXiv:2603.18385v2 Announce Type: replace-cross Abstract: We present a new solution concept called evolutionarily stable Stackelberg equilibrium (SESS). We stud
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Ontology-Guided Diffusion for Zero-Shot Visual Sim2Real Transfer
arXiv:2603.18719v2 Announce Type: replace-cross Abstract: Bridging the simulation-to-reality (sim2real) gap remains challenging as labelled real-world data is s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models
arXiv:2603.20957v2 Announce Type: replace-cross Abstract: Frontier LLM companies have repeatedly assured courts and regulators that their models do not store co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection
arXiv:2603.21576v2 Announce Type: replace-cross Abstract: Long-context LLM inference is bottlenecked not by compute but by the O(n) memory bandwidth cost of sca
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Extending Precipitation Nowcasting Horizons via Spectral Fusion of Radar Observations and Foundation Model Priors
arXiv:2603.21768v2 Announce Type: replace-cross Abstract: Precipitation nowcasting is critical for disaster mitigation and aviation safety. However, radar-only
DeepCamp AI