AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

SPARE: Self-distillation for PARameter-Efficient Removal

arXiv:2602.07058v2 Announce Type: replace-cross Abstract: Machine Unlearning aims to remove the influence of specific data or concepts from trained models while

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

On Randomness in Agentic Evals

arXiv:2602.07150v3 Announce Type: replace-cross Abstract: Agentic systems are evaluated on benchmarks where agents interact with environments to solve tasks. Mo

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

KRONE: Hierarchical and Modular Log Anomaly Detection

arXiv:2602.07303v2 Announce Type: replace-cross Abstract: Log anomaly detection is crucial for uncovering system failures and security risks. Although logs orig

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

arXiv:2602.07906v4 Announce Type: replace-cross Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimiza

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model

arXiv:2602.12304v3 Announce Type: replace-cross Abstract: Existing mainstream video customization methods focus on generating identity-consistent videos based o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling

arXiv:2602.16485v2 Announce Type: replace-cross Abstract: Existing Multi-Agent Systems (MAS) typically rely on homogeneous model configurations, failing to expl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Smooth Gate Functions for Soft Advantage Policy Optimization

arXiv:2602.19345v2 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) has significantly advanced the training of large language mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies

arXiv:2602.23811v3 Announce Type: replace-cross Abstract: We investigate the theoretical aspects of offline reinforcement learning (RL) under general function a

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security

arXiv:2603.08566v2 Announce Type: replace-cross Abstract: DARPA's AI Cyber Challenge (AIxCC) showed that cyber reasoning systems (CRSs) can go beyond vulnerabil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings

arXiv:2603.09643v3 Announce Type: replace-cross Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Exploring Collatz Dynamics with Human-LLM Collaboration

arXiv:2603.11066v3 Announce Type: replace-cross Abstract: We develop a structural and quantitative framework for analyzing the Collatz map through modular dynam

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies

arXiv:2603.12510v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have significant potential to enable general-purpose robotic syste

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents

arXiv:2603.12564v3 Announce Type: replace-cross Abstract: Tool-augmented LLM agents increasingly serve as multi-turn advisors in high-stakes domains, yet their

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Geometry-Guided Camera Motion Understanding in VideoLLMs

arXiv:2603.13119v2 Announce Type: replace-cross Abstract: Camera motion is a fundamental geometric signal that shapes visual perception and cinematic style, yet

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 6d ago

Pixel-level Scene Understanding in One Token: Visual States Need What-is-Where Composition

arXiv:2603.13904v2 Announce Type: replace-cross Abstract: For robotic agents operating in dynamic environments, learning visual state representations from strea

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

FedPBS: Proximal-Balanced Scaling Federated Learning Model for Robust Personalized Training for Non-IID Data

arXiv:2603.13909v2 Announce Type: replace-cross Abstract: Federated learning (FL) enables a set of distributed clients to jointly train machine learning models

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning

arXiv:2603.14867v2 Announce Type: replace-cross Abstract: Many strategic decision-making problems, such as environment design for warehouse robots, can be natur

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

arXiv:2603.15970v3 Announce Type: replace-cross Abstract: Several data warehouse and database providers have recently introduced extensions to SQL called AI Que

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval

arXiv:2603.17872v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have achieved unprecedented fluency but remain susceptible to "hallucinat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Evolutionarily Stable Stackelberg Equilibrium

arXiv:2603.18385v2 Announce Type: replace-cross Abstract: We present a new solution concept called evolutionarily stable Stackelberg equilibrium (SESS). We stud

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Ontology-Guided Diffusion for Zero-Shot Visual Sim2Real Transfer

arXiv:2603.18719v2 Announce Type: replace-cross Abstract: Bridging the simulation-to-reality (sim2real) gap remains challenging as labelled real-world data is s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models

arXiv:2603.20957v2 Announce Type: replace-cross Abstract: Frontier LLM companies have repeatedly assured courts and regulators that their models do not store co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection

arXiv:2603.21576v2 Announce Type: replace-cross Abstract: Long-context LLM inference is bottlenecked not by compute but by the O(n) memory bandwidth cost of sca

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Extending Precipitation Nowcasting Horizons via Spectral Fusion of Radar Observations and Foundation Model Priors

arXiv:2603.21768v2 Announce Type: replace-cross Abstract: Precipitation nowcasting is critical for disaster mitigation and aviation safety. However, radar-only

📰 ArXiv cs.AI