AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model

arXiv:2603.25184v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become essential for post-training large language models (LLMs) in reasoning t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Probing the Lack of Stable Internal Beliefs in LLMs

arXiv:2603.25187v1 Announce Type: cross Abstract: Persona-driven large language models (LLMs) require consistent behavioral tendencies across interactions to si

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

A Decade-Scale Benchmark Evaluating LLMs' Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations

arXiv:2603.25196v1 Announce Type: cross Abstract: Clinical practice guidelines (CPGs) play a pivotal role in ensuring evidence-based decision-making and improvi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction

arXiv:2603.25209v1 Announce Type: cross Abstract: Generating long videos using pre-trained video diffusion models, which are typically trained on short clips, p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

A Wireless World Model for AI-Native 6G Networks

arXiv:2603.25216v1 Announce Type: cross Abstract: Integrating AI into the physical layer is a cornerstone of 6G networks. However, current data-driven approache

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing

arXiv:2603.25226v1 Announce Type: cross Abstract: The emergence of Large Language Models (LLMs) has catalyzed a paradigm shift in programming, giving rise to "v

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA

arXiv:2603.25243v1 Announce Type: cross Abstract: Large language models and autonomous agents are increasingly explored for EDA automation, but many existing in

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

FEAST: Fully Connected Expressive Attention for Spatial Transcriptomics

arXiv:2603.25247v1 Announce Type: cross Abstract: Spatial Transcriptomics (ST) provides spatially-resolved gene expression, offering crucial insights into tissu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models

arXiv:2603.25250v1 Announce Type: cross Abstract: Out-of-distribution (OOD) detection aims to identify samples that deviate from in-distribution (ID). One popul

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

Does Explanation Correctness Matter? Linking Computational XAI Evaluation to Human Understanding

arXiv:2603.25251v1 Announce Type: cross Abstract: Explainable AI (XAI) methods are commonly evaluated with functional metrics such as correctness, which computa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation

arXiv:2603.25253v1 Announce Type: cross Abstract: Large language models (LLMs) hold considerable potential for advancing scientific discovery, yet systematic as

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

CRAFT: Grounded Multi-Agent Coordination Under Partial Information

arXiv:2603.25268v1 Announce Type: cross Abstract: We introduce CRAFT, a multi-agent benchmark for evaluating pragmatic communication in large language models un

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

CSI-tuples-based 3D Channel Fingerprints Construction Assisted by MultiModal Learning

arXiv:2603.25288v1 Announce Type: cross Abstract: Low-altitude communications can promote the integration of aerial and terrestrial wireless resources, expand n

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Revealing the influence of participant failures on model quality in cross-silo Federated Learning

arXiv:2603.25289v1 Announce Type: cross Abstract: Federated Learning (FL) is a paradigm for training machine learning (ML) models in collaborative settings whil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study

arXiv:2603.25322v1 Announce Type: cross Abstract: Alzheimer's disease (AD) is a growing global health challenge as populations age, and timely, accurate diagnos

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv:2603.25325v1 Announce Type: cross Abstract: Weight pruning is a standard technique for compressing large language models, yet its effect on learned intern

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

Adaptive Chunking: Optimizing Chunking-Method Selection for RAG

arXiv:2603.25333v1 Announce Type: cross Abstract: The effectiveness of Retrieval-Augmented Generation (RAG) is highly dependent on how documents are chunked, th

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago

Image Rotation Angle Estimation: Comparing Circular-Aware Methods

arXiv:2603.25351v1 Announce Type: cross Abstract: Automatic image rotation estimation is a key preprocessing step in many vision pipelines. This task is challen

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics

arXiv:2603.25366v1 Announce Type: cross Abstract: Autonomous object search is challenging for mobile robots operating in indoor environments due to partial obse

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs

arXiv:2603.25385v1 Announce Type: cross Abstract: Quantization techniques such as BitsAndBytes, AWQ, and GPTQ are widely used as a standard method in deploying

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

A Causal Framework for Evaluating ICU Discharge Strategies

arXiv:2603.25397v1 Announce Type: cross Abstract: In this applied paper, we address the difficult open problem of when to discharge patients from the Intensive

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models

arXiv:2603.25403v1 Announce Type: cross Abstract: On-device Vision-Language Models (VLMs) promise data privacy via local execution. However, we show that the ar

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

System Design for Maintaining Internal State Consistency in Long-Horizon Robotic Tabletop Games

arXiv:2603.25405v1 Announce Type: cross Abstract: Long-horizon tabletop games pose a distinct systems challenge for robotics: small perceptual or execution erro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Decidable By Construction: Design-Time Verification for Trustworthy AI

arXiv:2603.25414v1 Announce Type: cross Abstract: A prevailing assumption in machine learning is that model correctness must be enforced after the fact. We obse

📰 ArXiv cs.AI