AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Mechanistically Interpreting Compression in Vision-Language Models

arXiv:2603.25035v1 Announce Type: new Abstract: Compressed vision-language models (VLMs) are widely used to reduce memory and compute costs, making them a suita

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago

MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting

arXiv:2603.25046v1 Announce Type: new Abstract: Precipitation forecasting remains a persistent challenge in tropical regions like Vietnam, where complex topogra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Sparse Visual Thought Circuits in Vision-Language Models

arXiv:2603.25075v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) improve interpretability in multimodal models, but it remains unclear whether SAE fea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents

arXiv:2603.25097v1 Announce Type: new Abstract: Large Language Model based agents increasingly operate in high stakes, multi turn settings where factual groundi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

When Sensing Varies with Contexts: Context-as-Transform for Tactile Few-Shot Class-Incremental Learning

arXiv:2603.25115v1 Announce Type: new Abstract: Few-Shot Class-Incremental Learning (FSCIL) can be particularly susceptible to acquisition contexts with only a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

arXiv:2603.25133v1 Announce Type: new Abstract: Rubric-based evaluation has become a prevailing paradigm for evaluating instruction following in large language

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

UniAI-GraphRAG: Synergizing Ontology-Guided Extraction, Multi-Dimensional Clustering, and Dual-Channel Fusion for Robust Multi-Hop Reasoning

arXiv:2603.25152v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems face significant challenges in complex reasoning, multi-hop queries

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

arXiv:2603.25158v1 Announce Type: new Abstract: Equipping Large Language Model (LLM) agents with domain-specific skills is critical for tackling complex tasks.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

The Competence Shadow: Theory and Bounds of AI Assistance in Safety Engineering

arXiv:2603.25197v1 Announce Type: new Abstract: As AI assistants become integrated into safety engineering workflows for Physical AI systems, a critical questio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation

arXiv:2603.25266v1 Announce Type: new Abstract: Probabilistic abstract interpretation is a theory used to extract particular properties of a computer program wh

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago

Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpretation to Neural Network Analysis

arXiv:2603.25273v1 Announce Type: new Abstract: The probabilistic abstract interpretation framework of neural network analysis analyzes a neural network by anal

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago

A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion

arXiv:2603.25283v1 Announce Type: new Abstract: Gait is increasingly recognized as a vital sign, yet current approaches treat it as a symptom of specific pathol

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

SliderQuant: Accurate Post-Training Quantization for LLMs

arXiv:2603.25284v1 Announce Type: new Abstract: In this paper, we address post-training quantization (PTQ) for large language models (LLMs) from an overlooked p

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago

DAGverse: Building Document-Grounded Semantic DAGs from Scientific Papers

arXiv:2603.25293v1 Announce Type: new Abstract: Directed Acyclic Graphs (DAGs) are widely used to represent structured knowledge in scientific and technical dom

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Evaluating Language Models for Harmful Manipulation

arXiv:2603.25326v1 Announce Type: new Abstract: Interest in the concept of AI-driven harmful manipulation is growing, yet current approaches to evaluating it ar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles

arXiv:2603.25328v1 Announce Type: new Abstract: Automated Vehicle (AV) control in mixed traffic, where AVs coexist with human-driven vehicles, poses significant

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Agentic Trust Coordination for Federated Learning through Adaptive Thresholding and Autonomous Decision Making in Sustainable and Resilient Industrial Networks

arXiv:2603.25334v1 Announce Type: new Abstract: Distributed intelligence in industrial networks increasingly integrates sensing, communication, and computation

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles

arXiv:2603.25356v1 Announce Type: new Abstract: Arithmetic puzzle games provide a controlled setting for studying difficulty in mathematical reasoning tasks, a

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago

Does Structured Intent Representation Generalize? A Cross-Language, Cross-Model Empirical Study of 5W3H Prompting

arXiv:2603.25379v1 Announce Type: new Abstract: Does structured intent representation generalize across languages and models? We study PPS (Prompt Protocol Spec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models

arXiv:2603.25412v1 Announce Type: new Abstract: Large language models (LLMs) increasingly rely on explicit chain-of-thought (CoT) reasoning to solve complex tas

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation

arXiv:2603.25415v1 Announce Type: new Abstract: Semantic world models enable embodied agents to reason about objects, relations, and spatial context beyond pure

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Cross-Model Disagreement as a Label-Free Correctness Signal

arXiv:2603.25450v1 Announce Type: new Abstract: Detecting when a language model is wrong without ground truth labels is a fundamental challenge for safe deploym

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Retraining as Approximate Bayesian Inference

arXiv:2603.25480v1 Announce Type: new Abstract: Model retraining is usually treated as an ongoing maintenance task. But as Harrison Katz now argues, retraining

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents

arXiv:2603.25498v1 Announce Type: new Abstract: As the Web transitions from static retrieval to generative interaction, the escalating environmental footprint o

📰 ArXiv cs.AI