AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

When AI Shows Its Work, Is It Actually Working? Step-Level Evaluation Reveals Frontier Language Models Frequently Bypass Their Own Reasoning

arXiv:2603.22816v1 Announce Type: cross Abstract: Language models increasingly "show their work" by writing step-by-step reasoning before answering. But are the

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

arXiv:2603.22819v1 Announce Type: cross Abstract: Tables are pervasive in diverse documents, making table recognition (TR) a fundamental task in document analys

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection

arXiv:2603.22840v1 Announce Type: cross Abstract: Unsupervised anomaly detection plays a pivotal role in industrial defect inspection and medical image analysis

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago

UAV-DETR: DETR for Anti-Drone Target Detection

arXiv:2603.22841v1 Announce Type: cross Abstract: Drone detection is pivotal in numerous security and counter-UAV applications. However, existing deep learning-

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago

UniQueR: Unified Query-based Feedforward 3D Reconstruction

arXiv:2603.22851v1 Announce Type: cross Abstract: We present UniQueR, a unified query-based feedforward framework for efficient and accurate 3D reconstruction f

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Agent Audit: A Security Analysis System for LLM Agent Applications

arXiv:2603.22853v1 Announce Type: cross Abstract: What should a developer inspect before deploying an LLM agent: the model, the tool code, the deployment config

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer

arXiv:2603.22854v1 Announce Type: cross Abstract: Deep learning techniques for rumor detection typically utilize Graph Neural Networks (GNNs) to analyze post re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The Coordinate System Problem in Persistent Structural Memory for Neural Architectures

arXiv:2603.22858v1 Announce Type: cross Abstract: We introduce the Dual-View Pheromone Pathway Network (DPPN), an architecture that routes sparse attention thro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Agent-Sentry: Bounding LLM Agents via Execution Provenance

arXiv:2603.22868v1 Announce Type: cross Abstract: Agentic computing systems, which autonomously spawn new functionalities based on natural language instructions

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models

arXiv:2603.22876v1 Announce Type: cross Abstract: Learning a generalist control policy for dexterous manipulation typically relies on large-scale datasets. Give

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Confidence Calibration under Ambiguous Ground Truth

arXiv:2603.22879v1 Announce Type: cross Abstract: Confidence calibration assumes a unique ground-truth label per input, yet this assumption fails wherever annot

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Off-Policy Evaluation and Learning for Survival Outcomes under Censoring

arXiv:2603.22900v1 Announce Type: cross Abstract: Optimizing survival outcomes, such as patient survival or customer retention, is a critical objective in data-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

arXiv:2603.22911v1 Announce Type: cross Abstract: Due to the great saving of computation and memory overhead, token compression has become a research hot-spot f

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

From the AI Act to a European AI Agency: Completing the Union's Regulatory Architecture

arXiv:2603.22912v1 Announce Type: cross Abstract: As artificial intelligence (AI) technologies continue to advance, effective risk assessment, regulation, and o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

arXiv:2603.22918v1 Announce Type: cross Abstract: Video understanding with multimodal large language models (MLLMs) remains challenging due to the long token se

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The EU AI Act and the Rights-based Approach to Technological Governance

arXiv:2603.22920v1 Announce Type: cross Abstract: The EU AI Act constitutes an important development in shaping the Union's digital regulatory architecture. The

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees

arXiv:2603.22966v1 Announce Type: cross Abstract: Large language models (LLMs) inherently operate over a large generation space, yet conventional usage typicall

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

DariMis: Harm-Aware Modeling for Dari Misinformation Detection on YouTube

arXiv:2603.22977v1 Announce Type: cross Abstract: Dari, the primary language of Afghanistan, is spoken by tens of millions of people yet remains largely absent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Can Graph Foundation Models Generalize Over Architecture?

arXiv:2603.22984v1 Announce Type: cross Abstract: Graph foundation models (GFMs) have recently attracted interest due to the promise of graph neural network (GN

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents

arXiv:2603.23007v1 Announce Type: cross Abstract: The rapid adoption of mobile graphical user interface (GUI) agents, which autonomously control applications an

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

A Sobering Look at Tabular Data Generation via Probabilistic Circuits

arXiv:2603.23016v1 Announce Type: cross Abstract: Tabular data is more challenging to generate than text and images, due to its heterogeneous features and much

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Concept-based explanations of Segmentation and Detection models in Natural Disaster Management

arXiv:2603.23020v1 Announce Type: cross Abstract: Deep learning models for flood and wildfire segmentation and object detection enable precise, real-time disast

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation

arXiv:2603.23030v1 Announce Type: cross Abstract: A sliding-window inference strategy is commonly adopted in recent training-free open-vocabulary semantic segme

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago

YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception

arXiv:2603.23037v1 Announce Type: cross Abstract: The interpretable object detection capabilities of a novel Kolmogorov-Arnold network framework are examined he
