2,281 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 2,281 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (5221) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIHackernoon
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
When AI Shows Its Work, Is It Actually Working? Step-Level Evaluation Reveals Frontier Language Models Frequently Bypass Their Own Reasoning
arXiv:2603.22816v1 Announce Type: cross Abstract: Language models increasingly "show their work" by writing step-by-step reasoning before answering. But are the
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment
arXiv:2603.22819v1 Announce Type: cross Abstract: Tables are pervasive in diverse documents, making table recognition (TR) a fundamental task in document analys
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection
arXiv:2603.22840v1 Announce Type: cross Abstract: Unsupervised anomaly detection plays a pivotal role in industrial defect inspection and medical image analysis
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago
UAV-DETR: DETR for Anti-Drone Target Detection
arXiv:2603.22841v1 Announce Type: cross Abstract: Drone detection is pivotal in numerous security and counter-UAV applications. However, existing deep learning-
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago
UniQueR: Unified Query-based Feedforward 3D Reconstruction
arXiv:2603.22851v1 Announce Type: cross Abstract: We present UniQueR, a unified query-based feedforward framework for efficient and accurate 3D reconstruction f
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Agent Audit: A Security Analysis System for LLM Agent Applications
arXiv:2603.22853v1 Announce Type: cross Abstract: What should a developer inspect before deploying an LLM agent: the model, the tool code, the deployment config
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer
arXiv:2603.22854v1 Announce Type: cross Abstract: Deep learning techniques for rumor detection typically utilize Graph Neural Networks (GNNs) to analyze post re
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
The Coordinate System Problem in Persistent Structural Memory for Neural Architectures
arXiv:2603.22858v1 Announce Type: cross Abstract: We introduce the Dual-View Pheromone Pathway Network (DPPN), an architecture that routes sparse attention thro
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Agent-Sentry: Bounding LLM Agents via Execution Provenance
arXiv:2603.22868v1 Announce Type: cross Abstract: Agentic computing systems, which autonomously spawn new functionalities based on natural language instructions
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models
arXiv:2603.22876v1 Announce Type: cross Abstract: Learning a generalist control policy for dexterous manipulation typically relies on large-scale datasets. Give
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Confidence Calibration under Ambiguous Ground Truth
arXiv:2603.22879v1 Announce Type: cross Abstract: Confidence calibration assumes a unique ground-truth label per input, yet this assumption fails wherever annot
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Off-Policy Evaluation and Learning for Survival Outcomes under Censoring
arXiv:2603.22900v1 Announce Type: cross Abstract: Optimizing survival outcomes, such as patient survival or customer retention, is a critical objective in data-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling
arXiv:2603.22911v1 Announce Type: cross Abstract: Due to the great saving of computation and memory overhead, token compression has become a research hot-spot f
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
From the AI Act to a European AI Agency: Completing the Union's Regulatory Architecture
arXiv:2603.22912v1 Announce Type: cross Abstract: As artificial intelligence (AI) technologies continue to advance, effective risk assessment, regulation, and o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
arXiv:2603.22918v1 Announce Type: cross Abstract: Video understanding with multimodal large language models (MLLMs) remains challenging due to the long token se
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
The EU AI Act and the Rights-based Approach to Technological Governance
arXiv:2603.22920v1 Announce Type: cross Abstract: The EU AI Act constitutes an important development in shaping the Union's digital regulatory architecture. The
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees
arXiv:2603.22966v1 Announce Type: cross Abstract: Large language models (LLMs) inherently operate over a large generation space, yet conventional usage typicall
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
DariMis: Harm-Aware Modeling for Dari Misinformation Detection on YouTube
arXiv:2603.22977v1 Announce Type: cross Abstract: Dari, the primary language of Afghanistan, is spoken by tens of millions of people yet remains largely absent
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Can Graph Foundation Models Generalize Over Architecture?
arXiv:2603.22984v1 Announce Type: cross Abstract: Graph foundation models (GFMs) have recently attracted interest due to the promise of graph neural network (GN
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents
arXiv:2603.23007v1 Announce Type: cross Abstract: The rapid adoption of mobile graphical user interface (GUI) agents, which autonomously control applications an
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
A Sobering Look at Tabular Data Generation via Probabilistic Circuits
arXiv:2603.23016v1 Announce Type: cross Abstract: Tabular data is more challenging to generate than text and images, due to its heterogeneous features and much
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Concept-based explanations of Segmentation and Detection models in Natural Disaster Management
arXiv:2603.23020v1 Announce Type: cross Abstract: Deep learning models for flood and wildfire segmentation and object detection enable precise, real-time disast
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation
arXiv:2603.23030v1 Announce Type: cross Abstract: A sliding-window inference strategy is commonly adopted in recent training-free open-vocabulary semantic segme
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago
YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception
arXiv:2603.23037v1 Announce Type: cross Abstract: The interpretable object detection capabilities of a novel Kolmogorov-Arnold network framework are examined he