📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 4,216 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (10830) ArXiv cs.AI Dev.to · FORUM WEB Dev.to AI Forbes Innovation OpenAI News Hugging Face Blog

MatRes: Zero-Shot Test-Time Model Adaptation for Simultaneous Matching and Restoration

arXiv:2604.10081v1 Announce Type: cross Abstract: Real-world image pairs often exhibit both severe degradations and large viewpoint changes, making image restor

ArXiv cs.AI 📄 Paper 3h ago

Degradation-Consistent Paired Training for Robust AI-Generated Image Detection

arXiv:2604.10102v1 Announce Type: cross Abstract: AI-generated image detectors suffer significant performance degradation under real-world image corruptions suc

ArXiv cs.AI 📄 Paper 3h ago

CircuitSynth: Reliable Synthetic Data Generation

arXiv:2604.10114v1 Announce Type: cross Abstract: The generation of high-fidelity synthetic data is a cornerstone of modern machine learning, yet Large Language

ArXiv cs.AI 📄 Paper 3h ago

A Dual Cross-Attention Graph Learning Framework For Multimodal MRI-Based Major Depressive Disorder Detection

arXiv:2604.10116v1 Announce Type: cross Abstract: Major depressive disorder (MDD) is a prevalent mental disorder associated with complex neurobiological changes

ArXiv cs.AI 📄 Paper 3h ago

MR-Coupler: Automated Metamorphic Test Generation via Functional Coupling Analysis

arXiv:2604.10126v1 Announce Type: cross Abstract: Metamorphic testing (MT) is a widely recognized technique for alleviating the oracle problem in software testi

ArXiv cs.AI 📄 Paper 3h ago

VGA-Bench: A Unified Benchmark and Multi-Model Framework for Video Aesthetics and Generation Quality Evaluation

arXiv:2604.10127v1 Announce Type: cross Abstract: The rapid advancement of AIGC-based video generation has underscored the critical need for comprehensive evalu

ArXiv cs.AI 📄 Paper 3h ago

Semantic Manipulation Localization

arXiv:2604.10132v1 Announce Type: cross Abstract: Image Manipulation Localization (IML) aims to identify edited regions in an image. However, with the increasin

ArXiv cs.AI 📄 Paper 3h ago

Think in Sentences: Explicit Sentence Boundaries Enhance Language Model's Capabilities

arXiv:2604.10135v1 Announce Type: cross Abstract: Researchers have explored different ways to improve large language models (LLMs)' capabilities via dummy token

ArXiv cs.AI 📄 Paper 3h ago

MOSAIC: Multi-Domain Orthogonal Session Adaptive Intent Capture for Prescient Recommendations

arXiv:2604.10147v1 Announce Type: cross Abstract: Capturing user intent across heterogeneous behavioral domains stands as a fundamental challenge in session-bas

ArXiv cs.AI 📄 Paper 3h ago

A Temporally Augmented Graph Attention Network for Affordance Classification

arXiv:2604.10149v1 Announce Type: cross Abstract: Graph attention networks (GATs) provide one of the best frameworks for learning node representations in relati

ArXiv cs.AI 📄 Paper 3h ago

Virtual Smart Metering in District Heating Networks via Heterogeneous Spatial-Temporal Graph Neural Networks

arXiv:2604.10166v1 Announce Type: cross Abstract: Intelligent operation of thermal energy networks aims to improve energy efficiency, reliability, and operation

ArXiv cs.AI 📄 Paper 3h ago

Wolkowicz-Styan Upper Bound on the Hessian Eigenspectrum for Cross-Entropy Loss in Nonlinear Smooth Neural Networks

arXiv:2604.10202v1 Announce Type: cross Abstract: Neural networks (NNs) are central to modern machine learning and achieve state-of-the-art results in many appl

ArXiv cs.AI 📄 Paper 3h ago

Exploring the impact of fairness-aware criteria in AutoML

arXiv:2604.10224v1 Announce Type: cross Abstract: Machine Learning (ML) systems are increasingly used to support decision-making processes that affect individua

ArXiv cs.AI 📄 Paper 3h ago

Adapting 2D Multi-Modal Large Language Model for 3D CT Image Analysis

arXiv:2604.10233v1 Announce Type: cross Abstract: 3D medical image analysis is of great importance in disease diagnosis and treatment. Recently, multimodal larg

ArXiv cs.AI 📄 Paper 3h ago

FashionMV: Product-Level Composed Image Retrieval with Multi-View Fashion Data

arXiv:2604.10297v1 Announce Type: cross Abstract: Composed Image Retrieval (CIR) retrieves target images using a reference image paired with modification text.

ArXiv cs.AI 📄 Paper 3h ago

From Helpful to Trustworthy: LLM Agents for Pair Programming

arXiv:2604.10300v1 Announce Type: cross Abstract: LLM-based coding agents are increasingly used to generate code, tests, and documentation. Still, their outputs

ArXiv cs.AI 📄 Paper 3h ago

Class-Adaptive Cooperative Perception for Multi-Class LiDAR-based 3D Object Detection in V2X Systems

arXiv:2604.10305v1 Announce Type: cross Abstract: Cooperative perception allows connected vehicles and roadside infrastructure to share sensor observations, cre

ArXiv cs.AI 📄 Paper 3h ago

Jailbreaking the Matrix: Nullspace Steering for Controlled Model Subversion

arXiv:2604.10326v1 Announce Type: cross Abstract: Large language models remain vulnerable to jailbreak attacks -- inputs designed to bypass safety mechanisms an

ArXiv cs.AI 📄 Paper 3h ago

A Diffusion-Contrastive Graph Neural Network with Virtual Nodes for Wind Nowcasting in Unobserved Regions

arXiv:2604.10328v1 Announce Type: cross Abstract: Accurate weather nowcasting remains one of the central challenges in atmospheric science, with critical implic

ArXiv cs.AI 📄 Paper 3h ago

Multinex: Lightweight Low-light Image Enhancement via Multi-prior Retinex

arXiv:2604.10359v1 Announce Type: cross Abstract: Low-light image enhancement (LLIE) aims to restore natural visibility, color fidelity, and structural detail u

ArXiv cs.AI 📄 Paper 3h ago

FishRoPE: Projective Rotary Position Embeddings for Omnidirectional Visual Perception

arXiv:2604.10391v1 Announce Type: cross Abstract: Vision foundation models (VFMs) and Bird's Eye View (BEV) representation have advanced visual perception subst

ArXiv cs.AI 📄 Paper 3h ago

Intent-aligned Formal Specification Synthesis via Traceable Refinement

arXiv:2604.10392v1 Announce Type: cross Abstract: Large language models are increasingly used to generate code from natural language, but ensuring correctness r

ArXiv cs.AI 📄 Paper 3h ago

Rethinking Video Human-Object Interaction: Set Prediction over Time for Unified Detection and Anticipation

arXiv:2604.10397v1 Announce Type: cross Abstract: Video-based human-object interaction (HOI) understanding requires both detecting ongoing interactions and anti

ArXiv cs.AI 📄 Paper 3h ago

IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly

arXiv:2604.10409v1 Announce Type: cross Abstract: We introduce IMPACT, a synchronized five-view RGB-D dataset for deployment-oriented industrial procedural unde