📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 4,216 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (10830)
ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI
📄 Paper
3h ago
MatRes: Zero-Shot Test-Time Model Adaptation for Simultaneous Matching and Restoration
arXiv:2604.10081v1 Announce Type: cross Abstract: Real-world image pairs often exhibit both severe degradations and large viewpoint changes, making image restor
ArXiv cs.AI
📄 Paper
3h ago
Degradation-Consistent Paired Training for Robust AI-Generated Image Detection
arXiv:2604.10102v1 Announce Type: cross Abstract: AI-generated image detectors suffer significant performance degradation under real-world image corruptions suc
ArXiv cs.AI
📄 Paper
3h ago
CircuitSynth: Reliable Synthetic Data Generation
arXiv:2604.10114v1 Announce Type: cross Abstract: The generation of high-fidelity synthetic data is a cornerstone of modern machine learning, yet Large Language
ArXiv cs.AI
📄 Paper
3h ago
A Dual Cross-Attention Graph Learning Framework For Multimodal MRI-Based Major Depressive Disorder Detection
arXiv:2604.10116v1 Announce Type: cross Abstract: Major depressive disorder (MDD) is a prevalent mental disorder associated with complex neurobiological changes
ArXiv cs.AI
📄 Paper
3h ago
MR-Coupler: Automated Metamorphic Test Generation via Functional Coupling Analysis
arXiv:2604.10126v1 Announce Type: cross Abstract: Metamorphic testing (MT) is a widely recognized technique for alleviating the oracle problem in software testi
ArXiv cs.AI
📄 Paper
3h ago
VGA-Bench: A Unified Benchmark and Multi-Model Framework for Video Aesthetics and Generation Quality Evaluation
arXiv:2604.10127v1 Announce Type: cross Abstract: The rapid advancement of AIGC-based video generation has underscored the critical need for comprehensive evalu
ArXiv cs.AI
📄 Paper
3h ago
Semantic Manipulation Localization
arXiv:2604.10132v1 Announce Type: cross Abstract: Image Manipulation Localization (IML) aims to identify edited regions in an image. However, with the increasin
ArXiv cs.AI
📄 Paper
3h ago
Think in Sentences: Explicit Sentence Boundaries Enhance Language Model's Capabilities
arXiv:2604.10135v1 Announce Type: cross Abstract: Researchers have explored different ways to improve large language models (LLMs)' capabilities via dummy token
ArXiv cs.AI
📄 Paper
3h ago
MOSAIC: Multi-Domain Orthogonal Session Adaptive Intent Capture for Prescient Recommendations
arXiv:2604.10147v1 Announce Type: cross Abstract: Capturing user intent across heterogeneous behavioral domains stands as a fundamental challenge in session-bas
ArXiv cs.AI
📄 Paper
3h ago
A Temporally Augmented Graph Attention Network for Affordance Classification
arXiv:2604.10149v1 Announce Type: cross Abstract: Graph attention networks (GATs) provide one of the best frameworks for learning node representations in relati
ArXiv cs.AI
📄 Paper
3h ago
Virtual Smart Metering in District Heating Networks via Heterogeneous Spatial-Temporal Graph Neural Networks
arXiv:2604.10166v1 Announce Type: cross Abstract: Intelligent operation of thermal energy networks aims to improve energy efficiency, reliability, and operation
ArXiv cs.AI
📄 Paper
3h ago
Wolkowicz-Styan Upper Bound on the Hessian Eigenspectrum for Cross-Entropy Loss in Nonlinear Smooth Neural Networks
arXiv:2604.10202v1 Announce Type: cross Abstract: Neural networks (NNs) are central to modern machine learning and achieve state-of-the-art results in many appl
ArXiv cs.AI
📄 Paper
3h ago
Exploring the impact of fairness-aware criteria in AutoML
arXiv:2604.10224v1 Announce Type: cross Abstract: Machine Learning (ML) systems are increasingly used to support decision-making processes that affect individua
ArXiv cs.AI
📄 Paper
3h ago
Adapting 2D Multi-Modal Large Language Model for 3D CT Image Analysis
arXiv:2604.10233v1 Announce Type: cross Abstract: 3D medical image analysis is of great importance in disease diagnosis and treatment. Recently, multimodal larg
ArXiv cs.AI
📄 Paper
3h ago
FashionMV: Product-Level Composed Image Retrieval with Multi-View Fashion Data
arXiv:2604.10297v1 Announce Type: cross Abstract: Composed Image Retrieval (CIR) retrieves target images using a reference image paired with modification text.
ArXiv cs.AI
📄 Paper
3h ago
From Helpful to Trustworthy: LLM Agents for Pair Programming
arXiv:2604.10300v1 Announce Type: cross Abstract: LLM-based coding agents are increasingly used to generate code, tests, and documentation. Still, their outputs
ArXiv cs.AI
📄 Paper
3h ago
Class-Adaptive Cooperative Perception for Multi-Class LiDAR-based 3D Object Detection in V2X Systems
arXiv:2604.10305v1 Announce Type: cross Abstract: Cooperative perception allows connected vehicles and roadside infrastructure to share sensor observations, cre
ArXiv cs.AI
📄 Paper
3h ago
Jailbreaking the Matrix: Nullspace Steering for Controlled Model Subversion
arXiv:2604.10326v1 Announce Type: cross Abstract: Large language models remain vulnerable to jailbreak attacks -- inputs designed to bypass safety mechanisms an
ArXiv cs.AI
📄 Paper
3h ago
A Diffusion-Contrastive Graph Neural Network with Virtual Nodes for Wind Nowcasting in Unobserved Regions
arXiv:2604.10328v1 Announce Type: cross Abstract: Accurate weather nowcasting remains one of the central challenges in atmospheric science, with critical implic
ArXiv cs.AI
📄 Paper
3h ago
Multinex: Lightweight Low-light Image Enhancement via Multi-prior Retinex
arXiv:2604.10359v1 Announce Type: cross Abstract: Low-light image enhancement (LLIE) aims to restore natural visibility, color fidelity, and structural detail u
ArXiv cs.AI
📄 Paper
3h ago
FishRoPE: Projective Rotary Position Embeddings for Omnidirectional Visual Perception
arXiv:2604.10391v1 Announce Type: cross Abstract: Vision foundation models (VFMs) and Bird's Eye View (BEV) representation have advanced visual perception subst
ArXiv cs.AI
📄 Paper
3h ago
Intent-aligned Formal Specification Synthesis via Traceable Refinement
arXiv:2604.10392v1 Announce Type: cross Abstract: Large language models are increasingly used to generate code from natural language, but ensuring correctness r
ArXiv cs.AI
📄 Paper
3h ago
Rethinking Video Human-Object Interaction: Set Prediction over Time for Unified Detection and Anticipation
arXiv:2604.10397v1 Announce Type: cross Abstract: Video-based human-object interaction (HOI) understanding requires both detecting ongoing interactions and anti
ArXiv cs.AI
📄 Paper
3h ago
IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly
arXiv:2604.10409v1 Announce Type: cross Abstract: We introduce IMPACT, a synchronized five-view RGB-D dataset for deployment-oriented industrial procedural unde
DeepCamp AI