📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 4,216 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (10830) ArXiv cs.AI Dev.to · FORUM WEB Dev.to AI Forbes Innovation OpenAI News Hugging Face Blog

Efficient Personalization of Generative User Interfaces

arXiv:2604.09876v1 Announce Type: cross Abstract: Generative user interfaces (UIs) create new opportunities to adapt interfaces to individual users on demand, b

ArXiv cs.AI 📄 Paper 3h ago

DINO_4D: Semantic-Aware 4D Reconstruction

arXiv:2604.09877v1 Announce Type: cross Abstract: In the intersection of computer vision and robotic perception, 4D reconstruction of dynamic scenes serve as th

ArXiv cs.AI 📄 Paper 3h ago

Not Your Stereo-Typical Estimator: Combining Vision and Language for Volume Perception

arXiv:2604.09886v1 Announce Type: cross Abstract: Accurate volume estimation of objects from visual data is a long-standing challenge in computer vision with si

ArXiv cs.AI 📄 Paper 3h ago

Should We be Pedantic About Reasoning Errors in Machine Translation?

arXiv:2604.09890v1 Announce Type: cross Abstract: Across multiple language pairings (English $\to$ \{Spanish, French, German, Mandarin, Japanese, Urdu, Cantones

ArXiv cs.AI 📄 Paper 3h ago

Diffusion Denoiser Achievable Analysis for Finite Blocklength Unsourced Random Access

arXiv:2604.09904v1 Announce Type: cross Abstract: Polyanskiy proposed a framework for the unsourced multiple access channel (MAC) problem where users employ a c

ArXiv cs.AI 📄 Paper 3h ago

From UAV Imagery to Agronomic Reasoning: A Multimodal LLM Benchmark for Plant Phenotyping

arXiv:2604.09907v1 Announce Type: cross Abstract: To improve crop genetics, high-throughput, effective and comprehensive phenotyping is a critical prerequisite.

ArXiv cs.AI 📄 Paper 3h ago

The Rise and Fall of $G$ in AGI

arXiv:2604.09911v1 Announce Type: cross Abstract: In the psychological literature the term `general intelligence' describes correlations between abilities and n

ArXiv cs.AI 📄 Paper 3h ago

A Hybrid Intelligent Framework for Uncertainty-Aware Condition Monitoring of Industrial Systems

arXiv:2604.09932v1 Announce Type: cross Abstract: Hybrid approaches that combine data-driven learning with physics-based insight have shown promise for improvin

ArXiv cs.AI 📄 Paper 3h ago

I Walk the Line: Examining the Role of Gestalt Continuity in Object Binding for Vision Transformers

arXiv:2604.09942v1 Announce Type: cross Abstract: Object binding is a foundational process in visual cognition, during which low-level perceptual features are j

ArXiv cs.AI 📄 Paper 3h ago

Cross-Cultural Value Awareness in Large Vision-Language Models

arXiv:2604.09945v1 Announce Type: cross Abstract: The rapid adoption of large vision-language models (LVLMs) in recent years has been accompanied by growing fai

ArXiv cs.AI 📄 Paper 3h ago

Rebooting Microreboot: Architectural Support for Safe, Parallel Recovery in Microservice Systems

arXiv:2604.09963v1 Announce Type: cross Abstract: Microreboot enables fast recovery by restarting only the failing component, but in modern microservices naive

ArXiv cs.AI 📄 Paper 3h ago

Muon$^2$: Boosting Muon via Adaptive Second-Moment Preconditioning

arXiv:2604.09967v1 Announce Type: cross Abstract: Muon has emerged as a promising optimizer for large-scale foundation model pre-training by exploiting the matr

ArXiv cs.AI 📄 Paper 3h ago

A Minimal Model of Representation Collapse: Frustration, Stop-Gradient, and Dynamics

arXiv:2604.09979v1 Announce Type: cross Abstract: Self-supervised representation learning is central to modern machine learning because it extracts structured l

ArXiv cs.AI 📄 Paper 3h ago

FlowPalm: Optical Flow Driven Non-Rigid Deformation for Geometrically Diverse Palmprint Generation

arXiv:2604.09989v1 Announce Type: cross Abstract: Recently, synthetic palmprints have been increasingly used as substitutes for real data to train recognition m

ArXiv cs.AI 📄 Paper 3h ago

Agentic Application in Power Grid Static Analysis: Automatic Code Generation and Error Correction

arXiv:2604.09995v1 Announce Type: cross Abstract: This paper introduces an LLM agent that automates power grid static analysis by converting natural language in

ArXiv cs.AI 📄 Paper 3h ago

Like a Hammer, It Can Build, It Can Break: Large Language Model Uses, Perceptions, and Adoption in Cybersecurity Operations on Reddit

arXiv:2604.09998v1 Announce Type: cross Abstract: Large language models (LLMs) have recently emerged as promising tools for augmenting Security Operations Cente

ArXiv cs.AI 📄 Paper 3h ago

Demographic and Linguistic Bias Evaluation in Omnimodal Language Models

arXiv:2604.10014v1 Announce Type: cross Abstract: This paper provides a comprehensive evaluation of demographic and linguistic biases in omnimodal language mode

ArXiv cs.AI 📄 Paper 3h ago

FREE-Switch: Frequency-based Dynamic LoRA Switch for Style Transfer

arXiv:2604.10023v1 Announce Type: cross Abstract: With the growing availability of open-sourced adapters trained on the same diffusion backbone for diverse scen

ArXiv cs.AI 📄 Paper 3h ago

LVSum: A Benchmark for Timestamp-Aware Long Video Summarization

arXiv:2604.10024v1 Announce Type: cross Abstract: Long video summarization presents significant challenges for current multimodal large language models (MLLMs),

ArXiv cs.AI 📄 Paper 3h ago

CoSToM:Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models

arXiv:2604.10031v1 Announce Type: cross Abstract: Theory of Mind (ToM), the ability to attribute mental states to others, is a hallmark of social intelligence.

ArXiv cs.AI 📄 Paper 3h ago

Closed-Form Concept Erasure via Double Projections

arXiv:2604.10032v1 Announce Type: cross Abstract: While modern generative models such as diffusion-based architectures have enabled impressive creative capabili

ArXiv cs.AI 📄 Paper 3h ago

Computational Implementation of a Model of Category-Theoretic Metaphor Comprehension

arXiv:2604.10035v1 Announce Type: cross Abstract: In this study, we developed a computational implementation for a model of metaphor comprehension based on the

ArXiv cs.AI 📄 Paper 3h ago

ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models

arXiv:2604.10065v1 Announce Type: cross Abstract: End-to-end full-duplex Speech Language Models (SLMs) require precise turn-taking for natural interaction. Howe

ArXiv cs.AI 📄 Paper 3h ago

Graph-RHO: Critical-path-aware Heterogeneous Graph Network for Long-Horizon Flexible Job-Shop Scheduling

arXiv:2604.10073v1 Announce Type: cross Abstract: Long-horizon Flexible Job-Shop Scheduling~(FJSP) presents a formidable combinatorial challenge due to complex,