📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 4,216 articles · Updated every 3 hours
ArXiv cs.AI
📄 Paper
3h ago
Efficient Personalization of Generative User Interfaces
arXiv:2604.09876v1 Announce Type: cross Abstract: Generative user interfaces (UIs) create new opportunities to adapt interfaces to individual users on demand, b
ArXiv cs.AI
📄 Paper
3h ago
DINO_4D: Semantic-Aware 4D Reconstruction
arXiv:2604.09877v1 Announce Type: cross Abstract: In the intersection of computer vision and robotic perception, 4D reconstruction of dynamic scenes serves as th
ArXiv cs.AI
📄 Paper
3h ago
Not Your Stereo-Typical Estimator: Combining Vision and Language for Volume Perception
arXiv:2604.09886v1 Announce Type: cross Abstract: Accurate volume estimation of objects from visual data is a long-standing challenge in computer vision with si
ArXiv cs.AI
📄 Paper
3h ago
Should We be Pedantic About Reasoning Errors in Machine Translation?
arXiv:2604.09890v1 Announce Type: cross Abstract: Across multiple language pairings (English $\to$ \{Spanish, French, German, Mandarin, Japanese, Urdu, Cantones
ArXiv cs.AI
📄 Paper
3h ago
Diffusion Denoiser Achievable Analysis for Finite Blocklength Unsourced Random Access
arXiv:2604.09904v1 Announce Type: cross Abstract: Polyanskiy proposed a framework for the unsourced multiple access channel (MAC) problem where users employ a c
ArXiv cs.AI
📄 Paper
3h ago
From UAV Imagery to Agronomic Reasoning: A Multimodal LLM Benchmark for Plant Phenotyping
arXiv:2604.09907v1 Announce Type: cross Abstract: To improve crop genetics, high-throughput, effective and comprehensive phenotyping is a critical prerequisite.
ArXiv cs.AI
📄 Paper
3h ago
The Rise and Fall of $G$ in AGI
arXiv:2604.09911v1 Announce Type: cross Abstract: In the psychological literature the term 'general intelligence' describes correlations between abilities and n
ArXiv cs.AI
📄 Paper
3h ago
A Hybrid Intelligent Framework for Uncertainty-Aware Condition Monitoring of Industrial Systems
arXiv:2604.09932v1 Announce Type: cross Abstract: Hybrid approaches that combine data-driven learning with physics-based insight have shown promise for improvin
ArXiv cs.AI
📄 Paper
3h ago
I Walk the Line: Examining the Role of Gestalt Continuity in Object Binding for Vision Transformers
arXiv:2604.09942v1 Announce Type: cross Abstract: Object binding is a foundational process in visual cognition, during which low-level perceptual features are j
ArXiv cs.AI
📄 Paper
3h ago
Cross-Cultural Value Awareness in Large Vision-Language Models
arXiv:2604.09945v1 Announce Type: cross Abstract: The rapid adoption of large vision-language models (LVLMs) in recent years has been accompanied by growing fai
ArXiv cs.AI
📄 Paper
3h ago
Rebooting Microreboot: Architectural Support for Safe, Parallel Recovery in Microservice Systems
arXiv:2604.09963v1 Announce Type: cross Abstract: Microreboot enables fast recovery by restarting only the failing component, but in modern microservices naive
ArXiv cs.AI
📄 Paper
3h ago
Muon$^2$: Boosting Muon via Adaptive Second-Moment Preconditioning
arXiv:2604.09967v1 Announce Type: cross Abstract: Muon has emerged as a promising optimizer for large-scale foundation model pre-training by exploiting the matr
ArXiv cs.AI
📄 Paper
3h ago
A Minimal Model of Representation Collapse: Frustration, Stop-Gradient, and Dynamics
arXiv:2604.09979v1 Announce Type: cross Abstract: Self-supervised representation learning is central to modern machine learning because it extracts structured l
ArXiv cs.AI
📄 Paper
3h ago
FlowPalm: Optical Flow Driven Non-Rigid Deformation for Geometrically Diverse Palmprint Generation
arXiv:2604.09989v1 Announce Type: cross Abstract: Recently, synthetic palmprints have been increasingly used as substitutes for real data to train recognition m
ArXiv cs.AI
📄 Paper
3h ago
Agentic Application in Power Grid Static Analysis: Automatic Code Generation and Error Correction
arXiv:2604.09995v1 Announce Type: cross Abstract: This paper introduces an LLM agent that automates power grid static analysis by converting natural language in
ArXiv cs.AI
📄 Paper
3h ago
Like a Hammer, It Can Build, It Can Break: Large Language Model Uses, Perceptions, and Adoption in Cybersecurity Operations on Reddit
arXiv:2604.09998v1 Announce Type: cross Abstract: Large language models (LLMs) have recently emerged as promising tools for augmenting Security Operations Cente
ArXiv cs.AI
📄 Paper
3h ago
Demographic and Linguistic Bias Evaluation in Omnimodal Language Models
arXiv:2604.10014v1 Announce Type: cross Abstract: This paper provides a comprehensive evaluation of demographic and linguistic biases in omnimodal language mode
ArXiv cs.AI
📄 Paper
3h ago
FREE-Switch: Frequency-based Dynamic LoRA Switch for Style Transfer
arXiv:2604.10023v1 Announce Type: cross Abstract: With the growing availability of open-sourced adapters trained on the same diffusion backbone for diverse scen
ArXiv cs.AI
📄 Paper
3h ago
LVSum: A Benchmark for Timestamp-Aware Long Video Summarization
arXiv:2604.10024v1 Announce Type: cross Abstract: Long video summarization presents significant challenges for current multimodal large language models (MLLMs),
ArXiv cs.AI
📄 Paper
3h ago
CoSToM: Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models
arXiv:2604.10031v1 Announce Type: cross Abstract: Theory of Mind (ToM), the ability to attribute mental states to others, is a hallmark of social intelligence.
ArXiv cs.AI
📄 Paper
3h ago
Closed-Form Concept Erasure via Double Projections
arXiv:2604.10032v1 Announce Type: cross Abstract: While modern generative models such as diffusion-based architectures have enabled impressive creative capabili
ArXiv cs.AI
📄 Paper
3h ago
Computational Implementation of a Model of Category-Theoretic Metaphor Comprehension
arXiv:2604.10035v1 Announce Type: cross Abstract: In this study, we developed a computational implementation for a model of metaphor comprehension based on the
ArXiv cs.AI
📄 Paper
3h ago
ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models
arXiv:2604.10065v1 Announce Type: cross Abstract: End-to-end full-duplex Speech Language Models (SLMs) require precise turn-taking for natural interaction. Howe
ArXiv cs.AI
📄 Paper
3h ago
Graph-RHO: Critical-path-aware Heterogeneous Graph Network for Long-Horizon Flexible Job-Shop Scheduling
arXiv:2604.10073v1 Announce Type: cross Abstract: Long-horizon Flexible Job-Shop Scheduling (FJSP) presents a formidable combinatorial challenge due to complex,
DeepCamp AI