📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours

ArXiv cs.AI 📄 Paper 4d ago
Efficient Personalization of Generative User Interfaces
arXiv:2604.09876v1 Announce Type: cross Abstract: Generative user interfaces (UIs) create new opportunities to adapt interfaces to individual users on demand, b
ArXiv cs.AI 📄 Paper 4d ago
DINO_4D: Semantic-Aware 4D Reconstruction
arXiv:2604.09877v1 Announce Type: cross Abstract: In the intersection of computer vision and robotic perception, 4D reconstruction of dynamic scenes serves as th
ArXiv cs.AI 📄 Paper 4d ago
Not Your Stereo-Typical Estimator: Combining Vision and Language for Volume Perception
arXiv:2604.09886v1 Announce Type: cross Abstract: Accurate volume estimation of objects from visual data is a long-standing challenge in computer vision with si
ArXiv cs.AI 📄 Paper 4d ago
Should We be Pedantic About Reasoning Errors in Machine Translation?
arXiv:2604.09890v1 Announce Type: cross Abstract: Across multiple language pairings (English $\to$ \{Spanish, French, German, Mandarin, Japanese, Urdu, Cantones
ArXiv cs.AI 📄 Paper 4d ago
Diffusion Denoiser Achievable Analysis for Finite Blocklength Unsourced Random Access
arXiv:2604.09904v1 Announce Type: cross Abstract: Polyanskiy proposed a framework for the unsourced multiple access channel (MAC) problem where users employ a c
ArXiv cs.AI 📄 Paper 4d ago
From UAV Imagery to Agronomic Reasoning: A Multimodal LLM Benchmark for Plant Phenotyping
arXiv:2604.09907v1 Announce Type: cross Abstract: To improve crop genetics, high-throughput, effective and comprehensive phenotyping is a critical prerequisite.
ArXiv cs.AI 📄 Paper 4d ago
The Rise and Fall of $G$ in AGI
arXiv:2604.09911v1 Announce Type: cross Abstract: In the psychological literature the term 'general intelligence' describes correlations between abilities and n
ArXiv cs.AI 📄 Paper 4d ago
A Hybrid Intelligent Framework for Uncertainty-Aware Condition Monitoring of Industrial Systems
arXiv:2604.09932v1 Announce Type: cross Abstract: Hybrid approaches that combine data-driven learning with physics-based insight have shown promise for improvin
ArXiv cs.AI 📄 Paper 4d ago
I Walk the Line: Examining the Role of Gestalt Continuity in Object Binding for Vision Transformers
arXiv:2604.09942v1 Announce Type: cross Abstract: Object binding is a foundational process in visual cognition, during which low-level perceptual features are j
ArXiv cs.AI 📄 Paper 4d ago
Cross-Cultural Value Awareness in Large Vision-Language Models
arXiv:2604.09945v1 Announce Type: cross Abstract: The rapid adoption of large vision-language models (LVLMs) in recent years has been accompanied by growing fai
ArXiv cs.AI 📄 Paper 4d ago
Rebooting Microreboot: Architectural Support for Safe, Parallel Recovery in Microservice Systems
arXiv:2604.09963v1 Announce Type: cross Abstract: Microreboot enables fast recovery by restarting only the failing component, but in modern microservices naive
ArXiv cs.AI 📄 Paper 4d ago
Muon$^2$: Boosting Muon via Adaptive Second-Moment Preconditioning
arXiv:2604.09967v1 Announce Type: cross Abstract: Muon has emerged as a promising optimizer for large-scale foundation model pre-training by exploiting the matr
ArXiv cs.AI 📄 Paper 4d ago
A Minimal Model of Representation Collapse: Frustration, Stop-Gradient, and Dynamics
arXiv:2604.09979v1 Announce Type: cross Abstract: Self-supervised representation learning is central to modern machine learning because it extracts structured l
ArXiv cs.AI 📄 Paper 4d ago
FlowPalm: Optical Flow Driven Non-Rigid Deformation for Geometrically Diverse Palmprint Generation
arXiv:2604.09989v1 Announce Type: cross Abstract: Recently, synthetic palmprints have been increasingly used as substitutes for real data to train recognition m
ArXiv cs.AI 📄 Paper 4d ago
Agentic Application in Power Grid Static Analysis: Automatic Code Generation and Error Correction
arXiv:2604.09995v1 Announce Type: cross Abstract: This paper introduces an LLM agent that automates power grid static analysis by converting natural language in
ArXiv cs.AI 📄 Paper 4d ago
Like a Hammer, It Can Build, It Can Break: Large Language Model Uses, Perceptions, and Adoption in Cybersecurity Operations on Reddit
arXiv:2604.09998v1 Announce Type: cross Abstract: Large language models (LLMs) have recently emerged as promising tools for augmenting Security Operations Cente
ArXiv cs.AI 📄 Paper 4d ago
Demographic and Linguistic Bias Evaluation in Omnimodal Language Models
arXiv:2604.10014v1 Announce Type: cross Abstract: This paper provides a comprehensive evaluation of demographic and linguistic biases in omnimodal language mode
ArXiv cs.AI 📄 Paper 4d ago
FREE-Switch: Frequency-based Dynamic LoRA Switch for Style Transfer
arXiv:2604.10023v1 Announce Type: cross Abstract: With the growing availability of open-sourced adapters trained on the same diffusion backbone for diverse scen
ArXiv cs.AI 📄 Paper 4d ago
LVSum: A Benchmark for Timestamp-Aware Long Video Summarization
arXiv:2604.10024v1 Announce Type: cross Abstract: Long video summarization presents significant challenges for current multimodal large language models (MLLMs),
ArXiv cs.AI 📄 Paper 4d ago
CoSToM: Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models
arXiv:2604.10031v1 Announce Type: cross Abstract: Theory of Mind (ToM), the ability to attribute mental states to others, is a hallmark of social intelligence.