📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (13890)
ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsMedium · Programming
ArXiv cs.AI
📄 Paper
6d ago
PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing
arXiv:2604.09111v1 Announce Type: cross Abstract: Recently, artificial intelligence-based dubbing technology has advanced, enabling automated dubbing (AD) to co
ArXiv cs.AI
📄 Paper
6d ago
Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition
arXiv:2604.09121v1 Announce Type: cross Abstract: Recent years have witnessed remarkable progress in automatic speech recognition (ASR), driven by advances in m
ArXiv cs.AI
📄 Paper
6d ago
EquiformerV3: Scaling Efficient, Expressive, and General SE(3)-Equivariant Graph Attention Transformers
arXiv:2604.09130v1 Announce Type: cross Abstract: As $SE(3)$-equivariant graph neural networks mature as a core tool for 3D atomistic modeling, improving their
ArXiv cs.AI
📄 Paper
6d ago
CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation
arXiv:2604.09155v1 Announce Type: cross Abstract: Graphical user interface (GUI) agents powered by vision language models (VLMs) are rapidly moving from passive
ArXiv cs.AI
📄 Paper
6d ago
Structuring versus Problematizing: How LLM-based Agents Scaffold Learning in Diagnostic Reasoning
arXiv:2604.09158v1 Announce Type: cross Abstract: Supporting students in developing diagnostic reasoning is a key challenge across educational domains. Novices
ArXiv cs.AI
📄 Paper
6d ago
Persona-E$^2$: A Human-Grounded Dataset for Personality-Shaped Emotional Responses to Textual Events
arXiv:2604.09162v1 Announce Type: cross Abstract: Most affective computing research treats emotion as a static property of text, focusing on the writer's sentim
ArXiv cs.AI
📄 Paper
6d ago
Generalization and Scaling Laws for Mixture-of-Experts Transformers
arXiv:2604.09175v1 Announce Type: cross Abstract: We develop a theory of generalization and scaling for Mixture-of-Experts (MoE) Transformers that cleanly separ
ArXiv cs.AI
📄 Paper
6d ago
Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies
arXiv:2604.09189v1 Announce Type: cross Abstract: LLMs internalize safety policies through RLHF, yet these policies are never formally specified and remain diff
ArXiv cs.AI
📄 Paper
6d ago
Vision Transformers for Preoperative CT-Based Prediction of Histopathologic Chemotherapy Response Score in High-Grade Serous Ovarian Carcinoma
arXiv:2604.09197v1 Announce Type: cross Abstract: Purpose. High-grade serous ovarian carcinoma (HGSOC) is characterized by pronounced biological and spatial het
ArXiv cs.AI
📄 Paper
6d ago
Artificial intelligence can persuade people to take political actions
arXiv:2604.09200v1 Announce Type: cross Abstract: There is substantial concern about the ability of advanced artificial intelligence to influence people's behav
ArXiv cs.AI
📄 Paper
6d ago
On the Role of DAG topology in Energy-Aware Cloud Scheduling : A GNN-Based Deep Reinforcement Learning Approach
arXiv:2604.09202v1 Announce Type: cross Abstract: Cloud providers must assign heterogeneous compute resources to workflow DAGs while balancing competing objecti
ArXiv cs.AI
📄 Paper
6d ago
GRM: Utility-Aware Jailbreak Attacks on Audio LLMs via Gradient-Ratio Masking
arXiv:2604.09222v1 Announce Type: cross Abstract: Audio large language models (ALLMs) enable rich speech-text interaction, but they also introduce jailbreak vul
ArXiv cs.AI
📄 Paper
6d ago
The Fast Lane Hypothesis: Von Economo Neurons Implement a Biological Speed-Accuracy Tradeoff
arXiv:2604.09229v1 Announce Type: cross Abstract: Von Economo neurons (VENs) are large bipolar projection neurons found exclusively in the anterior cingulate co
ArXiv cs.AI
📄 Paper
6d ago
Neural Distribution Prior for LiDAR Out-of-Distribution Detection
arXiv:2604.09232v1 Announce Type: cross Abstract: LiDAR-based perception is critical for autonomous driving due to its robustness to poor lighting and visibilit
ArXiv cs.AI
📄 Paper
6d ago
Statistical Properties of the King Wen Sequence: An Anti-Habituation Structure That Does Not Improve Neural Network Training
arXiv:2604.09234v1 Announce Type: cross Abstract: The King Wen sequence of the I-Ching (c. 1000 BC) orders 64 hexagrams -- states of a six-dimensional binary sp
ArXiv cs.AI
📄 Paper
6d ago
DDSP-QbE++: Improving Speech Quality for Speech Anonymisation for Atypical Speech
arXiv:2604.09246v1 Announce Type: cross Abstract: Differentiable Digital Signal Processing (DDSP) pipelines for voice conversion rely on subtractive synthesis,
ArXiv cs.AI
📄 Paper
6d ago
Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization
arXiv:2604.09253v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) are powerful but remain vulnerable to multimodal jailbreak attacks. Existing att
ArXiv cs.AI
📄 Paper
6d ago
SkillMOO: Multi-Objective Optimization of Agent Skills for Software Engineering
arXiv:2604.09297v1 Announce Type: cross Abstract: Agent skills provide modular, task-specific guidance for LLM- based coding agents, but manually tuning skill b
ArXiv cs.AI
📄 Paper
6d ago
SatQNet: Satellite-assisted Quantum Network Entanglement Routing Using Directed Line Graph Neural Networks
arXiv:2604.09306v1 Announce Type: cross Abstract: Quantum networks are expected to become a key enabler for interconnecting quantum devices. In contrast to clas
ArXiv cs.AI
📄 Paper
6d ago
Visually-Guided Policy Optimization for Multimodal Reasoning
arXiv:2604.09349v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly advanced the reasoning ability of visi
ArXiv cs.AI
📄 Paper
6d ago
LLM-Rosetta: A Hub-and-Spoke Intermediate Representation for Cross-Provider LLM API Translation
arXiv:2604.09360v1 Announce Type: cross Abstract: The rapid proliferation of Large Language Model (LLM) providers--each exposing proprietary API formats--has cr
ArXiv cs.AI
📄 Paper
6d ago
BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning
arXiv:2604.09378v1 Announce Type: cross Abstract: Agent ecosystems increasingly rely on installable skills to extend functionality, and some skills bundle learn
ArXiv cs.AI
📄 Paper
6d ago
The AI Codebase Maturity Model: From Assisted Coding to Self-Sustaining Systems
arXiv:2604.09388v1 Announce Type: cross Abstract: AI coding tools are widely adopted, but most teams plateau at prompt-and-review without a framework for system
ArXiv cs.AI
📄 Paper
6d ago
Yes, But Not Always. Generative AI Needs Nuanced Opt-in
arXiv:2604.09413v1 Announce Type: cross Abstract: This paper argues that a one-size-fits-all approach to specifying consent for the use of creative works in gen
DeepCamp AI