📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours


PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing

arXiv:2604.09111v1 Announce Type: cross Abstract: Recently, artificial intelligence-based dubbing technology has advanced, enabling automated dubbing (AD) to co

ArXiv cs.AI 📄 Paper 6d ago

Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition

arXiv:2604.09121v1 Announce Type: cross Abstract: Recent years have witnessed remarkable progress in automatic speech recognition (ASR), driven by advances in m

ArXiv cs.AI 📄 Paper 6d ago

EquiformerV3: Scaling Efficient, Expressive, and General SE(3)-Equivariant Graph Attention Transformers

arXiv:2604.09130v1 Announce Type: cross Abstract: As $SE(3)$-equivariant graph neural networks mature as a core tool for 3D atomistic modeling, improving their

ArXiv cs.AI 📄 Paper 6d ago

CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation

arXiv:2604.09155v1 Announce Type: cross Abstract: Graphical user interface (GUI) agents powered by vision language models (VLMs) are rapidly moving from passive

ArXiv cs.AI 📄 Paper 6d ago

Structuring versus Problematizing: How LLM-based Agents Scaffold Learning in Diagnostic Reasoning

arXiv:2604.09158v1 Announce Type: cross Abstract: Supporting students in developing diagnostic reasoning is a key challenge across educational domains. Novices

ArXiv cs.AI 📄 Paper 6d ago

Persona-E$^2$: A Human-Grounded Dataset for Personality-Shaped Emotional Responses to Textual Events

arXiv:2604.09162v1 Announce Type: cross Abstract: Most affective computing research treats emotion as a static property of text, focusing on the writer's sentim

ArXiv cs.AI 📄 Paper 6d ago

Generalization and Scaling Laws for Mixture-of-Experts Transformers

arXiv:2604.09175v1 Announce Type: cross Abstract: We develop a theory of generalization and scaling for Mixture-of-Experts (MoE) Transformers that cleanly separ

ArXiv cs.AI 📄 Paper 6d ago

Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies

arXiv:2604.09189v1 Announce Type: cross Abstract: LLMs internalize safety policies through RLHF, yet these policies are never formally specified and remain diff

ArXiv cs.AI 📄 Paper 6d ago

Vision Transformers for Preoperative CT-Based Prediction of Histopathologic Chemotherapy Response Score in High-Grade Serous Ovarian Carcinoma

arXiv:2604.09197v1 Announce Type: cross Abstract: Purpose. High-grade serous ovarian carcinoma (HGSOC) is characterized by pronounced biological and spatial het

ArXiv cs.AI 📄 Paper 6d ago

Artificial intelligence can persuade people to take political actions

arXiv:2604.09200v1 Announce Type: cross Abstract: There is substantial concern about the ability of advanced artificial intelligence to influence people's behav

ArXiv cs.AI 📄 Paper 6d ago

On the Role of DAG topology in Energy-Aware Cloud Scheduling: A GNN-Based Deep Reinforcement Learning Approach

arXiv:2604.09202v1 Announce Type: cross Abstract: Cloud providers must assign heterogeneous compute resources to workflow DAGs while balancing competing objecti

ArXiv cs.AI 📄 Paper 6d ago

GRM: Utility-Aware Jailbreak Attacks on Audio LLMs via Gradient-Ratio Masking

arXiv:2604.09222v1 Announce Type: cross Abstract: Audio large language models (ALLMs) enable rich speech-text interaction, but they also introduce jailbreak vul

ArXiv cs.AI 📄 Paper 6d ago

The Fast Lane Hypothesis: Von Economo Neurons Implement a Biological Speed-Accuracy Tradeoff

arXiv:2604.09229v1 Announce Type: cross Abstract: Von Economo neurons (VENs) are large bipolar projection neurons found exclusively in the anterior cingulate co

ArXiv cs.AI 📄 Paper 6d ago

Neural Distribution Prior for LiDAR Out-of-Distribution Detection

arXiv:2604.09232v1 Announce Type: cross Abstract: LiDAR-based perception is critical for autonomous driving due to its robustness to poor lighting and visibilit

ArXiv cs.AI 📄 Paper 6d ago

Statistical Properties of the King Wen Sequence: An Anti-Habituation Structure That Does Not Improve Neural Network Training

arXiv:2604.09234v1 Announce Type: cross Abstract: The King Wen sequence of the I-Ching (c. 1000 BC) orders 64 hexagrams -- states of a six-dimensional binary sp

ArXiv cs.AI 📄 Paper 6d ago

DDSP-QbE++: Improving Speech Quality for Speech Anonymisation for Atypical Speech

arXiv:2604.09246v1 Announce Type: cross Abstract: Differentiable Digital Signal Processing (DDSP) pipelines for voice conversion rely on subtractive synthesis,

ArXiv cs.AI 📄 Paper 6d ago

Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization

arXiv:2604.09253v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) are powerful but remain vulnerable to multimodal jailbreak attacks. Existing att

ArXiv cs.AI 📄 Paper 6d ago

SkillMOO: Multi-Objective Optimization of Agent Skills for Software Engineering

arXiv:2604.09297v1 Announce Type: cross Abstract: Agent skills provide modular, task-specific guidance for LLM-based coding agents, but manually tuning skill b

ArXiv cs.AI 📄 Paper 6d ago

SatQNet: Satellite-assisted Quantum Network Entanglement Routing Using Directed Line Graph Neural Networks

arXiv:2604.09306v1 Announce Type: cross Abstract: Quantum networks are expected to become a key enabler for interconnecting quantum devices. In contrast to clas

ArXiv cs.AI 📄 Paper 6d ago

Visually-Guided Policy Optimization for Multimodal Reasoning

arXiv:2604.09349v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly advanced the reasoning ability of visi

ArXiv cs.AI 📄 Paper 6d ago

LLM-Rosetta: A Hub-and-Spoke Intermediate Representation for Cross-Provider LLM API Translation

arXiv:2604.09360v1 Announce Type: cross Abstract: The rapid proliferation of Large Language Model (LLM) providers--each exposing proprietary API formats--has cr

ArXiv cs.AI 📄 Paper 6d ago

BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning

arXiv:2604.09378v1 Announce Type: cross Abstract: Agent ecosystems increasingly rely on installable skills to extend functionality, and some skills bundle learn

ArXiv cs.AI 📄 Paper 6d ago

The AI Codebase Maturity Model: From Assisted Coding to Self-Sustaining Systems

arXiv:2604.09388v1 Announce Type: cross Abstract: AI coding tools are widely adopted, but most teams plateau at prompt-and-review without a framework for system

ArXiv cs.AI 📄 Paper 6d ago

Yes, But Not Always. Generative AI Needs Nuanced Opt-in

arXiv:2604.09413v1 Announce Type: cross Abstract: This paper argues that a one-size-fits-all approach to specifying consent for the use of creative works in gen