📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours

ArXiv cs.AI 📄 Paper 1w ago
Efficient Process Reward Modeling via Contrastive Mutual Information
arXiv:2604.10660v1 Announce Type: cross Abstract: Recent research has devoted considerable effort to verifying the intermediate reasoning steps of chain-of-thou
ArXiv cs.AI 📄 Paper 1w ago
DynamicsLLM: a Dynamic Analysis-based Tool for Generating Intelligent Execution Traces Using LLMs to Detect Android Behavioural Code Smells
arXiv:2604.10661v1 Announce Type: cross Abstract: Mobile apps have become essential of our daily lives, making code quality a critical concern for developers. B
ArXiv cs.AI 📄 Paper 1w ago
Learning and Enforcing Context-Sensitive Control for LLMs
arXiv:2604.10667v1 Announce Type: cross Abstract: Controlling the output of Large Language Models (LLMs) through context-sensitive constraints has emerged as a
ArXiv cs.AI 📄 Paper 1w ago
Skill-SD: Skill-Conditioned Self-Distillation for Multi-turn LLM Agents
arXiv:2604.10674v1 Announce Type: cross Abstract: Reinforcement learning (RL) has been widely used to train LLM agents for multi-turn interactive tasks, but its
ArXiv cs.AI 📄 Paper 1w ago
Critical-CoT: A Robust Defense Framework against Reasoning-Level Backdoor Attacks in Large Language Models
arXiv:2604.10681v1 Announce Type: cross Abstract: Large Language Models (LLMs), despite their impressive capabilities across domains, have been shown to be vuln
ArXiv cs.AI 📄 Paper 1w ago
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting
arXiv:2604.10688v1 Announce Type: cross Abstract: On-policy reinforcement learning has become the dominant paradigm for reasoning alignment in large language mo
ArXiv cs.AI 📄 Paper 1w ago
Bringing Value Models Back: Generative Critics for Value Modeling in LLM Reinforcement Learning
arXiv:2604.10701v1 Announce Type: cross Abstract: Credit assignment is a central challenge in reinforcement learning (RL). Classical actor-critic methods addres
ArXiv cs.AI 📄 Paper 1w ago
Architecture-Agnostic Modality-Isolated Gated Fusion for Robust Multi-Modal Prostate MRI Segmentation
arXiv:2604.10702v1 Announce Type: cross Abstract: Multi-parametric prostate MRI -- combining T2-weighted, apparent diffusion coefficient, and high b-value diffu
ArXiv cs.AI 📄 Paper 1w ago
Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing
arXiv:2604.10708v1 Announce Type: cross Abstract: Recent progress in multimodal models has spurred rapid advances in audio understanding, generation, and editin
ArXiv cs.AI 📄 Paper 1w ago
Detecting RAG Extraction Attack via Dual-Path Runtime Integrity Game
arXiv:2604.10717v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems augment large language models with external knowledge, yet introd
ArXiv cs.AI 📄 Paper 1w ago
Turning Generators into Retrievers: Unlocking MLLMs for Natural Language-Guided Geo-Localization
arXiv:2604.10721v1 Announce Type: cross Abstract: Natural-language Guided Cross-view Geo-localization (NGCG) aims to retrieve geo-tagged satellite imagery using
ArXiv cs.AI 📄 Paper 1w ago
Tail-Aware Information-Theoretic Generalization for RLHF and SGLD
arXiv:2604.10727v1 Announce Type: cross Abstract: Classical information-theoretic generalization bounds typically control the generalization gap through KL-base
ArXiv cs.AI 📄 Paper 1w ago
Perceived Importance of Cognitive Skills Among Computing Students in the Era of AI
arXiv:2604.10730v1 Announce Type: cross Abstract: The availability and increasing integration of generative AI tools have transformed computing education. While
ArXiv cs.AI 📄 Paper 1w ago
Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models
arXiv:2604.10733v1 Announce Type: cross Abstract: Large language models increasingly serve as conversational agents that adopt personas and role-play characters
ArXiv cs.AI 📄 Paper 1w ago
Deep-Reporter: Deep Research for Grounded Multimodal Long-Form Generation
arXiv:2604.10741v1 Announce Type: cross Abstract: Recent agentic search frameworks enable deep research via iterative planning and retrieval, reducing hallucina
ArXiv cs.AI 📄 Paper 1w ago
Generating Multiple-Choice Knowledge Questions with Interpretable Difficulty Estimation using Knowledge Graphs and Large Language Models
arXiv:2604.10748v1 Announce Type: cross Abstract: Generating multiple-choice questions (MCQs) with difficulty estimation remains challenging in automated MCQ-ge
ArXiv cs.AI 📄 Paper 1w ago
Prosociality by Coupling, Not Mere Observation: Homeostatic Sharing in an Inspectable Recurrent Artificial Life Agent
arXiv:2604.10760v1 Announce Type: cross Abstract: Artificial agents can be made to "help" for many reasons, including explicit social reward, hard-coded prosoci
ArXiv cs.AI 📄 Paper 1w ago
Lung Cancer Detection Using Deep Learning
arXiv:2604.10765v1 Announce Type: cross Abstract: Lung cancer, the second leading cause of cancer-related deaths, is primarily linked to long-term tobacco smoki
ArXiv cs.AI 📄 Paper 1w ago
Do BERT Embeddings Encode Narrative Dimensions? A Token-Level Probing Analysis of Time, Space, Causality, and Character in Fiction
arXiv:2604.10786v1 Announce Type: cross Abstract: Narrative understanding requires multidimensional semantic structures. This study investigates whether BERT em
ArXiv cs.AI 📄 Paper 1w ago
TInR: Exploring Tool-Internalized Reasoning in Large Language Models
arXiv:2604.10788v1 Announce Type: cross Abstract: Tool-Integrated Reasoning (TIR) has emerged as a promising direction by extending Large Language Models' (LLMs