📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (16470) ArXiv cs.AI Dev.to AI Dev.to · FORUM WEB Forbes Innovation Medium · Programming Medium · AI

AI Integrity: A New Paradigm for Verifiable AI Governance

arXiv:2604.11065v1 Announce Type: new Abstract: AI systems increasingly shape high-stakes decisions in healthcare, law, defense, and education, yet existing gov

ArXiv cs.AI 📄 Paper 1w ago

PRISM Risk Signal Framework: Hierarchy-Based Red Lines for AI Behavioral Risk

arXiv:2604.11070v1 Announce Type: new Abstract: Current approaches to AI safety define red lines at the case level: specific prompts, specific outputs, specific

ArXiv cs.AI 📄 Paper 1w ago

Hodoscope: Unsupervised Monitoring for AI Misbehaviors

arXiv:2604.11072v1 Announce Type: new Abstract: Existing approaches to monitoring AI agents rely on supervised evaluation: human-written rules or LLM-based judg

ArXiv cs.AI 📄 Paper 1w ago

Towards Proactive Information Probing: Customer Service Chatbots Harvesting Value from Conversation

arXiv:2604.11077v1 Announce Type: new Abstract: Customer service chatbots are increasingly expected to serve not merely as reactive support tools for users, but

ArXiv cs.AI 📄 Paper 1w ago

Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents

arXiv:2604.11088v1 Announce Type: new Abstract: Developers increasingly guide AI coding agents through natural language instruction files (e.g., CLAUDE.md, .cur

ArXiv cs.AI 📄 Paper 1w ago

Frugal Knowledge Graph Construction with Local LLMs: A Zero-Shot Pipeline, Self-Consistency and Wisdom of Artificial Crowds

arXiv:2604.11104v1 Announce Type: new Abstract: This paper presents an empirical study of a multi-model zero-shot pipeline for knowledge graph construction and

ArXiv cs.AI 📄 Paper 1w ago

Persona Non Grata: Single-Method Safety Evaluation Is Incomplete for Persona-Imbued LLMs

arXiv:2604.11120v1 Announce Type: new Abstract: Personality imbuing customizes LLM behavior, but safety evaluations almost always study prompt-based personas al

ArXiv cs.AI 📄 Paper 1w ago

A Proposed Biomedical Data Policy Framework to Reduce Fragmentation, Improve Quality, and Incentivize Sharing in Indian Healthcare in the era of Artificial Intelligence and Digital Health

arXiv:2604.11125v1 Announce Type: new Abstract: India generates vast biomedical data through postgraduate research, government hospital services and audits, gov

ArXiv cs.AI 📄 Paper 1w ago

MADQRL: Distributed Quantum Reinforcement Learning Framework for Multi-Agent Environments

arXiv:2604.11131v1 Announce Type: new Abstract: Reinforcement learning (RL) is one of the most practical ways to learn from real-life use-cases. Motivated from

ArXiv cs.AI 📄 Paper 1w ago

From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning

arXiv:2604.11137v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) into clinical decision support is critically obstructed by their

ArXiv cs.AI 📄 Paper 1w ago

Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model

arXiv:2604.11154v1 Announce Type: new Abstract: New multi-modal large language models (MLLMs) are continuously being trained and deployed, following rapid devel

ArXiv cs.AI 📄 Paper 1w ago

Measuring the Authority Stack of AI Systems: Empirical Analysis of 366,120 Forced-Choice Responses Across 8 AI Models

arXiv:2604.11216v1 Announce Type: new Abstract: What values, evidence preferences, and source trust hierarchies do AI systems actually exhibit when facing struc

ArXiv cs.AI 📄 Paper 1w ago

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

arXiv:2604.11259v1 Announce Type: new Abstract: Mobile GUI agents powered by Multimodal Large Language Models (MLLMs) can execute complex tasks on mobile device

ArXiv cs.AI 📄 Paper 1w ago

Inspectable AI for Science: A Research Object Approach to Generative AI Governance

arXiv:2604.11261v1 Announce Type: new Abstract: This paper introduces AI as a Research Object (AI-RO), a paradigm for governing the use of generative AI in scie

ArXiv cs.AI 📄 Paper 1w ago

Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Using a Large Language Model

arXiv:2604.11287v1 Announce Type: new Abstract: Background: Large language models (LLMs) have been explored as tools for generating personalized exercise prescr

ArXiv cs.AI 📄 Paper 1w ago

BankerToolBench: Evaluating AI Agents in End-to-End Investment Banking Workflows

arXiv:2604.11304v1 Announce Type: new Abstract: Existing AI benchmarks lack the fidelity to assess economically meaningful progress on professional workflows. T

ArXiv cs.AI 📄 Paper 1w ago

PaperScope: A Multi-Modal Multi-Document Benchmark for Agentic Deep Research Across Massive Scientific Papers

arXiv:2604.11307v1 Announce Type: new Abstract: Leveraging Multi-modal Large Language Models (MLLMs) to accelerate frontier scientific research is promising, ye

ArXiv cs.AI 📄 Paper 1w ago

Select Smarter, Not More: Prompt-Aware Evaluation Scheduling with Submodular Guarantees

arXiv:2604.11328v1 Announce Type: new Abstract: Automatic prompt optimization (APO) hinges on the quality of its evaluation signal, yet scoring every prompt can

ArXiv cs.AI 📄 Paper 1w ago

Dynamic Summary Generation for Interpretable Multimodal Depression Detection

arXiv:2604.11334v1 Announce Type: new Abstract: Depression remains widely underdiagnosed and undertreated because stigma and subjective symptom ratings hinder r

ArXiv cs.AI 📄 Paper 1w ago

CoRe-ECG: Advancing Self-Supervised Representation Learning for 12-Lead ECG via Contrastive and Reconstructive Synergy

arXiv:2604.11359v1 Announce Type: new Abstract: Accurate interpretation of electrocardiogram (ECG) remains challenging due to the scarcity of labeled data and t

ArXiv cs.AI 📄 Paper 1w ago

The Missing Knowledge Layer in Cognitive Architectures for AI Agents

arXiv:2604.11364v1 Announce Type: new Abstract: The two most influential cognitive architecture frameworks for AI agents, CoALA [21] and JEPA [12], both lack an

ArXiv cs.AI 📄 Paper 1w ago

Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories

arXiv:2604.11365v1 Announce Type: new Abstract: Monte Carlo Tree Search (MCTS) has been widely used for automated reasoning data exploration, but current superv

ArXiv cs.AI 📄 Paper 1w ago

From Agent Loops to Structured Graphs:A Scheduler-Theoretic Framework for LLM Agent Execution

arXiv:2604.11378v1 Announce Type: new Abstract: The dominant paradigm for building LLM based agents is the Agent Loop, an iterative cycle where a single languag

ArXiv cs.AI 📄 Paper 1w ago

Beyond RAG for Cyber Threat Intelligence: A Systematic Evaluation of Graph-Based and Agentic Retrieval

arXiv:2604.11419v1 Announce Type: new Abstract: Cyber threat intelligence (CTI) analysts must answer complex questions over large collections of narrative secur