📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 4,506 articles · Updated every 3 hours
ArXiv cs.AI
📄 Paper
1d ago
Sanity Checks for Agentic Data Science
arXiv:2604.11003v1 Announce Type: new Abstract: Agentic data science (ADS) pipelines have grown rapidly in both capability and adoption, with systems such as Op
ArXiv cs.AI
📄 Paper
1d ago
Diffusion-CAM: Faithful Visual Explanations for dMLLMs
arXiv:2604.11005v1 Announce Type: new Abstract: While diffusion Multimodal Large Language Models (dMLLMs) have recently achieved remarkable strides in multimoda
ArXiv cs.AI
📄 Paper
1d ago
Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics
arXiv:2604.11012v1 Announce Type: new Abstract: The quality of text generated by large language models depends critically on the decoding sampling strategy. Whi
ArXiv cs.AI
📄 Paper
1d ago
Introspective Diffusion Language Models
arXiv:2604.11035v1 Announce Type: new Abstract: Diffusion language models promise parallel generation, yet still lag behind autoregressive (AR) models in qualit
ArXiv cs.AI
📄 Paper
1d ago
Intelligent Approval of Access Control Flow in Office Automation Systems via Relational Modeling
arXiv:2604.11040v1 Announce Type: new Abstract: Office automation (OA) systems play a crucial role in enterprise operations and management, with access control
ArXiv cs.AI
📄 Paper
1d ago
From Topology to Trajectory: LLM-Driven World Models For Supply Chain Resilience
arXiv:2604.11041v1 Announce Type: new Abstract: Semiconductor supply chains face unprecedented resilience challenges amidst global geopolitical turbulence. Conv
ArXiv cs.AI
📄 Paper
1d ago
EmergentBridge: Improving Zero-Shot Cross-Modal Transfer in Unified Multimodal Embedding Models
arXiv:2604.11043v1 Announce Type: new Abstract: Unified multimodal embedding spaces underpin practical applications such as cross-modal retrieval and zero-shot
ArXiv cs.AI
📄 Paper
1d ago
AI Integrity: A New Paradigm for Verifiable AI Governance
arXiv:2604.11065v1 Announce Type: new Abstract: AI systems increasingly shape high-stakes decisions in healthcare, law, defense, and education, yet existing gov
ArXiv cs.AI
📄 Paper
1d ago
PRISM Risk Signal Framework: Hierarchy-Based Red Lines for AI Behavioral Risk
arXiv:2604.11070v1 Announce Type: new Abstract: Current approaches to AI safety define red lines at the case level: specific prompts, specific outputs, specific
ArXiv cs.AI
📄 Paper
1d ago
Hodoscope: Unsupervised Monitoring for AI Misbehaviors
arXiv:2604.11072v1 Announce Type: new Abstract: Existing approaches to monitoring AI agents rely on supervised evaluation: human-written rules or LLM-based judg
ArXiv cs.AI
📄 Paper
1d ago
Towards Proactive Information Probing: Customer Service Chatbots Harvesting Value from Conversation
arXiv:2604.11077v1 Announce Type: new Abstract: Customer service chatbots are increasingly expected to serve not merely as reactive support tools for users, but
ArXiv cs.AI
📄 Paper
1d ago
Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents
arXiv:2604.11088v1 Announce Type: new Abstract: Developers increasingly guide AI coding agents through natural language instruction files (e.g., CLAUDE.md, .cur
ArXiv cs.AI
📄 Paper
1d ago
Frugal Knowledge Graph Construction with Local LLMs: A Zero-Shot Pipeline, Self-Consistency and Wisdom of Artificial Crowds
arXiv:2604.11104v1 Announce Type: new Abstract: This paper presents an empirical study of a multi-model zero-shot pipeline for knowledge graph construction and
ArXiv cs.AI
📄 Paper
1d ago
Persona Non Grata: Single-Method Safety Evaluation Is Incomplete for Persona-Imbued LLMs
arXiv:2604.11120v1 Announce Type: new Abstract: Personality imbuing customizes LLM behavior, but safety evaluations almost always study prompt-based personas al
ArXiv cs.AI
📄 Paper
1d ago
A Proposed Biomedical Data Policy Framework to Reduce Fragmentation, Improve Quality, and Incentivize Sharing in Indian Healthcare in the era of Artificial Intelligence and Digital Health
arXiv:2604.11125v1 Announce Type: new Abstract: India generates vast biomedical data through postgraduate research, government hospital services and audits, gov
ArXiv cs.AI
📄 Paper
1d ago
MADQRL: Distributed Quantum Reinforcement Learning Framework for Multi-Agent Environments
arXiv:2604.11131v1 Announce Type: new Abstract: Reinforcement learning (RL) is one of the most practical ways to learn from real-life use-cases. Motivated from
ArXiv cs.AI
📄 Paper
1d ago
From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning
arXiv:2604.11137v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) into clinical decision support is critically obstructed by their
ArXiv cs.AI
📄 Paper
1d ago
Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model
arXiv:2604.11154v1 Announce Type: new Abstract: New multi-modal large language models (MLLMs) are continuously being trained and deployed, following rapid devel
ArXiv cs.AI
📄 Paper
1d ago
Measuring the Authority Stack of AI Systems: Empirical Analysis of 366,120 Forced-Choice Responses Across 8 AI Models
arXiv:2604.11216v1 Announce Type: new Abstract: What values, evidence preferences, and source trust hierarchies do AI systems actually exhibit when facing struc
ArXiv cs.AI
📄 Paper
1d ago
Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization
arXiv:2604.11259v1 Announce Type: new Abstract: Mobile GUI agents powered by Multimodal Large Language Models (MLLMs) can execute complex tasks on mobile device
ArXiv cs.AI
📄 Paper
1d ago
Inspectable AI for Science: A Research Object Approach to Generative AI Governance
arXiv:2604.11261v1 Announce Type: new Abstract: This paper introduces AI as a Research Object (AI-RO), a paradigm for governing the use of generative AI in scie
ArXiv cs.AI
📄 Paper
1d ago
Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Using a Large Language Model
arXiv:2604.11287v1 Announce Type: new Abstract: Background: Large language models (LLMs) have been explored as tools for generating personalized exercise prescr
ArXiv cs.AI
📄 Paper
1d ago
BankerToolBench: Evaluating AI Agents in End-to-End Investment Banking Workflows
arXiv:2604.11304v1 Announce Type: new Abstract: Existing AI benchmarks lack the fidelity to assess economically meaningful progress on professional workflows. T
ArXiv cs.AI
📄 Paper
1d ago
PaperScope: A Multi-Modal Multi-Document Benchmark for Agentic Deep Research Across Massive Scientific Papers
arXiv:2604.11307v1 Announce Type: new Abstract: Leveraging Multi-modal Large Language Models (MLLMs) to accelerate frontier scientific research is promising, ye