📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,272 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (9679) ArXiv cs.AI Dev.to · FORUM WEB Forbes Innovation Dev.to AI OpenAI News Hugging Face Blog

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

TRACE: Capability-Targeted Agentic Training

arXiv:2604.05336v1 Announce Type: new Abstract: Large Language Models (LLMs) deployed in agentic environments must exercise multiple capabilities across differe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Dynamic Agentic AI Expert Profiler System Architecture for Multidomain Intelligence Modeling

arXiv:2604.05345v1 Announce Type: new Abstract: In today's artificial intelligence driven world, modern systems communicate with people from diverse backgrounds

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

From Retinal Evidence to Safe Decisions: RETINA-SAFE and ECRT for Hallucination Risk Triage in Medical LLMs

arXiv:2604.05348v1 Announce Type: new Abstract: Hallucinations in medical large language models (LLMs) remain a safety-critical issue, particularly when availab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

ETR: Entropy Trend Reward for Efficient Chain-of-Thought Reasoning

arXiv:2604.05355v1 Announce Type: new Abstract: Chain-of-thought (CoT) reasoning improves large language model performance on complex tasks, but often produces

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

arXiv:2604.05358v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) mitigates hallucination but does not eliminate it: a deployed system must s

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago

TFRBench: A Reasoning Benchmark for Evaluating Forecasting Systems

arXiv:2604.05364v1 Announce Type: new Abstract: We introduce TFRBench, the first benchmark designed to evaluate the reasoning capabilities of forecasting system

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

LLM-as-Judge for Semantic Judging of Powerline Segmentation in UAV Inspection

arXiv:2604.05371v1 Announce Type: new Abstract: The deployment of lightweight segmentation models on drones for autonomous power line inspection presents a crit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Towards Effective In-context Cross-domain Knowledge Transfer via Domain-invariant-neurons-based Retrieval

arXiv:2604.05383v1 Announce Type: new Abstract: Large language models (LLMs) have made notable progress in logical reasoning, yet still fall short of human-leve

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Neural Assistive Impulses: Synthesizing Exaggerated Motions for Physics-based Characters

arXiv:2604.05394v1 Announce Type: new Abstract: Physics-based character animation has become a fundamental approach for synthesizing realistic, physically plaus

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Reason Analogically via Cross-domain Prior Knowledge: An Empirical Study of Cross-domain Knowledge Transfer for In-Context Learning

arXiv:2604.05396v1 Announce Type: new Abstract: Despite its success, existing in-context learning (ICL) relies on in-domain expert demonstrations, limiting its

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

HYVE: Hybrid Views for LLM Context Engineering over Machine Data

arXiv:2604.05400v1 Announce Type: new Abstract: Machine data is central to observability and diagnosis in modern computing systems, appearing in logs, metrics,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

CODESTRUCT: Code Agents over Structured Action Spaces

arXiv:2604.05407v1 Announce Type: new Abstract: LLM-based code agents treat repositories as unstructured text, applying edits through brittle string matching th

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Multi-Agent Pathfinding with Non-Unit Integer Edge Costs via Enhanced Conflict-Based Search and Graph Discretization

arXiv:2604.05416v1 Announce Type: new Abstract: Multi-Agent Pathfinding (MAPF) plays a critical role in various domains. Traditional MAPF methods typically assu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection

arXiv:2604.05424v1 Announce Type: new Abstract: PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection Siyuan Cheng, Bozhong Tian, Yanch

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Automated Auditing of Hospital Discharge Summaries for Care Transitions

arXiv:2604.05435v1 Announce Type: new Abstract: Incomplete or inconsistent discharge documentation is a primary driver of care fragmentation and avoidable readm

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Adaptive Serverless Resource Management via Slot-Survival Prediction and Event-Driven Lifecycle Control

arXiv:2604.05465v1 Announce Type: new Abstract: Serverless computing eliminates infrastructure management overhead but introduces significant challenges regardi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

OntoTKGE: Ontology-Enhanced Temporal Knowledge Graph Extrapolation

arXiv:2604.05468v1 Announce Type: new Abstract: Temporal knowledge graph (TKG) extrapolation is an important task that aims to predict future facts through hist

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Can We Trust a Black-box LLM? LLM Untrustworthy Boundary Detection via Bias-Diffusion and Multi-Agent Reinforcement Learning

arXiv:2604.05483v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown a high capability in answering questions on a diverse range of topics. H

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Auditable Agents

arXiv:2604.05485v1 Announce Type: new Abstract: LLM agents call tools, query databases, delegate tasks, and trigger external side effects. Once an agent system

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

SCMAPR: Self-Correcting Multi-Agent Prompt Refinement for Complex-Scenario Text-to-Video Generation

arXiv:2604.05489v1 Announce Type: new Abstract: Text-to-Video (T2V) generation has benefited from recent advances in diffusion models, yet current systems still

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models

arXiv:2604.05497v1 Announce Type: new Abstract: Diffusion large language models (dLLMs) are emerging as promising alternatives to autoregressive (AR) LLMs. Rece

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 4d ago

OmniDiagram: Advancing Unified Diagram Code Generation via Visual Interrogation Reward

arXiv:2604.05514v1 Announce Type: new Abstract: The paradigm of programmable diagram generation is evolving rapidly, playing a crucial role in structured visual

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

UniCreative: Unifying Long-form Logic and Short-form Sparkle via Reference-Free Reinforcement Learning

arXiv:2604.05517v1 Announce Type: new Abstract: A fundamental challenge in creative writing lies in reconciling the inherent tension between maintaining global

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Market-Bench: Benchmarking Large Language Models on Economic and Trade Competition

arXiv:2604.05523v1 Announce Type: new Abstract: The ability of large language models (LLMs) to manage and acquire economic resources remains unclear. In this pa