3,272 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,272 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (9679) ArXiv cs.AIDev.to · FORUM WEBForbes InnovationDev.to AIOpenAI NewsHugging Face Blog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment
arXiv:2604.05358v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) mitigates hallucination but does not eliminate it: a deployed system must s
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago
TFRBench: A Reasoning Benchmark for Evaluating Forecasting Systems
arXiv:2604.05364v1 Announce Type: new Abstract: We introduce TFRBench, the first benchmark designed to evaluate the reasoning capabilities of forecasting system
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
LLM-as-Judge for Semantic Judging of Powerline Segmentation in UAV Inspection
arXiv:2604.05371v1 Announce Type: new Abstract: The deployment of lightweight segmentation models on drones for autonomous power line inspection presents a crit
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Towards Effective In-context Cross-domain Knowledge Transfer via Domain-invariant-neurons-based Retrieval
arXiv:2604.05383v1 Announce Type: new Abstract: Large language models (LLMs) have made notable progress in logical reasoning, yet still fall short of human-leve
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Neural Assistive Impulses: Synthesizing Exaggerated Motions for Physics-based Characters
arXiv:2604.05394v1 Announce Type: new Abstract: Physics-based character animation has become a fundamental approach for synthesizing realistic, physically plaus
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Reason Analogically via Cross-domain Prior Knowledge: An Empirical Study of Cross-domain Knowledge Transfer for In-Context Learning
arXiv:2604.05396v1 Announce Type: new Abstract: Despite its success, existing in-context learning (ICL) relies on in-domain expert demonstrations, limiting its
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
HYVE: Hybrid Views for LLM Context Engineering over Machine Data
arXiv:2604.05400v1 Announce Type: new Abstract: Machine data is central to observability and diagnosis in modern computing systems, appearing in logs, metrics,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
CODESTRUCT: Code Agents over Structured Action Spaces
arXiv:2604.05407v1 Announce Type: new Abstract: LLM-based code agents treat repositories as unstructured text, applying edits through brittle string matching th
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Multi-Agent Pathfinding with Non-Unit Integer Edge Costs via Enhanced Conflict-Based Search and Graph Discretization
arXiv:2604.05416v1 Announce Type: new Abstract: Multi-Agent Pathfinding (MAPF) plays a critical role in various domains. Traditional MAPF methods typically assu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection
arXiv:2604.05424v1 Announce Type: new Abstract: PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection Siyuan Cheng, Bozhong Tian, Yanch
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Automated Auditing of Hospital Discharge Summaries for Care Transitions
arXiv:2604.05435v1 Announce Type: new Abstract: Incomplete or inconsistent discharge documentation is a primary driver of care fragmentation and avoidable readm
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Adaptive Serverless Resource Management via Slot-Survival Prediction and Event-Driven Lifecycle Control
arXiv:2604.05465v1 Announce Type: new Abstract: Serverless computing eliminates infrastructure management overhead but introduces significant challenges regardi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
OntoTKGE: Ontology-Enhanced Temporal Knowledge Graph Extrapolation
arXiv:2604.05468v1 Announce Type: new Abstract: Temporal knowledge graph (TKG) extrapolation is an important task that aims to predict future facts through hist
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Can We Trust a Black-box LLM? LLM Untrustworthy Boundary Detection via Bias-Diffusion and Multi-Agent Reinforcement Learning
arXiv:2604.05483v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown a high capability in answering questions on a diverse range of topics. H
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Auditable Agents
arXiv:2604.05485v1 Announce Type: new Abstract: LLM agents call tools, query databases, delegate tasks, and trigger external side effects. Once an agent system
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
SCMAPR: Self-Correcting Multi-Agent Prompt Refinement for Complex-Scenario Text-to-Video Generation
arXiv:2604.05489v1 Announce Type: new Abstract: Text-to-Video (T2V) generation has benefited from recent advances in diffusion models, yet current systems still
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models
arXiv:2604.05497v1 Announce Type: new Abstract: Diffusion large language models (dLLMs) are emerging as promising alternatives to autoregressive (AR) LLMs. Rece
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 4d ago
OmniDiagram: Advancing Unified Diagram Code Generation via Visual Interrogation Reward
arXiv:2604.05514v1 Announce Type: new Abstract: The paradigm of programmable diagram generation is evolving rapidly, playing a crucial role in structured visual
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
UniCreative: Unifying Long-form Logic and Short-form Sparkle via Reference-Free Reinforcement Learning
arXiv:2604.05517v1 Announce Type: new Abstract: A fundamental challenge in creative writing lies in reconciling the inherent tension between maintaining global
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Market-Bench: Benchmarking Large Language Models on Economic and Trade Competition
arXiv:2604.05523v1 Announce Type: new Abstract: The ability of large language models (LLMs) to manage and acquire economic resources remains unclear. In this pa