📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 1,213 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
AI Mental Models: Learned Intuition and Deliberation in a Bounded Neural Architecture
arXiv:2603.22561v1 Announce Type: new Abstract: This paper asks whether a bounded neural architecture can exhibit a meaningful division of labor between intuiti
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length
arXiv:2603.22608v1 Announce Type: new Abstract: Users often rely on Large Language Models (LLMs) for processing multiple documents or performing analysis over a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning
arXiv:2603.22619v1 Announce Type: new Abstract: LLMs often generate seemingly valid answers to flawed or ill-posed inputs. This is not due to missing knowledge:
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature
arXiv:2603.22633v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems for biomedical literature are typically evaluated using ranking met
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Benchmarking Multi-Agent LLM Architectures for Financial Document Processing: A Comparative Study of Orchestration Patterns, Cost-Accuracy Tradeoffs and Production Scaling Strategies
arXiv:2603.22651v1 Announce Type: new Abstract: The adoption of large language models (LLMs) for structured information extraction from financial documents has
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
MuQ-Eval: An Open-Source Per-Sample Quality Metric for AI Music Generation Evaluation
arXiv:2603.22677v1 Announce Type: new Abstract: Distributional metrics such as Fr\'echet Audio Distance cannot score individual music clips and correlate poorly
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment
arXiv:2603.22721v1 Announce Type: new Abstract: Recent progress in artificial intelligence has encouraged numerous attempts to understand and decode human visua
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Beyond Binary Correctness: Scaling Evaluation of Long-Horizon Agents on Subjective Enterprise Tasks
arXiv:2603.22744v1 Announce Type: new Abstract: Large language models excel on objectively verifiable tasks such as math and programming, where evaluation reduc
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
CLiGNet: Clinical Label-Interaction Graph Network for Medical Specialty Classification from Clinical Transcriptions
arXiv:2603.22752v1 Announce Type: new Abstract: Automated classification of clinical transcriptions into medical specialties is essential for routing, coding, a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases
arXiv:2603.22767v1 Announce Type: new Abstract: Observational studies can yield clinically actionable evidence at scale, but executing them on real-world databa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model
arXiv:2603.22777v1 Announce Type: new Abstract: Agricultural pest management increasingly relies on timely and accurate access to expert knowledge, yet high qua
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
ABSTRAL: Automatic Design of Multi-Agent Systems Through Iterative Refinement and Topology Optimization
arXiv:2603.22791v1 Announce Type: new Abstract: How should multi-agent systems be designed, and can that design knowledge be captured in a form that is inspecta
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Reliable Classroom AI via Neuro-Symbolic Multimodal Reasoning
arXiv:2603.22793v1 Announce Type: new Abstract: Classroom AI is rapidly expanding from low-level perception toward higher-level judgments about engagement, conf
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts
arXiv:2603.22813v1 Announce Type: new Abstract: Humans often juggle multiple, sometimes conflicting objectives and shift their priorities as circumstances chang
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Empirical Comparison of Agent Communication Protocols for Task Orchestration
arXiv:2603.22823v1 Announce Type: new Abstract: Context. Nowadays, artificial intelligence agent systems are transforming from single-tool interactions to compl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Improving Safety Alignment via Balanced Direct Preference Optimization
arXiv:2603.22829v1 Announce Type: new Abstract: With the rapid development and widespread application of Large Language Models (LLMs), their potential safety ri
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
6d ago
PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal
arXiv:2603.22844v1 Announce Type: new Abstract: Surgical smoke severely degrades intraoperative video quality, obscuring anatomical structures and limiting surg
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models
arXiv:2603.22846v1 Announce Type: new Abstract: Embodied Visual Tracking (EVT), a core dynamic task in embodied intelligence, requires an agent to precisely fol
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories
arXiv:2603.22869v1 Announce Type: new Abstract: Large Language Models (LLMs) have become core cognitive components in modern artificial intelligence (AI) system
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Dynamical Systems Theory Behind a Hierarchical Reasoning Model
arXiv:2603.22871v1 Announce Type: new Abstract: Current large language models (LLMs) primarily rely on linear sequence generation and massive parameter counts,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Continuous Optimization for Satisfiability Modulo Theories on Linear Real Arithmetic
arXiv:2603.22877v1 Announce Type: new Abstract: Efficient solutions for satisfiability modulo theories (SMT) are integral in industrial applications such as har
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Separating Diagnosis from Control: Auditable Policy Adaptation in Agent-Based Simulations with LLM-Based Diagnostics
arXiv:2603.22904v1 Announce Type: new Abstract: Mitigating elderly loneliness requires policy interventions that achieve both adaptability and auditability. Exi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
ProGRank: Probe-Gradient Reranking to Defend Dense-Retriever RAG from Corpus Poisoning
arXiv:2603.22934v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) improves the reliability of large language model applications by grounding
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Ran Score: a LLM-based Evaluation Score for Radiology Report Generation
arXiv:2603.22935v1 Announce Type: new Abstract: Chest X-ray report generation and automated evaluation are limited by poor recognition of low-prevalence abnorma
DeepCamp AI