📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,241 articles · Updated every 3 hours
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference
arXiv:2604.05887v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have advanced unified reasoning over text, images, and videos, but thei
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Context-Value-Action Architecture for Value-Driven Large Language Model Agents
arXiv:2604.05939v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown promise in simulating human behavior, yet existing agents often exhibit
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning
arXiv:2604.05943v1 Announce Type: new Abstract: Recent advances in multi-agent reinforcement learning (MARL) have demonstrated success in numerous challenging d
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Towards Trustworthy Report Generation: A Deep Research Agent with Progressive Confidence Estimation and Calibration
arXiv:2604.05952v1 Announce Type: new Abstract: As agent-based systems continue to evolve, deep research agents are capable of automatically generating research
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Beyond Compromise: Pareto-Lenient Consensus for Efficient Multi-Preference LLM Alignment
arXiv:2604.05965v1 Announce Type: new Abstract: Transcending the single-preference paradigm, aligning LLMs with diverse human values is pivotal for robust deplo
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Flowr -- Scaling Up Retail Supply Chain Operations Through Agentic AI in Large Scale Supermarket Chains
arXiv:2604.05987v1 Announce Type: new Abstract: Retail supply chain operations in supermarket chains involve continuous, high-volume manual workflows spanning d
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Epistemic Blinding: An Inference-Time Protocol for Auditing Prior Contamination in LLM-Assisted Analysis
arXiv:2604.06013v1 Announce Type: new Abstract: This paper presents epistemic blinding in the context of an agentic system that uses large language models to re
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
How LLMs Follow Instructions: Skillful Coordination, Not a Universal Mechanism
arXiv:2604.06015v1 Announce Type: new Abstract: Instruction tuning is commonly assumed to endow language models with a domain-general ability to follow instruct
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Artificial Intelligence and the Structure of Mathematics
arXiv:2604.06107v1 Announce Type: new Abstract: Recent progress in artificial intelligence (AI) is unlocking transformative capabilities for mathematics. There
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
ACE-Bench: Agent Configurable Evaluation with Scalable Horizons and Controllable Difficulty under Lightweight Environments
arXiv:2604.06111v1 Announce Type: new Abstract: Existing Agent benchmarks suffer from two critical limitations: high environment interaction overhead (up to 41\
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents
arXiv:2604.06132v1 Announce Type: new Abstract: Large language models are increasingly deployed as autonomous agents executing multi-step workflows in real-worl
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
Contextuality as an External Bookkeeping Cost under Fixed Shared-State Semantics
arXiv:2601.20167v2 Announce Type: cross Abstract: Contextuality is a central feature distinguishing quantum from classical probability theories, but its operati
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems
arXiv:2604.04936v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems critically depend on effective document chunking strategies to ba
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
TDA-RC: Task-Driven Alignment for Knowledge-Based Reasoning Chains in Large Language Models
arXiv:2604.04942v1 Announce Type: cross Abstract: Enhancing the reasoning capability of large language models (LLMs) remains a core challenge in natural languag
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The Illusion of Latent Generalization: Bi-directionality and the Reversal Curse
arXiv:2604.04943v1 Announce Type: cross Abstract: The reversal curse describes a failure of autoregressive language models to retrieve a fact in reverse order (
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Inclusion-of-Thoughts: Mitigating Preference Instability via Purifying the Decision Space
arXiv:2604.04944v1 Announce Type: cross Abstract: Multiple-choice questions (MCQs) are widely used to evaluate large language models (LLMs). However, LLMs remai
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs
arXiv:2604.04947v1 Announce Type: cross Abstract: With the rapid proliferation of online sports journalism, extracting meaningful pre-game and post-game insight
ArXiv cs.AI
🔍 RAG & Vector Search
📄 Paper
⚡ AI Lesson
1w ago
From PDF to RAG-Ready: Evaluating Document Conversion Frameworks for Domain-Specific Question Answering
arXiv:2604.04948v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems depend critically on the quality of document preprocessing, yet n
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Learning to Retrieve from Agent Trajectories
arXiv:2604.04949v1 Announce Type: cross Abstract: Information retrieval (IR) systems have traditionally been designed and trained for human users, with learning
ArXiv cs.AI
🛡️ AI Safety & Ethics
📄 Paper
⚡ AI Lesson
1w ago
Synthetic Trust Attacks: Modeling How Generative AI Manipulates Human Decisions in Social Engineering Fraud
arXiv:2604.04951v1 Announce Type: cross Abstract: Imagine receiving a video call from your CFO, surrounded by colleagues, asking you to urgently authorise a con
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Generative AI for Video Trailer Synthesis: From Extractive Heuristics to Autoregressive Creativity
arXiv:2604.04953v1 Announce Type: cross Abstract: The domain of automatic video trailer generation is currently undergoing a profound paradigm shift, transition
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The Planetary Cost of AI Acceleration, Part II: The 10th Planetary Boundary and the 6.5-Year Countdown
arXiv:2604.04956v1 Announce Type: cross Abstract: The recent, super-exponential scaling of autonomous Large Language Model (LLM) agents signals a broader, funda
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Self-Supervised Foundation Model for Calcium-imaging Population Dynamics
arXiv:2604.04958v1 Announce Type: cross Abstract: Recent work suggests that large-scale, multi-animal modeling can significantly improve neural recording analys
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
MG$^2$-RAG: Multi-Granularity Graph for Multimodal Retrieval-Augmented Generation
arXiv:2604.04969v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) mitigates hallucinations in Multimodal Large Language Models (MLLMs), yet
DeepCamp AI