📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,014 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (19106) ArXiv cs.AI Dev.to AI Dev.to · FORUM WEB Forbes Innovation Medium · Programming Medium · AI

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Experience Transfer for Multimodal LLM Agents in Minecraft Game

arXiv:2604.05533v1 Announce Type: new Abstract: Multimodal LLM agents operating in complex game environments must continually reuse past experience to solve new

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

SignalClaw: LLM-Guided Evolutionary Synthesis of Interpretable Traffic Signal Control Skills

arXiv:2604.05535v1 Announce Type: new Abstract: Traffic signal control TSC requires strategies that are both effective and interpretable for deployment, yet rei

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 2w ago

A canonical generalization of OBDD

arXiv:2604.05537v1 Announce Type: new Abstract: We introduce Tree Decision Diagrams (TDD) as a model for Boolean functions that generalizes OBDD. They can be se

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Large Language Model Predicates to Logic Tensor Networks: Neurosymbolic Offer Validation in Regulated Procurement

arXiv:2604.05539v1 Announce Type: new Abstract: We present a neurosymbolic approach, i.e., combining symbolic and subsymbolic artificial intelligence, to valida

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago

COSMO-Agent: Tool-Augmented Agent for Closed-loop Optimization,Simulation,and Modeling Orchestration

arXiv:2604.05547v1 Announce Type: new Abstract: Iterative industrial design-simulation optimization is bottlenecked by the CAD-CAE semantic gap: translating sim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ResearchEVO: An End-to-End Framework for Automated Scientific Discovery and Documentation

arXiv:2604.05587v1 Announce Type: new Abstract: An important recurring pattern in scientific breakthroughs is a two-stage process: an initial phase of undirecte

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Label Effects: Shared Heuristic Reliance in Trust Assessment by Humans and LLM-as-a-Judge

arXiv:2604.05593v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as automated evaluators (LLM-as-a-Judge). This work challenge

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 2w ago

Beyond Behavior: Why AI Evaluation Needs a Cognitive Revolution

arXiv:2604.05631v1 Announce Type: new Abstract: In 1950, Alan Turing proposed replacing the question "Can machines think?" with a behavioral test: if a machine'

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

PECKER: A Precisely Efficient Critical Knowledge Erasure Recipe For Machine Unlearning in Diffusion Models

arXiv:2604.05634v1 Announce Type: new Abstract: Machine unlearning (MU) has become a critical technique for GenAI models' safe and compliant operation. While ex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control

arXiv:2604.05663v1 Announce Type: new Abstract: Traffic signal control (TSC) is a core component of intelligent transportation systems (ITS), aiming to reduce c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LUDOBENCH: Evaluating LLM Behavioural Decision-Making Through Spot-Based Board Game Scenarios in Ludo

arXiv:2604.05681v1 Announce Type: new Abstract: We introduce LudoBench, a benchmark for evaluating LLM strategic reasoning in Ludo, a stochastic multi-agent boa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

QA-MoE: Towards a Continuous Reliability Spectrum with Quality-Aware Mixture of Experts for Robust Multimodal Sentiment Analysis

arXiv:2604.05704v1 Announce Type: new Abstract: Multimodal Sentiment Analysis (MSA) aims to infer human sentiment from textual, acoustic, and visual signals. In

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Can Large Language Models Reinvent Foundational Algorithms?

arXiv:2604.05716v1 Announce Type: new Abstract: LLMs have shown strong potential to advance scientific discovery. Whether they possess the capacity for foundati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Emergent social transmission of model-based representations without inference

arXiv:2604.05777v1 Announce Type: new Abstract: How do people acquire rich, flexible knowledge about their environment from others despite limited cognitive cap

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents

arXiv:2604.05808v1 Announce Type: new Abstract: Large language model (LLM) agents have demonstrated strong capabilities in complex interactive decision-making t

ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 2w ago

Reciprocal Trust and Distrust in Artificial Intelligence Systems: The Hard Problem of Regulation

arXiv:2604.05826v1 Announce Type: new Abstract: Policy makers, scientists, and the public are increasingly confronted with thorny questions about the regulation

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2w ago

Vision-Guided Iterative Refinement for Frontend Code Generation

arXiv:2604.05839v1 Announce Type: new Abstract: Code generation with large language models often relies on multi-stage human-in-the-loop refinement, which is ef

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring

arXiv:2604.05854v1 Announce Type: new Abstract: We present \textbf{Deep Researcher Agent}, an open-source framework that enables large language model (LLM) agen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

When Do We Need LLMs? A Diagnostic for Language-Driven Bandits

arXiv:2604.05859v1 Announce Type: new Abstract: We study Contextual Multi-Armed Bandits (CMABs) for non-episodic sequential decision making problems where the c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

JTON: A Token-Efficient JSON Superset with Zen Grid Tabular Encoding for Large Language Models

arXiv:2604.05865v1 Announce Type: new Abstract: When LLMs process structured data, the serialization format directly affects cost and context utilization. Stand

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Joint Knowledge Base Completion and Question Answering by Combining Large Language Models and Small Language Models

arXiv:2604.05875v1 Announce Type: new Abstract: Knowledge Bases (KBs) play a key role in various applications. As two representative KB-related tasks, knowledge

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

arXiv:2604.05887v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have advanced unified reasoning over text, images, and videos, but thei

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Context-Value-Action Architecture for Value-Driven Large Language Model Agents

arXiv:2604.05939v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown promise in simulating human behavior, yet existing agents often exhibit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning

arXiv:2604.05943v1 Announce Type: new Abstract: Recent advances in multi-agent reinforcement learning (MARL) have demonstrated success in numerous challenging d