5,060 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (13554) ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI 📄 Paper 4d ago
METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models
arXiv:2604.11502v1 Announce Type: cross Abstract: Contextual causal reasoning is a critical yet challenging capability for Large Language Models (LLMs). Existin
ArXiv cs.AI 📄 Paper 4d ago
Deep Learning for Sequential Decision Making under Uncertainty: Foundations, Frameworks, and Frontiers
arXiv:2604.11507v1 Announce Type: cross Abstract: Artificial intelligence (AI) is moving increasingly beyond prediction to support decisions in complex, uncerta
ArXiv cs.AI 📄 Paper 4d ago
Not All Forgetting Is Equal: Architecture-Dependent Retention Dynamics in Fine-Tuned Image Classifiers
arXiv:2604.11508v1 Announce Type: cross Abstract: Fine-tuning pretrained image classifiers is standard practice, yet which individual samples are forgotten duri
ArXiv cs.AI 📄 Paper 4d ago
Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization
arXiv:2604.11510v1 Announce Type: cross Abstract: To encourage diverse exploration in reinforcement learning (RL) for large language models (LLMs) without compr
ArXiv cs.AI 📄 Paper 4d ago
EdgeCIM: A Hardware-Software Co-Design for CIM-Based Acceleration of Small Language Models
arXiv:2604.11512v1 Announce Type: cross Abstract: The growing demand for deploying Small Language Models (SLMs) on edge devices, including laptops, smartphones,
ArXiv cs.AI 📄 Paper 4d ago
From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python
arXiv:2604.11518v1 Announce Type: cross Abstract: Cross-language migration of large software systems is a persistent engineering challenge, particularly when th
ArXiv cs.AI 📄 Paper 4d ago
SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models
arXiv:2604.11530v1 Announce Type: cross Abstract: Vision-Language Models (VLM) have revolutionized multimodal learning by jointly processing visual and textual
ArXiv cs.AI 📄 Paper 4d ago
CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space
arXiv:2604.11539v1 Announce Type: cross Abstract: Human perception of visual similarity is inherently adaptive and subjective, depending on the users' interests
ArXiv cs.AI 📄 Paper 4d ago
NovBench: Evaluating Large Language Models on Academic Paper Novelty Assessment
arXiv:2604.11543v1 Announce Type: cross Abstract: Novelty is a core requirement in academic publishing and a central focus of peer review, yet the growing volum
ArXiv cs.AI 📄 Paper 4d ago
Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory
arXiv:2604.11544v1 Announce Type: cross Abstract: Structured memory representations such as knowledge graphs are central to autonomous agents and other long-liv
ArXiv cs.AI 📄 Paper 4d ago
FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning
arXiv:2604.11556v1 Announce Type: cross Abstract: LLM-assisted software development has become increasingly prevalent, and can generate large-scale systems, suc
ArXiv cs.AI 📄 Paper 4d ago
bacpipe: a Python package to make bioacoustic deep learning models accessible
arXiv:2604.11560v1 Announce Type: cross Abstract: 1. Natural sounds have been recorded for millions of hours over the previous decades using passive acoustic mo
ArXiv cs.AI 📄 Paper 4d ago
Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo
arXiv:2604.11563v1 Announce Type: cross Abstract: Providing AI agents with reliable long-term memory that does not hallucinate remains an open problem. Current
ArXiv cs.AI 📄 Paper 4d ago
Minimizing classical resources in variational measurement-based quantum computation for generative modeling
arXiv:2604.11578v1 Announce Type: cross Abstract: Measurement-based quantum computation (MBQC) is a framework for quantum information processing in which a comp
ArXiv cs.AI 📄 Paper 4d ago
A Triadic Suffix Tokenization Scheme for Numerical Reasoning
arXiv:2604.11582v1 Announce Type: cross Abstract: Standard subword tokenization methods fragment numbers inconsistently, causing large language models (LLMs) to
ArXiv cs.AI 📄 Paper 4d ago
Layerwise Dynamics for In-Context Classification in Transformers
arXiv:2604.11613v1 Announce Type: cross Abstract: Transformers can perform in-context classification from a few labeled examples, yet the inference-time algorit
ArXiv cs.AI 📄 Paper 4d ago
CUTEv2: Unified and Configurable Matrix Extension for Diverse CPU Architectures with Minimal Design Overhead
arXiv:2604.11615v1 Announce Type: cross Abstract: Matrix extensions have emerged as an essential feature in modern CPUs to address the surging demands of AI wor
ArXiv cs.AI 📄 Paper 4d ago
SCNO: Spiking Compositional Neural Operator -- Towards a Neuromorphic Foundation Model for Nuclear PDE Solving
arXiv:2604.11625v1 Announce Type: cross Abstract: Neural operators have emerged as powerful surrogates for partial differential equation (PDE) solvers, yet they
ArXiv cs.AI 📄 Paper 4d ago
CodeTracer: Towards Traceable Agent States
arXiv:2604.11641v1 Announce Type: cross Abstract: Code agents are advancing rapidly, but debugging them is becoming increasingly difficult. As frameworks orches
ArXiv cs.AI 📄 Paper 4d ago
RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents
arXiv:2604.11655v1 Announce Type: cross Abstract: The rapid adoption of Large Language Models (LLMs) in interactive systems has enabled the creation of dynamic,