📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,966 articles · Updated every 3 hours · View all reads

arXiv:2604.05005v1 Announce Type: cross Abstract: Large language models are increasingly used as educational assistants, yet evaluation of their educational cap

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction

arXiv:2604.05007v1 Announce Type: cross Abstract: In Audio-Visual Navigation (AVN), agents must locate sound sources in unseen 3D environments using visual and

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago

YMIR: A new Benchmark Dataset and Model for Arabic Yemeni Music Genre Classification Using Convolutional Neural Networks

arXiv:2604.05011v1 Announce Type: cross Abstract: Automatic music genre classification is a major task in music information retrieval; however, most current ben

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Comparative Characterization of KV Cache Management Strategies for LLM Inference

arXiv:2604.05012v1 Announce Type: cross Abstract: Efficient inference with Large Language Models (LLMs) increasingly relies on Key-Value (KV) caches to store pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Scaling Coding Agents via Atomic Skills

arXiv:2604.05013v1 Announce Type: cross Abstract: Current LLM coding agents are predominantly trained on composite benchmarks (e.g., bug fixing), which often le

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

arXiv:2604.05014v1 Announce Type: cross Abstract: Building generalist embodied agents requires integrating perception, language understanding, and action, which

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Phase-Associative Memory: Sequence Modeling in Complex Hilbert Space

arXiv:2604.05030v1 Announce Type: cross Abstract: We present Phase-Associative Memory (PAM), a recurrent sequence model in which all representations are complex

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 3w ago

ID-Sim: An Identity-Focused Similarity Metric

arXiv:2604.05039v1 Announce Type: cross Abstract: Humans have remarkable selective sensitivity to identities -- easily distinguishing between highly similar ide

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago

PCA-Driven Adaptive Sensor Triage for Edge AI Inference

arXiv:2604.05045v1 Announce Type: cross Abstract: Multi-channel sensor networks in industrial IoT often exceed available bandwidth. We propose PCA-Triage, a str

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA

arXiv:2604.05051v1 Announce Type: cross Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

arXiv:2604.05064v1 Announce Type: cross Abstract: Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 3w ago

AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels

arXiv:2604.05066v1 Announce Type: cross Abstract: Data movement is the primary bottleneck in modern computing systems. For loop-based programs common in high-pe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Feature-Aware Anisotropic Local Differential Privacy for Utility-Preserving Graph Representation Learning in Metal Additive Manufacturing

arXiv:2604.05077v1 Announce Type: cross Abstract: Metal additive manufacturing (AM) enables the fabrication of safety-critical components, but reliable quality

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

Nidus: Externalized Reasoning for AI-Assisted Engineering

arXiv:2604.05080v1 Announce Type: cross Abstract: We present Nidus, a governance runtime that mechanizes the V-model for AI-assisted software delivery. In the s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

arXiv:2604.05083v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly adopted as automated judges for evaluating generated text,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks

arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago

Simultaneous Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models

arXiv:2604.05110v1 Announce Type: cross Abstract: Breast cancer screening relies heavily on mammography, where the craniocaudal (CC) and mediolateral oblique (M

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner

arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago

CRAB: Codebook Rebalancing for Bias Mitigation in Generative Recommendation

arXiv:2604.05113v1 Announce Type: cross Abstract: Generative recommendation (GeneRec) has introduced a new paradigm that represents items as discrete semantic t

ArXiv cs.AI 📄 Paper 3w ago

$\pi^2$: Structure-Originated Reasoning Data Improves Long-Context Reasoning Ability of Large Language Models

arXiv:2604.05114v1 Announce Type: cross Abstract: We study a pipeline that curates reasoning data from initial structured data for improving long-context reason

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Offline RL for Adaptive Policy Retrieval in Prior Authorization

arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback

arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m