7,966 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,966 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (21429) ArXiv cs.AIDev.to AIForbes InnovationMedium · AIMedium · ProgrammingMedium · Cybersecurity
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Scaling Coding Agents via Atomic Skills
arXiv:2604.05013v1 Announce Type: cross Abstract: Current LLM coding agents are predominantly trained on composite benchmarks (e.g., bug fixing), which often le
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
arXiv:2604.05014v1 Announce Type: cross Abstract: Building generalist embodied agents requires integrating perception, language understanding, and action, which
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Phase-Associative Memory: Sequence Modeling in Complex Hilbert Space
arXiv:2604.05030v1 Announce Type: cross Abstract: We present Phase-Associative Memory (PAM), a recurrent sequence model in which all representations are complex
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 3w ago
ID-Sim: An Identity-Focused Similarity Metric
arXiv:2604.05039v1 Announce Type: cross Abstract: Humans have remarkable selective sensitivity to identities -- easily distinguishing between highly similar ide
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago
PCA-Driven Adaptive Sensor Triage for Edge AI Inference
arXiv:2604.05045v1 Announce Type: cross Abstract: Multi-channel sensor networks in industrial IoT often exceed available bandwidth. We propose PCA-Triage, a str
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA
arXiv:2604.05051v1 Announce Type: cross Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago
Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series
arXiv:2604.05064v1 Announce Type: cross Abstract: Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 3w ago
AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels
arXiv:2604.05066v1 Announce Type: cross Abstract: Data movement is the primary bottleneck in modern computing systems. For loop-based programs common in high-pe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Feature-Aware Anisotropic Local Differential Privacy for Utility-Preserving Graph Representation Learning in Metal Additive Manufacturing
arXiv:2604.05077v1 Announce Type: cross Abstract: Metal additive manufacturing (AM) enables the fabrication of safety-critical components, but reliable quality
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
Nidus: Externalized Reasoning for AI-Assisted Engineering
arXiv:2604.05080v1 Announce Type: cross Abstract: We present Nidus, a governance runtime that mechanizes the V-model for AI-assisted software delivery. In the s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation
arXiv:2604.05083v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly adopted as automated judges for evaluating generated text,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks
arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
Simultaneous Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models
arXiv:2604.05110v1 Announce Type: cross Abstract: Breast cancer screening relies heavily on mammography, where the craniocaudal (CC) and mediolateral oblique (M
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner
arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago
CRAB: Codebook Rebalancing for Bias Mitigation in Generative Recommendation
arXiv:2604.05113v1 Announce Type: cross Abstract: Generative recommendation (GeneRec) has introduced a new paradigm that represents items as discrete semantic t
ArXiv cs.AI 📄 Paper 3w ago
$\pi^2$: Structure-Originated Reasoning Data Improves Long-Context Reasoning Ability of Large Language Models
arXiv:2604.05114v1 Announce Type: cross Abstract: We study a pipeline that curates reasoning data from initial structured data for improving long-context reason
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Watch Before You Answer: Learning from Visually Grounded Post-Training
arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Offline RL for Adaptive Policy Retrieval in Prior Authorization
arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning
arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback
arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m