3,273 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (11572) ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA
arXiv:2604.05051v1 Announce Type: cross Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series
arXiv:2604.05064v1 Announce Type: cross Abstract: Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago
AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels
arXiv:2604.05066v1 Announce Type: cross Abstract: Data movement is the primary bottleneck in modern computing systems. For loop-based programs common in high-pe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Feature-Aware Anisotropic Local Differential Privacy for Utility-Preserving Graph Representation Learning in Metal Additive Manufacturing
arXiv:2604.05077v1 Announce Type: cross Abstract: Metal additive manufacturing (AM) enables the fabrication of safety-critical components, but reliable quality
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Nidus: Externalized Reasoning for AI-Assisted Engineering
arXiv:2604.05080v1 Announce Type: cross Abstract: We present Nidus, a governance runtime that mechanizes the V-model for AI-assisted software delivery. In the s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation
arXiv:2604.05083v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly adopted as automated judges for evaluating generated text,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks
arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Simultaneous Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models
arXiv:2604.05110v1 Announce Type: cross Abstract: Breast cancer screening relies heavily on mammography, where the craniocaudal (CC) and mediolateral oblique (M
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner
arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
CRAB: Codebook Rebalancing for Bias Mitigation in Generative Recommendation
arXiv:2604.05113v1 Announce Type: cross Abstract: Generative recommendation (GeneRec) has introduced a new paradigm that represents items as discrete semantic t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Watch Before You Answer: Learning from Visually Grounded Post-Training
arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Offline RL for Adaptive Policy Retrieval in Prior Authorization
arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning
arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback
arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation
arXiv:2604.05150v1 Announce Type: cross Abstract: We study compiled AI, a paradigm in which large language models generate executable code artifacts during a co
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Planning to Explore: Curiosity-Driven Planning for LLM Test Generation
arXiv:2604.05159v1 Announce Type: cross Abstract: The use of LLMs for code generation has naturally extended to code testing and evaluation. As codebases grow i
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
What Makes a Good Response? An Empirical Analysis of Quality in Qualitative Interviews
arXiv:2604.05163v1 Announce Type: cross Abstract: Qualitative interviews provide essential insights into human experiences when they elicit high-quality respons
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning
arXiv:2604.05164v1 Announce Type: cross Abstract: As LLM reasoning performance plateau, improving inference-time compute efficiency is crucial to mitigate overt
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
From Use to Oversight: How Mental Models Influence User Behavior and Output in AI Writing Assistants
arXiv:2604.05166v1 Announce Type: cross Abstract: AI-based writing assistants are ubiquitous, yet little is known about how users' mental models shape their use
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
Modality-Aware and Anatomical Vector-Quantized Autoencoding for Multimodal Brain MRI
arXiv:2604.05171v1 Announce Type: cross Abstract: Learning a robust Variational Autoencoder (VAE) is a fundamental step for many deep learning applications in m