📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 7,014 articles · Updated every 3 hours
ArXiv cs.AI
📄 Paper
1w ago
What's In My Human Feedback? Learning Interpretable Descriptions of Preference Data
arXiv:2510.26202v2 Announce Type: replace-cross Abstract: Human feedback can alter language models in unpredictable and undesirable ways, as practitioners lack
ArXiv cs.AI
📄 Paper
1w ago
Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?
arXiv:2510.27269v3 Announce Type: replace-cross Abstract: Reasoning language models (RLMs) achieve strong performance on complex reasoning tasks, yet they still
ArXiv cs.AI
📄 Paper
1w ago
Thought Branches: Interpreting LLM Reasoning Requires Resampling
arXiv:2510.27484v2 Announce Type: replace-cross Abstract: Most work interpreting reasoning models studies only a single chain-of-thought (CoT), yet these models
ArXiv cs.AI
📄 Paper
1w ago
Context-Guided Decompilation: A Step Towards Re-executability
arXiv:2511.01763v2 Announce Type: replace-cross Abstract: Binary decompilation plays an important role in software security analysis, reverse engineering, and m
ArXiv cs.AI
📄 Paper
1w ago
Multimodal Diffusion Forcing for Forceful Manipulation
arXiv:2511.04812v2 Announce Type: replace-cross Abstract: Given a dataset of expert trajectories, standard imitation learning approaches typically learn a direc
ArXiv cs.AI
📄 Paper
1w ago
SynthAgent: Adapting Web Agents with Synthetic Supervision
arXiv:2511.06101v3 Announce Type: replace-cross Abstract: Web agents struggle to adapt to new websites due to the scarcity of environment specific tasks and dem
ArXiv cs.AI
📄 Paper
1w ago
Introduction to Automated Negotiation
arXiv:2511.08659v3 Announce Type: replace-cross Abstract: This book is an introductory textbook targeted towards computer science students who are completely ne
ArXiv cs.AI
📄 Paper
1w ago
Volumetric Ergodic Control
arXiv:2511.11533v3 Announce Type: replace-cross Abstract: Ergodic control synthesizes optimal coverage behaviors over spatial distributions for nonlinear system
ArXiv cs.AI
📄 Paper
1w ago
GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs
arXiv:2511.11653v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have emerged as powerful tools for passage reranking in information retri
ArXiv cs.AI
📄 Paper
1w ago
Improving Neutrino Oscillation Measurements through Event Classification
arXiv:2511.11938v2 Announce Type: replace-cross Abstract: Precise neutrino energy reconstruction is essential for next-generation long-baseline oscillation expe
ArXiv cs.AI
📄 Paper
1w ago
LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs
arXiv:2511.14774v3 Announce Type: replace-cross Abstract: Evaluating cross-lingual knowledge transfer in large language models is challenging, as correct answer
ArXiv cs.AI
📄 Paper
1w ago
Process-Centric Analysis of Agentic Software Systems
arXiv:2512.02393v3 Announce Type: replace-cross Abstract: Agentic systems are modern software systems: they consist of orchestrated modules, expose interfaces,
ArXiv cs.AI
📄 Paper
1w ago
A Unified Theory of Sparse Dictionary Learning in Mechanistic Interpretability: Piecewise Biconvexity and Spurious Minima
arXiv:2512.05534v4 Announce Type: replace-cross Abstract: As AI models achieve remarkable capabilities across diverse domains, understanding what representation
ArXiv cs.AI
📄 Paper
1w ago
WisPaper: Your AI Scholar Search Engine
arXiv:2512.06879v3 Announce Type: replace-cross Abstract: We present WisPaper, an end-to-end agent system that transforms how researchers discover, org
ArXiv cs.AI
📄 Paper
1w ago
Interpretable Alzheimer's Diagnosis via Multimodal Fusion of Regional Brain Experts
arXiv:2512.10966v2 Announce Type: replace-cross Abstract: Accurate and early diagnosis of Alzheimer's disease (AD) is critical for effective intervention and re
ArXiv cs.AI
📄 Paper
1w ago
Enhancing Geo-localization for Crowdsourced Flood Imagery via LLM-Guided Attention
arXiv:2512.11811v3 Announce Type: replace-cross Abstract: Crowdsourced social media imagery provides real-time visual evidence of urban flooding but often lacks
ArXiv cs.AI
📄 Paper
1w ago
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling
arXiv:2512.12675v2 Announce Type: replace-cross Abstract: Subject-driven image generation has advanced from single- to multi-subject composition, while neglecti
ArXiv cs.AI
📄 Paper
1w ago
Understanding Generalization in Role-Playing Models via Information Theory
arXiv:2512.17270v2 Announce Type: replace-cross Abstract: Role-playing models (RPMs) are widely used in real-world applications but underperform when deployed i
ArXiv cs.AI
📄 Paper
1w ago
M³KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation
arXiv:2512.20136v3 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) has recently been extended to multimodal settings, connecting mul
ArXiv cs.AI
📄 Paper
1w ago
LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving
arXiv:2512.20563v2 Announce Type: replace-cross Abstract: Simulators can generate virtually unlimited driving data, yet imitation learning policies in simulatio
ArXiv cs.AI
📄 Paper
1w ago
Variance-Aware Prior-Based Tree Policies for Monte Carlo Tree Search
arXiv:2512.21648v2 Announce Type: replace-cross Abstract: Monte Carlo Tree Search (MCTS) has profoundly influenced reinforcement learning (RL) by integrating pl
ArXiv cs.AI
📄 Paper
1w ago
CricBench: A Multilingual Benchmark for Evaluating LLMs in Cricket Analytics
arXiv:2512.21877v3 Announce Type: replace-cross Abstract: Cricket is the second most popular sport worldwide, with billions of fans seeking advanced statistical
ArXiv cs.AI
📄 Paper
1w ago
Artificial Intelligence for All? Brazilian Teachers on Ethics, Equity, and the Everyday Challenges of AI in Education
arXiv:2512.23834v2 Announce Type: replace-cross Abstract: This study examines the perceptions of Brazilian K-12 education teachers regarding the use of AI in ed
ArXiv cs.AI
📄 Paper
1w ago
Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice
arXiv:2512.24503v2 Announce Type: replace-cross Abstract: Data teams at frontier AI companies routinely train small proxy models to make critical decisions abou
DeepCamp AI