📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,014 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (18865) ArXiv cs.AI Dev.to AI Dev.to · FORUM WEB Forbes Innovation Medium · Programming Medium · AI

What's In My Human Feedback? Learning Interpretable Descriptions of Preference Data

arXiv:2510.26202v2 Announce Type: replace-cross Abstract: Human feedback can alter language models in unpredictable and undesirable ways, as practitioners lack

ArXiv cs.AI 📄 Paper 1w ago

Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?

arXiv:2510.27269v3 Announce Type: replace-cross Abstract: Reasoning language models (RLMs) achieve strong performance on complex reasoning tasks, yet they still

ArXiv cs.AI 📄 Paper 1w ago

Thought Branches: Interpreting LLM Reasoning Requires Resampling

arXiv:2510.27484v2 Announce Type: replace-cross Abstract: Most work interpreting reasoning models studies only a single chain-of-thought (CoT), yet these models

ArXiv cs.AI 📄 Paper 1w ago

Context-Guided Decompilation: A Step Towards Re-executability

arXiv:2511.01763v2 Announce Type: replace-cross Abstract: Binary decompilation plays an important role in software security analysis, reverse engineering, and m

ArXiv cs.AI 📄 Paper 1w ago

Multimodal Diffusion Forcing for Forceful Manipulation

arXiv:2511.04812v2 Announce Type: replace-cross Abstract: Given a dataset of expert trajectories, standard imitation learning approaches typically learn a direc

ArXiv cs.AI 📄 Paper 1w ago

SynthAgent: Adapting Web Agents with Synthetic Supervision

arXiv:2511.06101v3 Announce Type: replace-cross Abstract: Web agents struggle to adapt to new websites due to the scarcity of environment specific tasks and dem

ArXiv cs.AI 📄 Paper 1w ago

Introduction to Automated Negotiation

arXiv:2511.08659v3 Announce Type: replace-cross Abstract: This book is an introductory textbook targeted towards computer science students who are completely ne

ArXiv cs.AI 📄 Paper 1w ago

Volumetric Ergodic Control

arXiv:2511.11533v3 Announce Type: replace-cross Abstract: Ergodic control synthesizes optimal coverage behaviors over spatial distributions for nonlinear system

ArXiv cs.AI 📄 Paper 1w ago

GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs

arXiv:2511.11653v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have emerged as powerful tools for passage reranking in information retri

ArXiv cs.AI 📄 Paper 1w ago

Improving Neutrino Oscillation Measurements through Event Classification

arXiv:2511.11938v2 Announce Type: replace-cross Abstract: Precise neutrino energy reconstruction is essential for next-generation long-baseline oscillation expe

ArXiv cs.AI 📄 Paper 1w ago

LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs

arXiv:2511.14774v3 Announce Type: replace-cross Abstract: Evaluating cross-lingual knowledge transfer in large language models is challenging, as correct answer

ArXiv cs.AI 📄 Paper 1w ago

Process-Centric Analysis of Agentic Software Systems

arXiv:2512.02393v3 Announce Type: replace-cross Abstract: Agentic systems are modern software systems: they consist of orchestrated modules, expose interfaces,

ArXiv cs.AI 📄 Paper 1w ago

A Unified Theory of Sparse Dictionary Learning in Mechanistic Interpretability: Piecewise Biconvexity and Spurious Minima

arXiv:2512.05534v4 Announce Type: replace-cross Abstract: As AI models achieve remarkable capabilities across diverse domains, understanding what representation

ArXiv cs.AI 📄 Paper 1w ago

WisPaper: Your AI Scholar Search Engine

arXiv:2512.06879v3 Announce Type: replace-cross Abstract: We present \textsc{WisPaper}, an end-to-end agent system that transforms how researchers discover, org

ArXiv cs.AI 📄 Paper 1w ago

Interpretable Alzheimer's Diagnosis via Multimodal Fusion of Regional Brain Experts

arXiv:2512.10966v2 Announce Type: replace-cross Abstract: Accurate and early diagnosis of Alzheimer's disease (AD) is critical for effective intervention and re

ArXiv cs.AI 📄 Paper 1w ago

Enhancing Geo-localization for Crowdsourced Flood Imagery via LLM-Guided Attention

arXiv:2512.11811v3 Announce Type: replace-cross Abstract: Crowdsourced social media imagery provides real-time visual evidence of urban flooding but often lacks

ArXiv cs.AI 📄 Paper 1w ago

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

arXiv:2512.12675v2 Announce Type: replace-cross Abstract: Subject-driven image generation has advanced from single- to multi-subject composition, while neglecti

ArXiv cs.AI 📄 Paper 1w ago

Understanding Generalization in Role-Playing Models via Information Theory

arXiv:2512.17270v2 Announce Type: replace-cross Abstract: Role-playing models (RPMs) are widely used in real-world applications but underperform when deployed i

ArXiv cs.AI 📄 Paper 1w ago

M$^3$KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation

arXiv:2512.20136v3 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) has recently been extended to multimodal settings, connecting mul

ArXiv cs.AI 📄 Paper 1w ago

LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving

arXiv:2512.20563v2 Announce Type: replace-cross Abstract: Simulators can generate virtually unlimited driving data, yet imitation learning policies in simulatio

ArXiv cs.AI 📄 Paper 1w ago

Variance-Aware Prior-Based Tree Policies for Monte Carlo Tree Search

arXiv:2512.21648v2 Announce Type: replace-cross Abstract: Monte Carlo Tree Search (MCTS) has profoundly influenced reinforcement learning (RL) by integrating pl

ArXiv cs.AI 📄 Paper 1w ago

CricBench: A Multilingual Benchmark for Evaluating LLMs in Cricket Analytics

arXiv:2512.21877v3 Announce Type: replace-cross Abstract: Cricket is the second most popular sport worldwide, with billions of fans seeking advanced statistical

ArXiv cs.AI 📄 Paper 1w ago

Artificial Intelligence for All? Brazilian Teachers on Ethics, Equity, and the Everyday Challenges of AI in Education

arXiv:2512.23834v2 Announce Type: replace-cross Abstract: This study examines the perceptions of Brazilian K-12 education teachers regarding the use of AI in ed

ArXiv cs.AI 📄 Paper 1w ago

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

arXiv:2512.24503v2 Announce Type: replace-cross Abstract: Data teams at frontier AI companies routinely train small proxy models to make critical decisions abou