3,336 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,336 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (16615) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multilingual Medical Reasoning for Question Answering with Large Language Models
arXiv:2512.05658v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) with reasoning capabilities have recently demonstrated strong potential i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models
arXiv:2512.08503v2 Announce Type: replace-cross Abstract: Multi-modal large reasoning models (MLRMs) pose significant privacy risks by inferring precise geograp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models
arXiv:2512.10932v2 Announce Type: replace-cross Abstract: Early children's developmental trajectories set up a natural goal for sample-efficient pretraining of
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, and LLaMA
arXiv:2512.12812v2 Announce Type: replace-cross Abstract: Prompt engineering has emerged as a critical factor influencing large language model (LLM) performance
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering
arXiv:2512.17396v2 Announce Type: replace-cross Abstract: In this work, we introduce RadImageNet-VQA, a large-scale dataset designed to advance radiologic visua
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Measuring all the noises of LLM Evals
arXiv:2512.21326v2 Announce Type: replace-cross Abstract: Separating signal from noise is central to experiments. Applying well-established statistical methods
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
arXiv:2512.22065v2 Announce Type: replace-cross Abstract: Real-time, streaming interactive avatars represent a critical yet challenging goal in digital human re
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction
arXiv:2601.00809v2 Announce Type: replace-cross Abstract: Agentic workflows driven by large language models (LLMs) are increasingly applied to Building Informat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
JMedEthicBench: A Multi-Turn Conversational Benchmark for Evaluating Medical Safety in Japanese Large Language Models
arXiv:2601.01627v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) are increasingly deployed in healthcare field, it becomes essential to
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
Vision-Language Agents for Interactive Forest Change Analysis
arXiv:2601.04497v2 Announce Type: replace-cross Abstract: Modern forest monitoring workflows increasingly benefit from the growing availability of high-resoluti
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago
Hellinger Multimodal Variational Autoencoders
arXiv:2601.06572v2 Announce Type: replace-cross Abstract: Multimodal variational autoencoders (VAEs) are widely used for weakly supervised generative learning w
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Symphonym: Universal Phonetic Embeddings for Cross-Script Name Matching
arXiv:2601.06932v4 Announce Type: replace-cross Abstract: Matching place names across writing systems is a persistent obstacle to the integration of multilingua
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 3w ago
FigEx2: Visual-Conditioned Panel Detection and Captioning for Scientific Compound Figures
arXiv:2601.08026v4 Announce Type: replace-cross Abstract: Scientific compound figures combine multiple labeled panels into a single image. However, in a PMC-sca
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts
arXiv:2601.10079v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) has become essential for eliciting complex reasoning capabilities in Large
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
Fairness in Healthcare Processes: A Quantitative Analysis of Decision Making in Triage
arXiv:2601.11065v2 Announce Type: replace-cross Abstract: Fairness in automated decision-making has become a critical concern, particularly in high-pressure hea
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
An Agentic Operationalization of DISARM for FIMI Investigation on Social Media
arXiv:2601.15109v3 Announce Type: replace-cross Abstract: Interoperable data and intelligence flows among allied partners and operational end-users remain essen
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3w ago
Dual-Prototype Disentanglement: A Context-Aware Enhancement Framework for Time Series Forecasting
arXiv:2601.16632v3 Announce Type: replace-cross Abstract: Time series forecasting has witnessed significant progress with deep learning. While prevailing approa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
LLMs versus the Halting Problem: Revisiting Program Termination Prediction
arXiv:2601.18987v4 Announce Type: replace-cross Abstract: Determining whether a program terminates is a central problem in computer science. Turing's foundation
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 3w ago
On the Impact of AGENTS.md Files on the Efficiency of AI Coding Agents
arXiv:2601.20404v2 Announce Type: replace-cross Abstract: AI coding agents such as Codex and Claude Code are increasingly used to autonomously contribute to sof
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Does My Chatbot Have an Agenda? Understanding Human and AI Agency in Human-Human-like Chatbot Interaction
arXiv:2601.22452v2 Announce Type: replace-cross Abstract: As AI chatbots shift from tools to companions, critical questions arise: who controls the conversation