1,213 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 1,213 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (5032) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIThe Verge
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion
arXiv:2511.21542v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models offer a unified framework for robotic manipulation by integrating
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Goal-Oriented Multi-Agent Semantic Networking: Unifying Intents, Semantics, and Intelligence
arXiv:2512.01035v2 Announce Type: replace-cross Abstract: 6G services are evolving toward goal-oriented and AI-native communication, which are expected to deliv
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
From Panel to Pixel: Zoom-In Vision-Language Pretraining from Biomedical Scientific Literature
arXiv:2512.02566v2 Announce Type: replace-cross Abstract: There is a growing interest in developing strong biomedical vision-language models. A popular approach
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
arXiv:2512.04000v2 Announce Type: replace-cross Abstract: The application of Large Multimodal Models (LMMs) to long-form video understanding is constrained by l
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI Decision Support
arXiv:2512.07801v5 Announce Type: replace-cross Abstract: LLM-based agents are increasingly deployed for expert decision support, yet human-AI teams in high-sta
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
ODMA: On-Demand Memory Allocation Strategy for LLM Serving on LPDDR-Class Accelerators
arXiv:2512.09427v3 Announce Type: replace-cross Abstract: Existing memory management techniques severely hinder efficient Large Language Model serving on accele
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Physics-driven human-like working memory outperforms digital networks in dynamic vision
arXiv:2512.15829v3 Announce Type: replace-cross Abstract: While the unsustainable energy cost of artificial intelligence necessitates physics-driven computing,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Deep Neural Networks as Discrete Dynamical Systems: Implications for Physics-Informed Learning
arXiv:2601.00473v2 Announce Type: replace-cross Abstract: We revisit the analogy between feed-forward deep neural networks (DNNs) and discrete dynamical systems
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 5d ago
Understanding Pure Textual Reasoning for Blind Image Quality Assessment
arXiv:2601.02441v2 Announce Type: replace-cross Abstract: Textual reasoning has recently been widely adopted in Blind Image Quality Assessment (BIQA). However,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection
arXiv:2601.09195v2 Announce Type: replace-cross Abstract: Supervised fine-tuning (SFT) is a fundamental post-training strategy to align Large Language Models (L
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset
arXiv:2601.10305v3 Announce Type: replace-cross Abstract: Vision-Language Pre-training (VLP) models have achieved remarkable success by leveraging large-scale i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation
arXiv:2601.11702v2 Announce Type: replace-cross Abstract: AI compliance is becoming increasingly critical as AI systems grow more powerful and pervasive. Yet th
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
HalluJudge: A Reference-Free Hallucination Detection for Context Misalignment in Code Review Automation
arXiv:2601.19072v2 Announce Type: replace-cross Abstract: Large Language models (LLMs) have shown strong capabilities in code review automation, such as review
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
arXiv:2602.02378v2 Announce Type: replace-cross Abstract: As LLMs expand from assistance to decision support, a dangerous pattern emerges: fluent agreement with
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
SPARE: Self-distillation for PARameter-Efficient Removal
arXiv:2602.07058v2 Announce Type: replace-cross Abstract: Machine Unlearning aims to remove the influence of specific data or concepts from trained models while
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
On Randomness in Agentic Evals
arXiv:2602.07150v3 Announce Type: replace-cross Abstract: Agentic systems are evaluated on benchmarks where agents interact with environments to solve tasks. Mo
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 5d ago
KRONE: Hierarchical and Modular Log Anomaly Detection
arXiv:2602.07303v2 Announce Type: replace-cross Abstract: Log anomaly detection is crucial for uncovering system failures and security risks. Although logs orig
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering
arXiv:2602.07906v4 Announce Type: replace-cross Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimiza
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 5d ago
OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model
arXiv:2602.12304v3 Announce Type: replace-cross Abstract: Existing mainstream video customization methods focus on generating identity-consistent videos based o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
arXiv:2602.16485v2 Announce Type: replace-cross Abstract: Existing Multi-Agent Systems (MAS) typically rely on homogeneous model configurations, failing to expl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Smooth Gate Functions for Soft Advantage Policy Optimization
arXiv:2602.19345v2 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) has significantly advanced the training of large language mo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies
arXiv:2602.23811v3 Announce Type: replace-cross Abstract: We investigate the theoretical aspects of offline reinforcement learning (RL) under general function a
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 5d ago
OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security
arXiv:2603.08566v2 Announce Type: replace-cross Abstract: DARPA's AI Cyber Challenge (AIxCC) showed that cyber reasoning systems (CRSs) can go beyond vulnerabil
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings
arXiv:2603.09643v3 Announce Type: replace-cross Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents,