1,258 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 1,258 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (4987) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago
Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark and Strategy
arXiv:2602.01939v3 Announce Type: replace-cross Abstract: Recently, active vision has reemerged as an important concept for manipulation, since visual occlusion
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago
Monocular Normal Estimation via Shading Sequence Estimation
arXiv:2602.09929v5 Announce Type: replace-cross Abstract: Monocular normal estimation aims to estimate the normal map from a single RGB image of an object under
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia
arXiv:2602.18455v2 Announce Type: replace-cross Abstract: Search engines increasingly display LLM-generated answers shown above organic links, shifting search f
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago
The Landscape of AI in Science Education: What is Changing and How to Respond
arXiv:2602.18469v2 Announce Type: replace-cross Abstract: This introductory chapter explores the transformative role of artificial intelligence (AI) in reshapin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis
arXiv:2602.20951v2 Announce Type: replace-cross Abstract: Despite recent advances in diffusion models, AI generated images still often contain visual artifacts
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
arXiv:2603.00141v3 Announce Type: replace-cross Abstract: Image Chain-of-Thought (Image-CoT) is a test-time scaling paradigm that improves image generation by e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails
arXiv:2603.03099v3 Announce Type: replace-cross Abstract: Despite Adam demonstrating faster empirical convergence than SGD in many applications, much of the exi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Graph-of-Mark: Promote Spatial Reasoning in Multimodal Language Models with Graph-Based Visual Prompting
arXiv:2603.06663v2 Announce Type: replace-cross Abstract: Recent advances in training-free visual prompting, such as Set-of-Mark, have emerged as a promising di
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago
The DMA Streaming Framework: Kernel-Level Buffer Orchestration for High-Performance AI Data Paths
arXiv:2603.10030v2 Announce Type: replace-cross Abstract: AI transport libraries move bytes efficiently, but they commonly assume that buffers are already corre
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI
arXiv:2603.11413v3 Announce Type: replace-cross Abstract: Ramaswamy et al. reported in Nature Medicine that ChatGPT Health under-triages 51.6% of emergencies, c
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago
Theory of Dynamic Adaptive Coordination
arXiv:2603.11560v2 Announce Type: replace-cross Abstract: This paper develops a dynamical theory of adaptive coordination governed by persistent environmental m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization
arXiv:2603.11583v2 Announce Type: replace-cross Abstract: The success of a Large Language Model (LLM) task depends heavily on its prompt. Most use-cases specify
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
SemBench: A Universal Semantic Framework for LLM Evaluation
arXiv:2603.11687v2 Announce Type: replace-cross Abstract: Recent progress in Natural Language Processing (NLP) has been driven by the emergence of Large Languag
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Seeking Physics in Diffusion Noise
arXiv:2603.14294v2 Announce Type: replace-cross Abstract: Do video diffusion models encode signals predictive of physical plausibility? We probe intermediate de
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
360{\deg} Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method
arXiv:2603.16179v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have shown impressive abilities in understanding and reasonin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making
arXiv:2603.16673v2 Announce Type: replace-cross Abstract: Embodied robotic systems increasingly rely on large language model (LLM)-based agents to support high-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
P^2O: Joint Policy and Prompt Optimization
arXiv:2603.21877v2 Announce Type: replace-cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a powerful paradigm for enhancing
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
PLDR-LLMs Reason At Self-Organized Criticality
arXiv:2603.23539v1 Announce Type: new Abstract: We show that PLDR-LLMs pretrained at self-organized criticality exhibit reasoning at inference time. The charact
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Environment Maps: Structured Environmental Representations for Long-Horizon Agents
arXiv:2603.23610v2 Announce Type: new Abstract: Although large language models (LLMs) have advanced rapidly, robust automation of complex software workflows rem
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Evaluating a Multi-Agent Voice-Enabled Smart Speaker for Care Homes: A Safety-Focused Framework
arXiv:2603.23625v1 Announce Type: new Abstract: Artificial intelligence (AI) is increasingly being explored in health and social care to reduce administrative w
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments
arXiv:2603.23638v1 Announce Type: new Abstract: Large language models (LLMs) have enabled agentic systems that can reason, plan, and act across complex tasks, b
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
GTO Wizard Benchmark
arXiv:2603.23660v1 Announce Type: new Abstract: We introduce GTO Wizard Benchmark, a public API and standardized evaluation framework for benchmarking algorithm
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 5d ago
Grounding Vision and Language to 3D Masks for Long-Horizon Box Rearrangement
arXiv:2603.23676v1 Announce Type: new Abstract: We study long-horizon planning in 3D environments from under-specified natural-language goals using only visual
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LLMs Do Not Grade Essays Like Humans
arXiv:2603.23714v1 Announce Type: new Abstract: Large language models have recently been proposed as tools for automated essay scoring, but their agreement with