📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 2,281 articles · Updated every 3 hours
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions
arXiv:2510.05318v3 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated remarkable performance on single-turn text-to-SQL tasks, but
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
BuilderBench: The Building Blocks of Intelligent Agents
arXiv:2510.06288v3 Announce Type: replace Abstract: Today's AI models learn primarily through mimicry and refining, so it is not surprising that they struggle t
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
Operational machine learning for remote spectroscopic detection of CH$_{4}$ point sources
arXiv:2511.07719v2 Announce Type: replace Abstract: Mitigating anthropogenic methane sources is one of the most cost-effective levers to slow down global warmin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Hybrid Stackelberg Game and Diffusion-based Auction for Two-tier Agentic AI Task Offloading in Internet of Agents
arXiv:2511.22076v2 Announce Type: replace Abstract: The Internet of Agents (IoA) is rapidly gaining prominence as a foundational architecture for interconnected
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
DriveSafe: A Hierarchical Risk Taxonomy for Safety-Critical LLM-Based Driving Assistants
arXiv:2601.12138v3 Announce Type: replace Abstract: Large Language Models (LLMs) are increasingly integrated into vehicle-based digital assistants, where unsafe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
arXiv:2602.02050v3 Announce Type: replace Abstract: Tool-using agents based on Large Language Models (LLMs) excel in tasks such as mathematical reasoning and mu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search
arXiv:2602.22983v3 Announce Type: replace Abstract: As Large Language Models (LLMs) are increasingly used, their security risks have drawn increasing attention.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays
arXiv:2602.23276v2 Announce Type: replace Abstract: Chest X-ray plays a central role in thoracic diagnosis, and its interpretation inherently requires multi-ste
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Agentic AI-based Coverage Closure for Formal Verification
arXiv:2603.03147v2 Announce Type: replace Abstract: Coverage closure is a critical requirement in Integrated Chip (IC) development process and key metric for ve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Retrieval-Augmented Generation with Covariate Time Series
arXiv:2603.04951v2 Announce Type: replace Abstract: While RAG has greatly enhanced LLMs, extending this paradigm to Time-Series Foundation Models (TSFMs) remain
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Planning as Goal Recognition: Deriving Heuristics from Intention Models -- Extended Version
arXiv:2603.14824v2 Announce Type: replace Abstract: Classical planning aims to find a sequence of actions, a plan, that maps a starting state into one of the go
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Cascade-Aware Multi-Agent Routing: Spatio-Temporal Sidecars and Geometry-Switching
arXiv:2603.17112v2 Announce Type: replace Abstract: Advanced AI reasoning systems route tasks through dynamic execution graphs of specialized agents. We identif
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models
arXiv:2603.20670v2 Announce Type: replace Abstract: The rapid growth in the volume, variety, and velocity of geospatial data has created data ecosystems that ar
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
A transformer architecture alteration to incentivise externalised reasoning
arXiv:2603.21376v2 Announce Type: replace Abstract: We propose a new architectural change, and post-training pipeline, for making LLMs more verbose reasoners by
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Cerebra: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment
arXiv:2603.21597v2 Announce Type: replace Abstract: Modern clinical practice increasingly depends on reasoning over heterogeneous, evolving, and incomplete pati
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Mapping the Challenges of HCI: An Application and Evaluation of ChatGPT for Mining Insights at Scale
arXiv:2306.05036v5 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly used for analytical tasks, yet their effectiveness in re
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching
arXiv:2406.01825v5 Announce Type: replace-cross Abstract: Machine learning (ML) models are increasingly deployed for virtual screening in drug discovery, where
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Almost Sure Convergence of Linear Temporal Difference Learning with Arbitrary Features
arXiv:2409.12135v3 Announce Type: replace-cross Abstract: Temporal difference (TD) learning with linear function approximation (linear TD) is a classic and powe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Dataset Distillation-based Hybrid Federated Learning on Non-IID Data
arXiv:2409.17517v3 Announce Type: replace-cross Abstract: In federated learning, the heterogeneity of client data has a great impact on the performance of model
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
LOGSAFE: Logic-Guided Verification for Trustworthy Federated Time-Series Learning
arXiv:2411.03231v3 Announce Type: replace-cross Abstract: This paper introduces LOGSAFE, a defense mechanism for federated learning in time series settings, par
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
Mitigating Object Hallucinations in Large Vision-Language Models via Attention Calibration
arXiv:2502.01969v2 Announce Type: replace-cross Abstract: Large Vision-Language Models (LVLMs) exhibit impressive multimodal reasoning capabilities but remain h
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Streaming Attention Approximation via Discrepancy Theory
arXiv:2502.07861v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved impressive success, but their high memory requirements pres
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
Training-free Adjustable Polynomial Graph Filtering for Ultra-fast Multimodal Recommendation
arXiv:2503.04406v3 Announce Type: replace-cross Abstract: Multimodal recommender systems improve the performance of canonical recommender systems with no item f
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems
arXiv:2503.04945v3 Announce Type: replace-cross Abstract: The proliferation of generative models has presented significant challenges in distinguishing authenti
DeepCamp AI