1,213 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 1,213 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (4959) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
Does Explanation Correctness Matter? Linking Computational XAI Evaluation to Human Understanding
arXiv:2603.25251v1 Announce Type: cross Abstract: Explainable AI (XAI) methods are commonly evaluated with functional metrics such as correctness, which computa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation
arXiv:2603.25253v1 Announce Type: cross Abstract: Large language models (LLMs) hold considerable potential for advancing scientific discovery, yet systematic as
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
CRAFT: Grounded Multi-Agent Coordination Under Partial Information
arXiv:2603.25268v1 Announce Type: cross Abstract: We introduce CRAFT, a multi-agent benchmark for evaluating pragmatic communication in large language models un
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
CSI-tuples-based 3D Channel Fingerprints Construction Assisted by MultiModal Learning
arXiv:2603.25288v1 Announce Type: cross Abstract: Low-altitude communications can promote the integration of aerial and terrestrial wireless resources, expand n
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Revealing the influence of participant failures on model quality in cross-silo Federated Learning
arXiv:2603.25289v1 Announce Type: cross Abstract: Federated Learning (FL) is a paradigm for training machine learning (ML) models in collaborative settings whil
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study
arXiv:2603.25322v1 Announce Type: cross Abstract: Alzheimer's disease (AD) is a growing global health challenge as populations age, and timely, accurate diagnos
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
arXiv:2603.25325v1 Announce Type: cross Abstract: Weight pruning is a standard technique for compressing large language models, yet its effect on learned intern
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
Adaptive Chunking: Optimizing Chunking-Method Selection for RAG
arXiv:2603.25333v1 Announce Type: cross Abstract: The effectiveness of Retrieval-Augmented Generation (RAG) is highly dependent on how documents are chunked, th
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 3d ago
Image Rotation Angle Estimation: Comparing Circular-Aware Methods
arXiv:2603.25351v1 Announce Type: cross Abstract: Automatic image rotation estimation is a key preprocessing step in many vision pipelines. This task is challen
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics
arXiv:2603.25366v1 Announce Type: cross Abstract: Autonomous object search is challenging for mobile robots operating in indoor environments due to partial obse
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs
arXiv:2603.25385v1 Announce Type: cross Abstract: Quantization techniques such as BitsAndBytes, AWQ, and GPTQ are widely used as a standard method in deploying
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
A Causal Framework for Evaluating ICU Discharge Strategies
arXiv:2603.25397v1 Announce Type: cross Abstract: In this applied paper, we address the difficult open problem of when to discharge patients from the Intensive
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models
arXiv:2603.25403v1 Announce Type: cross Abstract: On-device Vision-Language Models (VLMs) promise data privacy via local execution. However, we show that the ar
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
System Design for Maintaining Internal State Consistency in Long-Horizon Robotic Tabletop Games
arXiv:2603.25405v1 Announce Type: cross Abstract: Long-horizon tabletop games pose a distinct systems challenge for robotics: small perceptual or execution erro
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Decidable By Construction: Design-Time Verification for Trustworthy AI
arXiv:2603.25414v1 Announce Type: cross Abstract: A prevailing assumption in machine learning is that model correctness must be enforced after the fact. We obse
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild
arXiv:2603.25423v1 Announce Type: cross Abstract: The rise of micro-videos has reshaped how misinformation spreads, amplifying its speed, reach, and impact on p
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Temporally Decoupled Diffusion Planning for Autonomous Driving
arXiv:2603.25462v1 Announce Type: cross Abstract: Motion planning in dynamic urban environments requires balancing immediate safety with long-term goals. While
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Maximum Entropy Behavior Exploration for Sim2Real Zero-Shot Reinforcement Learning
arXiv:2603.25464v1 Announce Type: cross Abstract: Zero-shot reinforcement learning (RL) algorithms aim to learn a family of policies from a reward-free dataset,
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
Interpretable PM2.5 Forecasting for Urban Air Quality: A Comparative Study of Operational Time-Series Models
arXiv:2603.25495v1 Announce Type: cross Abstract: Accurate short-term air-quality forecasting is essential for public health protection and urban management, ye
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification
arXiv:2603.25507v1 Announce Type: cross Abstract: Accurate Network Traffic Classification (NTC) is increasingly constrained by limited labeled data and strict p
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 3d ago
Challenges in Hyperspectral Imaging for Autonomous Driving: The HSI-Drive Case
arXiv:2603.25510v1 Announce Type: cross Abstract: The use of hyperspectral imaging (HSI) in autonomous driving (AD), while promising, faces many challenges rela
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
NERO-Net: A Neuroevolutionary Approach for the Design of Adversarially Robust CNNs
arXiv:2603.25517v1 Announce Type: cross Abstract: Neuroevolution automates the complex task of neural network design but often ignores the inherent adversarial
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 3d ago
CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild
arXiv:2603.25524v1 Announce Type: cross Abstract: Long-term behavioral monitoring of individual animals is crucial for studying behavioral changes that occur ov
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
arXiv:2603.25562v1 Announce Type: cross Abstract: On-policy distillation (OPD) is appealing for large language model (LLM) post-training because it evaluates te