1,258 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 1,258 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (4959) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Mechanistically Interpreting Compression in Vision-Language Models
arXiv:2603.25035v1 Announce Type: new Abstract: Compressed vision-language models (VLMs) are widely used to reduce memory and compute costs, making them a suita
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting
arXiv:2603.25046v1 Announce Type: new Abstract: Precipitation forecasting remains a persistent challenge in tropical regions like Vietnam, where complex topogra
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Sparse Visual Thought Circuits in Vision-Language Models
arXiv:2603.25075v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) improve interpretability in multimodal models, but it remains unclear whether SAE fea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents
arXiv:2603.25097v1 Announce Type: new Abstract: Large Language Model based agents increasingly operate in high stakes, multi turn settings where factual groundi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
When Sensing Varies with Contexts: Context-as-Transform for Tactile Few-Shot Class-Incremental Learning
arXiv:2603.25115v1 Announce Type: new Abstract: Few-Shot Class-Incremental Learning (FSCIL) can be particularly susceptible to acquisition contexts with only a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following
arXiv:2603.25133v1 Announce Type: new Abstract: Rubric-based evaluation has become a prevailing paradigm for evaluating instruction following in large language
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
UniAI-GraphRAG: Synergizing Ontology-Guided Extraction, Multi-Dimensional Clustering, and Dual-Channel Fusion for Robust Multi-Hop Reasoning
arXiv:2603.25152v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems face significant challenges in complex reasoning, multi-hop queries
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
arXiv:2603.25158v1 Announce Type: new Abstract: Equipping Large Language Model (LLM) agents with domain-specific skills is critical for tackling complex tasks.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
The Competence Shadow: Theory and Bounds of AI Assistance in Safety Engineering
arXiv:2603.25197v1 Announce Type: new Abstract: As AI assistants become integrated into safety engineering workflows for Physical AI systems, a critical questio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation
arXiv:2603.25266v1 Announce Type: new Abstract: Probabilistic abstract interpretation is a theory used to extract particular properties of a computer program wh
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpretation to Neural Network Analysis
arXiv:2603.25273v1 Announce Type: new Abstract: The probabilistic abstract interpretation framework of neural network analysis analyzes a neural network by anal
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion
arXiv:2603.25283v1 Announce Type: new Abstract: Gait is increasingly recognized as a vital sign, yet current approaches treat it as a symptom of specific pathol
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
SliderQuant: Accurate Post-Training Quantization for LLMs
arXiv:2603.25284v1 Announce Type: new Abstract: In this paper, we address post-training quantization (PTQ) for large language models (LLMs) from an overlooked p
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
DAGverse: Building Document-Grounded Semantic DAGs from Scientific Papers
arXiv:2603.25293v1 Announce Type: new Abstract: Directed Acyclic Graphs (DAGs) are widely used to represent structured knowledge in scientific and technical dom
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Evaluating Language Models for Harmful Manipulation
arXiv:2603.25326v1 Announce Type: new Abstract: Interest in the concept of AI-driven harmful manipulation is growing, yet current approaches to evaluating it ar
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles
arXiv:2603.25328v1 Announce Type: new Abstract: Automated Vehicle (AV) control in mixed traffic, where AVs coexist with human-driven vehicles, poses significant
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Agentic Trust Coordination for Federated Learning through Adaptive Thresholding and Autonomous Decision Making in Sustainable and Resilient Industrial Networks
arXiv:2603.25334v1 Announce Type: new Abstract: Distributed intelligence in industrial networks increasingly integrates sensing, communication, and computation
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles
arXiv:2603.25356v1 Announce Type: new Abstract: Arithmetic puzzle games provide a controlled setting for studying difficulty in mathematical reasoning tasks, a
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 3d ago
Does Structured Intent Representation Generalize? A Cross-Language, Cross-Model Empirical Study of 5W3H Prompting
arXiv:2603.25379v1 Announce Type: new Abstract: Does structured intent representation generalize across languages and models? We study PPS (Prompt Protocol Spec
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models
arXiv:2603.25412v1 Announce Type: new Abstract: Large language models (LLMs) increasingly rely on explicit chain-of-thought (CoT) reasoning to solve complex tas
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation
arXiv:2603.25415v1 Announce Type: new Abstract: Semantic world models enable embodied agents to reason about objects, relations, and spatial context beyond pure
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Cross-Model Disagreement as a Label-Free Correctness Signal
arXiv:2603.25450v1 Announce Type: new Abstract: Detecting when a language model is wrong without ground truth labels is a fundamental challenge for safe deploym
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
Retraining as Approximate Bayesian Inference
arXiv:2603.25480v1 Announce Type: new Abstract: Model retraining is usually treated as an ongoing maintenance task. But as Harrison Katz now argues, retraining
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago
EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents
arXiv:2603.25498v1 Announce Type: new Abstract: As the Web transitions from static retrieval to generative interaction, the escalating environmental footprint o