📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 1,754 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
LLMORPH: Automated Metamorphic Testing of Large Language Models
arXiv:2603.23611v1 Announce Type: cross Abstract: Automated testing is essential for evaluating and improving the reliability of Large Language Models (LLMs), y
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops
arXiv:2603.23613v1 Announce Type: cross Abstract: Large Language Models (LLMs) are showing remarkable performance in generating source code, yet the generated c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
A Theory of LLM Information Susceptibility
arXiv:2603.23626v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed as optimization modules in agentic systems, yet the fun
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
5d ago
Ukrainian Visual Word Sense Disambiguation Benchmark
arXiv:2603.23627v1 Announce Type: cross Abstract: This study presents a benchmark for evaluating the Visual Word Sense Disambiguation (Visual-WSD) task in Ukrai
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks
arXiv:2603.23646v1 Announce Type: cross Abstract: While recent work has benchmarked large language models on Swiss legal translation (Niklaus et al., 2025) and
ArXiv cs.AI
📄 Paper
5d ago
{\lambda}Split: Self-Supervised Content-Aware Spectral Unmixing for Fluorescence Microscopy
arXiv:2603.23647v1 Announce Type: cross Abstract: In fluorescence microscopy, spectral unmixing aims to recover individual fluorophore concentrations from spect
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges
arXiv:2603.23659v1 Announce Type: cross Abstract: When large language models make ethical judgments, do their internal representations distinguish between norma
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
5d ago
Echoes: A semantically-aligned music deepfake detection dataset
arXiv:2603.23667v1 Announce Type: cross Abstract: We introduce Echoes, a new dataset for music deepfake detection designed for training and benchmarking detecto
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
5d ago
Estimating Individual Tree Height and Species from UAV Imagery
arXiv:2603.23669v1 Announce Type: cross Abstract: Accurate estimation of forest biomass, a major carbon sink, relies heavily on tree-level traits such as height
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Prototype Fusion: A Training-Free Multi-Layer Approach to OOD Detection
arXiv:2603.23677v1 Announce Type: cross Abstract: Deep learning models are increasingly deployed in safety-critical applications, where reliable out-of-distribu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
PLACID: Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation
arXiv:2603.23678v1 Announce Type: cross Abstract: Large Language Models (LLMs) offer transformative solutions across many domains, but healthcare integration is
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
5d ago
Learning What Can Be Picked: Active Reachability Estimation for Efficient Robotic Fruit Harvesting
arXiv:2603.23679v1 Announce Type: cross Abstract: Agriculture remains a cornerstone of global health and economic sustainability, yet labor-intensive tasks such
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots
arXiv:2603.23682v1 Announce Type: cross Abstract: The rapid adoption of large language models (LLMs) in education raises profound challenges for assessment desi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
The Diminishing Returns of Early-Exit Decoding in Modern LLMs
arXiv:2603.23701v1 Announce Type: cross Abstract: In Large Language Model (LLM) inference, early-exit refers to stopping computation at an intermediate layer on
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
5d ago
An In-Depth Study of Filter-Agnostic Vector Search on a PostgreSQL Database System: [Experiments and Analysis]
arXiv:2603.23710v1 Announce Type: cross Abstract: Filtered Vector Search (FVS) is critical for supporting semantic search and GenAI applications in modern datab
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
5d ago
CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records
arXiv:2603.23719v1 Announce Type: cross Abstract: Electronic health records (EHRs) are invaluable for clinical research, yet privacy concerns severely restrict
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Self Paced Gaussian Contextual Reinforcement Learning
arXiv:2603.23755v1 Announce Type: cross Abstract: Curriculum learning improves reinforcement learning (RL) efficiency by sequencing tasks from simple to complex
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
5d ago
AI-driven Intent-Based Networking Approach for Self-configuration of Next Generation Networks
arXiv:2603.23772v1 Announce Type: cross Abstract: Intent-Based Networking (IBN) aims to simplify operating heterogeneous infrastructures by translating high-lev
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Human-in-the-Loop Pareto Optimization: Trade-off Characterization for Assist-as-Needed Training and Performance Evaluation
arXiv:2603.23777v1 Announce Type: cross Abstract: During human motor skill training and physical rehabilitation, there is an inherent trade-off between task dif
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models
arXiv:2603.23783v2 Announce Type: cross Abstract: Adapting large-scale foundation models to new domains with limited supervision remains a fundamental challenge
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
The Cognitive Firewall:Securing Browser Based AI Agents Against Indirect Prompt Injection Via Hybrid Edge Cloud Defense
arXiv:2603.23791v1 Announce Type: cross Abstract: Deploying large language models (LLMs) as autonomous browser agents exposes a significant attack surface in th
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Object Search in Partially-Known Environments via LLM-informed Model-based Planning and Prompt Selection
arXiv:2603.23800v1 Announce Type: cross Abstract: We present a novel LLM-informed model-based planning framework, and a novel prompt selection method, for objec
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Deep Neural Regression Collapse
arXiv:2603.23805v1 Announce Type: cross Abstract: Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
5d ago
Willful Disobedience: Automatically Detecting Failures in Agentic Traces
arXiv:2603.23806v1 Announce Type: cross Abstract: AI agents are increasingly embedded in real software systems, where they execute multi-step workflows through
DeepCamp AI