📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 2,281 articles · Updated every 3 hours · View all news
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
CoverageBench: Evaluating Information Coverage across Tasks and Domains
arXiv:2603.20034v1 Announce Type: cross Abstract: We wish to measure the information coverage of an ad hoc retrieval algorithm, that is, how much of the range o
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families
arXiv:2603.20042v1 Announce Type: cross Abstract: Large language models (LLMs) have driven substantial advances in speech language models (SpeechLMs), yielding
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries
arXiv:2603.20062v1 Announce Type: cross Abstract: When a traveler asks an AI search engine to recommend a hotel, which sources get cited -- and does query frami
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Fine-tuning Timeseries Predictors Using Reinforcement Learning
arXiv:2603.20063v1 Announce Type: cross Abstract: This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Agentic Harness for Real-World Compilers
arXiv:2603.20075v1 Announce Type: cross Abstract: Compilers are critical to modern computing, yet fixing compiler bugs is difficult. While recent large language
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain
arXiv:2603.20094v1 Announce Type: cross Abstract: Large manufacturing companies face challenges in information retrieval due to data silos maintained by differe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models
arXiv:2603.20100v1 Announce Type: cross Abstract: Direct Preference Optimization (DPO) is widely used after supervised fine-tuning (SFT) to align language model
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Spectral Alignment in Forward-Backward Representations via Temporal Abstraction
arXiv:2603.20103v1 Announce Type: cross Abstract: Forward-backward (FB) representations provide a powerful framework for learning the successor representation (
ArXiv cs.AI
📄 Paper
1w ago
The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $\lambda$-Calculus
arXiv:2603.20105v1 Announce Type: cross Abstract: LLMs are increasingly used as general-purpose reasoners, but long inputs remain bottlenecked by a fixed contex
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture -- Bridging Predictive and Generative Self-Supervised Learning
arXiv:2603.20111v1 Announce Type: cross Abstract: The Joint-Embedding Predictive Architecture (JEPA) is often seen as a non-generative alternative to likelihood
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech
arXiv:2603.20112v1 Announce Type: cross Abstract: Personalizing Automatic Speech Recognition (ASR) for non-normative speech remains challenging because data col
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning
arXiv:2603.20116v1 Announce Type: cross Abstract: Conventional fine-tuning on domain-specific datasets can inadvertently alter a model's pretrained multimodal p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models
arXiv:2603.20122v1 Announce Type: cross Abstract: Large Language Models (LLMs) have been widely deployed, especially through free Web-based applications that ex
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
An Agentic Multi-Agent Architecture for Cybersecurity Risk Management
arXiv:2603.20131v1 Announce Type: cross Abstract: Getting a real cybersecurity risk assessment for a small organization is expensive -- a NIST CSF-aligned engag
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
Enhancing Hyperspace Analogue to Language (HAL) Representations via Attention-Based Pooling for Text Classification
arXiv:2603.20149v1 Announce Type: cross Abstract: The Hyperspace Analogue to Language (HAL) model relies on global word co-occurrence matrices to construct dist
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case
arXiv:2603.20151v1 Announce Type: cross Abstract: Engineering system design -- whether mechatronic, control, or embedded -- often proceeds in an ad hoc manner,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models
arXiv:2603.20161v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks. However, the trut
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning
arXiv:2603.20164v1 Announce Type: cross Abstract: Conventional robot social behavior generation has been limited in flexibility and autonomy, relying on predefi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation
arXiv:2603.20172v1 Announce Type: cross Abstract: Recent work on chain-of-thought (CoT) faithfulness reports single aggregate numbers (e.g., DeepSeek-R1 acknowl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
AI Agents Can Already Autonomously Perform Experimental High Energy Physics
arXiv:2603.20179v1 Announce Type: cross Abstract: Large language model-based AI agents are now able to autonomously execute substantial portions of a high energ
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Adaptive Greedy Frame Selection for Long Video Understanding
arXiv:2603.20180v1 Announce Type: cross Abstract: Large vision--language models (VLMs) are increasingly applied to long-video question answering, yet inference
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning
arXiv:2603.20181v1 Announce Type: cross Abstract: The use of ML in cybersecurity has long been impaired by generalization issues: Models that work well in contr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking
arXiv:2603.20185v1 Announce Type: cross Abstract: Video agentic models have advanced challenging video-language tasks. However, most agentic approaches still he
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1w ago
LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
arXiv:2603.20192v1 Announce Type: cross Abstract: Recent advances in diffusion models have significantly improved text-to-video generation, enabling personalize
DeepCamp AI