AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

CoverageBench: Evaluating Information Coverage across Tasks and Domains

arXiv:2603.20034v1 Announce Type: cross Abstract: We wish to measure the information coverage of an ad hoc retrieval algorithm, that is, how much of the range o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families

arXiv:2603.20042v1 Announce Type: cross Abstract: Large language models (LLMs) have driven substantial advances in speech language models (SpeechLMs), yielding

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries

arXiv:2603.20062v1 Announce Type: cross Abstract: When a traveler asks an AI search engine to recommend a hotel, which sources get cited -- and does query frami

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Fine-tuning Timeseries Predictors Using Reinforcement Learning

arXiv:2603.20063v1 Announce Type: cross Abstract: This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Agentic Harness for Real-World Compilers

arXiv:2603.20075v1 Announce Type: cross Abstract: Compilers are critical to modern computing, yet fixing compiler bugs is difficult. While recent large language

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain

arXiv:2603.20094v1 Announce Type: cross Abstract: Large manufacturing companies face challenges in information retrieval due to data silos maintained by differe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

arXiv:2603.20100v1 Announce Type: cross Abstract: Direct Preference Optimization (DPO) is widely used after supervised fine-tuning (SFT) to align language model

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Spectral Alignment in Forward-Backward Representations via Temporal Abstraction

arXiv:2603.20103v1 Announce Type: cross Abstract: Forward-backward (FB) representations provide a powerful framework for learning the successor representation (

ArXiv cs.AI 📄 Paper 1w ago

The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $\lambda$-Calculus

arXiv:2603.20105v1 Announce Type: cross Abstract: LLMs are increasingly used as general-purpose reasoners, but long inputs remain bottlenecked by a fixed contex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture -- Bridging Predictive and Generative Self-Supervised Learning

arXiv:2603.20111v1 Announce Type: cross Abstract: The Joint-Embedding Predictive Architecture (JEPA) is often seen as a non-generative alternative to likelihood

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech

arXiv:2603.20112v1 Announce Type: cross Abstract: Personalizing Automatic Speech Recognition (ASR) for non-normative speech remains challenging because data col

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning

arXiv:2603.20116v1 Announce Type: cross Abstract: Conventional fine-tuning on domain-specific datasets can inadvertently alter a model's pretrained multimodal p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

arXiv:2603.20122v1 Announce Type: cross Abstract: Large Language Models (LLMs) have been widely deployed, especially through free Web-based applications that ex

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

An Agentic Multi-Agent Architecture for Cybersecurity Risk Management

arXiv:2603.20131v1 Announce Type: cross Abstract: Getting a real cybersecurity risk assessment for a small organization is expensive -- a NIST CSF-aligned engag

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Enhancing Hyperspace Analogue to Language (HAL) Representations via Attention-Based Pooling for Text Classification

arXiv:2603.20149v1 Announce Type: cross Abstract: The Hyperspace Analogue to Language (HAL) model relies on global word co-occurrence matrices to construct dist

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case

arXiv:2603.20151v1 Announce Type: cross Abstract: Engineering system design -- whether mechatronic, control, or embedded -- often proceeds in an ad hoc manner,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

arXiv:2603.20161v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks. However, the trut

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning

arXiv:2603.20164v1 Announce Type: cross Abstract: Conventional robot social behavior generation has been limited in flexibility and autonomy, relying on predefi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation

arXiv:2603.20172v1 Announce Type: cross Abstract: Recent work on chain-of-thought (CoT) faithfulness reports single aggregate numbers (e.g., DeepSeek-R1 acknowl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

AI Agents Can Already Autonomously Perform Experimental High Energy Physics

arXiv:2603.20179v1 Announce Type: cross Abstract: Large language model-based AI agents are now able to autonomously execute substantial portions of a high energ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Adaptive Greedy Frame Selection for Long Video Understanding

arXiv:2603.20180v1 Announce Type: cross Abstract: Large vision-language models (VLMs) are increasingly applied to long-video question answering, yet inference

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning

arXiv:2603.20181v1 Announce Type: cross Abstract: The use of ML in cybersecurity has long been impaired by generalization issues: Models that work well in contr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking

arXiv:2603.20185v1 Announce Type: cross Abstract: Video agentic models have advanced challenging video-language tasks. However, most agentic approaches still he

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

arXiv:2603.20192v1 Announce Type: cross Abstract: Recent advances in diffusion models have significantly improved text-to-video generation, enabling personalize
