AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

From Untamed Black Box to Interpretable Pedagogical Orchestration: The Ensemble of Specialized LLMs Architecture for Adaptive Tutoring

arXiv:2603.23990v1 Announce Type: cross Abstract: Monolithic Large Language Models (LLMs) used in educational dialogue often behave as "black boxes," where peda

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Understanding the Challenges in Iterative Generative Optimization with LLMs

arXiv:2603.23994v1 Announce Type: cross Abstract: Generative optimization uses large language models (LLMs) to iteratively improve artifacts (such as code, work

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Schema on the Inside: A Two-Phase Fine-Tuning Method for High-Efficiency Text-to-SQL at Scale

arXiv:2603.24023v1 Announce Type: cross Abstract: Applying large, proprietary API-based language models to text-to-SQL tasks poses a significant industry challe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs

arXiv:2603.24034v1 Announce Type: cross Abstract: Contextual automatic speech recognition (ASR) with Speech-LLMs is typically trained with oracle conversation h

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification

arXiv:2603.24058v1 Announce Type: cross Abstract: Object hallucination in Large Vision-Language Models (LVLMs) severely compromises their reliability in real-wo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

arXiv:2603.24079v1 Announce Type: cross Abstract: Recently, multimodal large language models (MLLMs) have emerged as a unified paradigm for language and image g

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning

arXiv:2603.24083v1 Announce Type: cross Abstract: This paper introduces Knowledge Graph based Massively Multi-task Model-based Policy Optimization (KG-M3PO), a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

arXiv:2603.24093v1 Announce Type: cross Abstract: Recently, reinforcement learning~(RL) has become an important approach for improving the capabilities of large

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

KCLNet: Electrically Equivalence-Oriented Graph Representation Learning for Analog Circuits

arXiv:2603.24101v1 Announce Type: cross Abstract: Digital circuits representation learning has made remarkable progress in the electronic design automation doma

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Comparative analysis of dual-form networks for live land monitoring using multi-modal satellite image time series

arXiv:2603.24109v1 Announce Type: cross Abstract: Multi-modal Satellite Image Time Series (SITS) analysis faces significant computational challenges for live la

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation

arXiv:2603.24124v1 Announce Type: cross Abstract: RLHF-aligned language models exhibit response homogenization: on TruthfulQA (n=790), 40-79% of questions produ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare

arXiv:2603.24132v1 Announce Type: cross Abstract: Conversational artificial intelligence has the potential to assist users in preliminary medical consultations,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

arXiv:2603.24202v1 Announce Type: cross Abstract: Reinforcement learning (RL) has emerged as a powerful paradigm for improving large language models beyond supe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search

arXiv:2603.24203v1 Announce Type: cross Abstract: Recent advances in the Model Context Protocol (MCP) have enabled large language models (LLMs) to invoke extern

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Powerful Teachers Matter: Text-Guided Multi-view Knowledge Distillation with Visual Prior Enhancement

arXiv:2603.24208v1 Announce Type: cross Abstract: Knowledge distillation transfers knowledge from large teacher models to smaller students for efficient inferen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage

arXiv:2603.24213v1 Announce Type: cross Abstract: Deep learning models for time series imputation are now essential in fields such as healthcare, the Internet o

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Where Do Your Citations Come From? Citation-Constellation: A Free, Open-Source, No-Code, and Auditable Tool for Citation Network Decomposition with Complementary BARON and HEROCON Scores

arXiv:2603.24216v1 Announce Type: cross Abstract: Standard citation metrics treat all citations as equal, obscuring the social and structural pathways through w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias

arXiv:2603.24218v1 Announce Type: cross Abstract: Large Language Models (LLMs) enhanced with Retrieval-Augmented Generation (RAG) have achieved substantial impr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing

arXiv:2603.24221v1 Announce Type: cross Abstract: The increasing complexity and interconnectivity of digital infrastructures make scalable and reliable security

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

DVM: Real-Time Kernel Generation for Dynamic AI Models

arXiv:2603.24239v1 Announce Type: cross Abstract: Dynamism is common in AI computation, e.g., the dynamic tensor shapes and the dynamic control flows in models.

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Embracing Heteroscedasticity for Probabilistic Time Series Forecasting

arXiv:2603.24254v1 Announce Type: cross Abstract: Probabilistic time series forecasting (PTSF) aims to model the full predictive distribution of future observat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep

arXiv:2603.24260v1 Announce Type: cross Abstract: Diffusion-based video editing has emerged as an important paradigm for high-quality and flexible content gener

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Bridging Biological Hearing and Neuromorphic Computing: End-to-End Time-Domain Audio Signal Processing with Reservoir Computing

arXiv:2603.24283v1 Announce Type: cross Abstract: Despite the advancements in cutting-edge technologies, audio signal processing continues to pose challenges an

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

The Specification Gap: Coordination Failure Under Partial Knowledge in Code Agents

arXiv:2603.24284v1 Announce Type: cross Abstract: When multiple LLM-based code agents independently implement parts of the same class, they must agree on shared

📰 ArXiv cs.AI