📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 2,044 articles · Updated every 3 hours

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
From Untamed Black Box to Interpretable Pedagogical Orchestration: The Ensemble of Specialized LLMs Architecture for Adaptive Tutoring
arXiv:2603.23990v1 Announce Type: cross Abstract: Monolithic Large Language Models (LLMs) used in educational dialogue often behave as "black boxes," where peda
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Understanding the Challenges in Iterative Generative Optimization with LLMs
arXiv:2603.23994v1 Announce Type: cross Abstract: Generative optimization uses large language models (LLMs) to iteratively improve artifacts (such as code, work
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Schema on the Inside: A Two-Phase Fine-Tuning Method for High-Efficiency Text-to-SQL at Scale
arXiv:2603.24023v1 Announce Type: cross Abstract: Applying large, proprietary API-based language models to text-to-SQL tasks poses a significant industry challe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs
arXiv:2603.24034v1 Announce Type: cross Abstract: Contextual automatic speech recognition (ASR) with Speech-LLMs is typically trained with oracle conversation h
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification
arXiv:2603.24058v1 Announce Type: cross Abstract: Object hallucination in Large Vision-Language Models (LVLMs) severely compromises their reliability in real-wo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm
arXiv:2603.24079v1 Announce Type: cross Abstract: Recently, multimodal large language models (MLLMs) have emerged as a unified paradigm for language and image g
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning
arXiv:2603.24083v1 Announce Type: cross Abstract: This paper introduces Knowledge Graph based Massively Multi-task Model-based Policy Optimization (KG-M3PO), a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization
arXiv:2603.24093v1 Announce Type: cross Abstract: Recently, reinforcement learning (RL) has become an important approach for improving the capabilities of large
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago
KCLNet: Electrically Equivalence-Oriented Graph Representation Learning for Analog Circuits
arXiv:2603.24101v1 Announce Type: cross Abstract: Digital circuits representation learning has made remarkable progress in the electronic design automation doma
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago
Comparative analysis of dual-form networks for live land monitoring using multi-modal satellite image time series
arXiv:2603.24109v1 Announce Type: cross Abstract: Multi-modal Satellite Image Time Series (SITS) analysis faces significant computational challenges for live la
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation
arXiv:2603.24124v1 Announce Type: cross Abstract: RLHF-aligned language models exhibit response homogenization: on TruthfulQA (n=790), 40-79% of questions produ
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare
arXiv:2603.24132v1 Announce Type: cross Abstract: Conversational artificial intelligence has the potential to assist users in preliminary medical consultations,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula
arXiv:2603.24202v1 Announce Type: cross Abstract: Reinforcement learning (RL) has emerged as a powerful paradigm for improving large language models beyond supe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search
arXiv:2603.24203v1 Announce Type: cross Abstract: Recent advances in the Model Context Protocol (MCP) have enabled large language models (LLMs) to invoke extern
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Powerful Teachers Matter: Text-Guided Multi-view Knowledge Distillation with Visual Prior Enhancement
arXiv:2603.24208v1 Announce Type: cross Abstract: Knowledge distillation transfers knowledge from large teacher models to smaller students for efficient inferen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage
arXiv:2603.24213v1 Announce Type: cross Abstract: Deep learning models for time series imputation are now essential in fields such as healthcare, the Internet o
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago
Where Do Your Citations Come From? Citation-Constellation: A Free, Open-Source, No-Code, and Auditable Tool for Citation Network Decomposition with Complementary BARON and HEROCON Scores
arXiv:2603.24216v1 Announce Type: cross Abstract: Standard citation metrics treat all citations as equal, obscuring the social and structural pathways through w
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias
arXiv:2603.24218v1 Announce Type: cross Abstract: Large Language Models (LLMs) enhanced with Retrieval-Augmented Generation (RAG) have achieved substantial impr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing
arXiv:2603.24221v1 Announce Type: cross Abstract: The increasing complexity and interconnectivity of digital infrastructures make scalable and reliable security
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
DVM: Real-Time Kernel Generation for Dynamic AI Models
arXiv:2603.24239v1 Announce Type: cross Abstract: Dynamism is common in AI computation, e.g., the dynamic tensor shapes and the dynamic control flows in models.
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago
Embracing Heteroscedasticity for Probabilistic Time Series Forecasting
arXiv:2603.24254v1 Announce Type: cross Abstract: Probabilistic time series forecasting (PTSF) aims to model the full predictive distribution of future observat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep
arXiv:2603.24260v1 Announce Type: cross Abstract: Diffusion-based video editing has emerged as an important paradigm for high-quality and flexible content gener
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago
Bridging Biological Hearing and Neuromorphic Computing: End-to-End Time-Domain Audio Signal Processing with Reservoir Computing
arXiv:2603.24283v1 Announce Type: cross Abstract: Despite the advancements in cutting-edge technologies, audio signal processing continues to pose challenges an
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
The Specification Gap: Coordination Failure Under Partial Knowledge in Code Agents
arXiv:2603.24284v1 Announce Type: cross Abstract: When multiple LLM-based code agents independently implement parts of the same class, they must agree on shared