8,253 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (21843) ArXiv cs.AIDev.to AIMedium · AIMedium · ProgrammingForbes InnovationMedium · Machine Learning
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1mo ago
Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization
arXiv:2603.28959v1 Announce Type: cross Abstract: The exploration-exploitation trade-off is central to sequential decision-making and black-box optimization, ye
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
AutoWorld: Scaling Multi-Agent Traffic Simulation with Self-Supervised World Models
arXiv:2603.28963v1 Announce Type: cross Abstract: Multi-agent traffic simulation is central to developing and testing autonomous driving systems. Recent data-dr
ArXiv cs.AI 📄 Paper 1mo ago
The Spectral Edge Thesis: A Mathematical Framework for Intra-Signal Phase Transitions in Neural Network Training
arXiv:2603.28964v1 Announce Type: cross Abstract: We develop the spectral edge thesis: phase transitions in neural network training -- grokking, capability gain
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1mo ago
Privacy Guard & Token Parsimony by Prompt and Context Handling and LLM Routing
arXiv:2603.28972v1 Announce Type: cross Abstract: The large-scale adoption of Large Language Models (LLMs) forces a trade-off between operational cost (OpEx) an
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper 1mo ago
Design Principles for the Construction of a Benchmark Evaluating Security Operation Capabilities of Multi-agent AI Systems
arXiv:2603.28998v1 Announce Type: cross Abstract: As Large Language Models (LLMs) and multi-agent AI systems are demonstrating increasing potential in cybersecu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1mo ago
Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference
arXiv:2603.29002v1 Announce Type: cross Abstract: Modern large language models (LLMs) increasingly depends on efficient long-context processing and generation m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Improving Efficiency of GPU Kernel Optimization Agents using a Domain-Specific Language and Speed-of-Light Guidance
arXiv:2603.29010v1 Announce Type: cross Abstract: Optimizing GPU kernels with LLM agents is an iterative process over a large design space. Every candidate must
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Human-Like Lifelong Memory: A Neuroscience-Grounded Architecture for Infinite Interaction
arXiv:2603.29023v1 Announce Type: cross Abstract: Large language models lack persistent, structured memory for long-term interaction and context-sensitive retri
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning
arXiv:2603.29025v1 Announce Type: cross Abstract: Large language models systematically fail when a salient surface cue conflicts with an unstated feasibility co
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation
arXiv:2603.29029v1 Announce Type: cross Abstract: Recent multimodal face generation models address the spatial control limitations of text-to-image diffusion mo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Trojan-Speak: Bypassing Constitutional Classifiers with No Jailbreak Tax via Adversarial Finetuning
arXiv:2603.29038v1 Announce Type: cross Abstract: Fine-tuning APIs offered by major AI providers create new attack surfaces where adversaries can bypass safety
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
A Latent Risk-Aware Machine Learning Approach for Predicting Operational Success in Clinical Trials based on TrialsBank
arXiv:2603.29041v1 Announce Type: cross Abstract: Clinical trials are characterized by high costs, extended timelines, and substantial operational risk, yet rel
ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 1mo ago
CivicShield: A Cross-Domain Defense-in-Depth Framework for Securing Government-Facing AI Chatbots Against Multi-Turn Adversarial Attacks
arXiv:2603.29062v1 Announce Type: cross Abstract: LLM-based chatbots in government services face critical security gaps. Multi-turn adversarial attacks achieve
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
On the Mirage of Long-Range Dependency, with an Application to Integer Multiplication
arXiv:2603.29069v1 Announce Type: cross Abstract: Integer multiplication has long been considered a hard problem for neural networks, with the difficulty widely
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
WybeCoder: Verified Imperative Code Generation
arXiv:2603.29088v1 Announce Type: cross Abstract: Recent progress in large language models (LLMs) has advanced automatic code generation and formal theorem prov
ArXiv cs.AI 📄 Paper 1mo ago
WorldFlow3D: Flowing Through 3D Distributions for Unbounded World Generation
arXiv:2603.29089v1 Announce Type: cross Abstract: Unbounded 3D world generation is emerging as a foundational task for scene modeling in computer vision, graphi
ArXiv cs.AI 📄 Paper 1mo ago
APEX-EM: Non-Parametric Online Learning for Autonomous Agents via Structured Procedural-Episodic Experience Replay
arXiv:2603.29093v1 Announce Type: cross Abstract: LLM-based autonomous agents lack persistent procedural memory: they re-derive solutions from scratch even when
ArXiv cs.AI 📄 Paper 1mo ago
Evaluating a Data-Driven Redesign Process for Intelligent Tutoring Systems
arXiv:2603.29094v1 Announce Type: cross Abstract: Past research has defined a general process for the data-driven redesign of educational technologies and has s
ArXiv cs.AI 📄 Paper 1mo ago
SemLoc: Structured Grounding of Free-Form LLM Reasoning for Fault Localization
arXiv:2603.29109v1 Announce Type: cross Abstract: Fault localization identifies program locations responsible for observed failures. Existing techniques rank su
ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper 1mo ago
Towards Explainable Stakeholder-Aware Requirements Prioritisation in Aged-Care Digital Health
arXiv:2603.29114v1 Announce Type: cross Abstract: Requirements engineering for aged-care digital health must account for human aspects, because requirement prio