Research Papers

16,243 articles · Updated every 3 hours

ArXiv cs.AI 📄 Paper 6h ago
CrowdMath: A Dataset of Crowdsourced Mathematical Research Discussions
arXiv:2606.06526v1 Announce Type: new Abstract: Large language models have made substantial progress on mathematical reasoning, but existing benchmarks typicall
ArXiv cs.AI 📄 Paper 6h ago
Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety
arXiv:2606.06529v1 Announce Type: new Abstract: An attacker that strategically chooses when to attack is much harder to catch than one that attacks indiscrimina
ArXiv cs.AI 📄 Paper 6h ago
CARVE-Q: Quantum-Proposed, Classically Certified Interactive Driving Repair
arXiv:2606.06531v1 Announce Type: new Abstract: The critical question after a correct driving veto is not only whether a maneuver is unsafe, but whether the blo
ArXiv cs.AI 📄 Paper 6h ago
Position: Don't Just "Fix it in Post": A Science of AI Must Study Training Dynamics
arXiv:2606.06533v1 Announce Type: new Abstract: What would it mean to have a scientific understanding of AI? Models are not static objects: they are snapshots o
ArXiv cs.AI 📄 Paper 6h ago
Accelerated Fourier SAT (AFSAT): Fully Realising a GPU-based Symmetric Pseudo-Boolean SAT Solver
arXiv:2606.06641v1 Announce Type: new Abstract: We present Accelerated Fourier SAT (AFSAT), a GPU-accelerated solver for pseudo-Boolean satisfiability based on
ArXiv cs.AI 📄 Paper 6h ago
A Study of Parallel Continuous Local Search
arXiv:2606.06656v1 Announce Type: new Abstract: We study parallel Continuous Local Search (CLS) as a solution approach for Boolean satisfiability problems with
ArXiv cs.AI 📄 Paper 6h ago
AEGIS: A Backup Reflex for Physical AI
arXiv:2606.06660v1 Announce Type: new Abstract: Long-horizon robot manipulation tends to fail gradually: one bad step degrades the state, and the policy spirals
ArXiv cs.AI 📄 Paper 6h ago
A Geometric Account of Activation Steering through Angle-Norm Decomposition
arXiv:2606.06735v1 Announce Type: new Abstract: Linear activation steering has gained popularity as a simple and empirically effective way to control language m
ArXiv cs.AI 📄 Paper 6h ago
OpenSkill: Open-World Self-Evolution for LLM Agents
arXiv:2606.06741v1 Announce Type: new Abstract: Self-evolving agents require adaptation after deployment, but existing approaches assume a usable learning loop
ArXiv cs.AI 📄 Paper 6h ago
AdMem: Advanced Memory for Task-solving Agents
arXiv:2606.06787v1 Announce Type: new Abstract: Large Language Models (LLMs) show promise as tool-using agents but remain limited in long-horizon tasks that req
ArXiv cs.AI 📄 Paper 6h ago
Evidence-Based Intelligent Diagnostic and Therapeutic Visualization System with Large Language Models: Multi-Turn Interaction and Multimodal Treatment Plan Generation
arXiv:2606.06869v1 Announce Type: new Abstract: Aim: Existing AI-assisted traditional Chinese medicine diagnostic tools suffer from opaque reasoning processes,
ArXiv cs.AI 📄 Paper 6h ago
Workflow-to-Skill: Skill Creation via Routing-Workflow-Semantics-Attachments Decomposition
arXiv:2606.06893v1 Announce Type: new Abstract: Large language model agents increasingly rely on Skills to encode procedural knowledge, yet high-quality Skills
ArXiv cs.AI 📄 Paper 6h ago
Declarative Skills for AI Agents in Knowledge-Grounded Tool-Use Workflows
arXiv:2606.06923v1 Announce Type: new Abstract: We study orchestration mechanisms for tool-using AI agents in realistic customer-service workflows over an unstr
ArXiv cs.AI 📄 Paper 6h ago
Quantum-Inspired Trace-Augmented Evidence Selection for Reasoning over Structured Hypothesis Spaces
arXiv:2606.06941v1 Announce Type: new Abstract: Large language models (LLMs) now solve a wide range of expert-level exams at or above human level, yet remain br
ArXiv cs.AI 📄 Paper 6h ago
Accounting for Context: Shaping Moral Credences for Value Alignment
arXiv:2606.06972v1 Announce Type: new Abstract: Ensuring that agent behaviours are aligned with human moral values inevitably raises the problem of how to accou
ArXiv cs.AI 📄 Paper 6h ago
Exploring Agentic Tool-Calling Decisions via Uncertainty-Aligned Reinforcement Learning
arXiv:2606.06976v1 Announce Type: new Abstract: Large language model (LLM)-based agents often make suboptimal tool-use decisions, including unsupported tool inv
ArXiv cs.AI 📄 Paper 6h ago
Teaching the Way, Not the Answer: Privileged Tutoring Distillation for Multimodal Policy Optimization
arXiv:2606.07000v1 Announce Type: new Abstract: Recent post-training methods, particularly Reinforcement Learning with Verifiable Rewards (RLVR), have significa
ArXiv cs.AI 📄 Paper 6h ago
The Sim-to-Real Gap of Foundation Model Agents: A Unified MDP Perspective
arXiv:2606.07017v1 Announce Type: new Abstract: Foundation model agents are increasingly deployed for real-world decision-making, but suffer from the sim-to-rea
ArXiv cs.AI 📄 Paper 6h ago
StainFlow: Entity-Stain Tracking and Evidence Linking for Process Rewards in GUI Agents
arXiv:2606.07027v1 Announce Type: new Abstract: Reinforcement Learning (RL) has become a promising approach for improving GUI Agents in long-horizon, stochastic
ArXiv cs.AI 📄 Paper 6h ago
Hierarchical Semantic-Constrained Heterogeneous Graph for Audio-Visual Event Localization
arXiv:2606.07033v1 Announce Type: new Abstract: Open-vocabulary audio-visual event localization (OV-AVEL) jointly models audio-visual cues to recognize and temp