Research Papers

16,243 articles · Updated every 3 hours

Detecting and Mitigating Bias by Treating Fairness as a Symmetry Operation

arXiv:2606.06514v1 Announce Type: new Abstract: Machine learning systems deployed in high stakes socioeconomic settings routinely display bias. We formalize bia

ArXiv cs.AI 📄 Paper 6h ago

DiBS: Diffusion-Informed Branch Selection

arXiv:2606.06518v1 Announce Type: new Abstract: Sudoku is a representative constraint satisfaction problem that requires global structural reasoning under stric

ArXiv cs.AI 📄 Paper 6h ago

SafeGene: Reusable Adapters for Transferable Safety Alignment

arXiv:2606.06519v1 Announce Type: new Abstract: Open-weight LLMs are increasingly fine-tuned into customized assistants, but downstream fine-tuning can weaken s

ArXiv cs.AI 📄 Paper 6h ago

Lean4Agent: Formal Modeling and Verification for Agent Workflow and Trajectory

arXiv:2606.06523v1 Announce Type: new Abstract: Equipping Large Language Models (LLMs) to execute reliable multi-step workflows has become a central challenge i

ArXiv cs.AI 📄 Paper 6h ago

CrowdMath: A Dataset of Crowdsourced Mathematical Research Discussions

arXiv:2606.06526v1 Announce Type: new Abstract: Large language models have made substantial progress on mathematical reasoning, but existing benchmarks typicall

ArXiv cs.AI 📄 Paper 6h ago

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety

arXiv:2606.06529v1 Announce Type: new Abstract: An attacker that strategically chooses when to attack is much harder to catch than one that attacks indiscrimina

ArXiv cs.AI 📄 Paper 6h ago

CARVE-Q: Quantum-Proposed, Classically Certified Interactive Driving Repair

arXiv:2606.06531v1 Announce Type: new Abstract: The critical question after a correct driving veto is not only whether a maneuver is unsafe, but whether the blo

ArXiv cs.AI 📄 Paper 6h ago

Position: Don't Just "Fix it in Post": A Science of AI Must Study Training Dynamics

arXiv:2606.06533v1 Announce Type: new Abstract: What would it mean to have a scientific understanding of AI? Models are not static objects: they are snapshots o

ArXiv cs.AI 📄 Paper 6h ago

Accelerated Fourier SAT (AFSAT): Fully Realising a GPU-based Symmetric Pseudo-Boolean SAT Solver

arXiv:2606.06641v1 Announce Type: new Abstract: We present Accelerated Fourier SAT (AFSAT), a GPU-accelerated solver for pseudo-Boolean satisfiability based on

ArXiv cs.AI 📄 Paper 6h ago

A Study of Parallel Continuous Local Search

arXiv:2606.06656v1 Announce Type: new Abstract: We study parallel Continuous Local Search (CLS) as a solution approach for Boolean satisfiability problems with

ArXiv cs.AI 📄 Paper 6h ago

AEGIS: A Backup Reflex for Physical AI

arXiv:2606.06660v1 Announce Type: new Abstract: Long-horizon robot manipulation tends to fail gradually: one bad step degrades the state, and the policy spirals

ArXiv cs.AI 📄 Paper 6h ago

A Geometric Account of Activation Steering through Angle-Norm Decomposition

arXiv:2606.06735v1 Announce Type: new Abstract: Linear activation steering has gained popularity as a simple and empirically effective way to control language m

ArXiv cs.AI 📄 Paper 6h ago

OpenSkill: Open-World Self-Evolution for LLM Agents

arXiv:2606.06741v1 Announce Type: new Abstract: Self-evolving agents require adaptation after deployment, but existing approaches assume a usable learning loop

ArXiv cs.AI 📄 Paper 6h ago

AdMem: Advanced Memory for Task-solving Agents

arXiv:2606.06787v1 Announce Type: new Abstract: Large Language Models (LLMs) show promise as tool-using agents but remain limited in long-horizon tasks that req

ArXiv cs.AI 📄 Paper 6h ago

Evidence-Based Intelligent Diagnostic and Therapeutic Visualization System with Large Language Models: Multi-Turn Interaction and Multimodal Treatment Plan Generation

arXiv:2606.06869v1 Announce Type: new Abstract: Aim: Existing AI-assisted traditional Chinese medicine diagnostic tools suffer from opaque reasoning processes,

ArXiv cs.AI 📄 Paper 6h ago

Workflow-to-Skill: Skill Creation via Routing-Workflow-Semantics-Attachments Decomposition

arXiv:2606.06893v1 Announce Type: new Abstract: Large language model agents increasingly rely on Skills to encode procedural knowledge, yet high-quality Skills

ArXiv cs.AI 📄 Paper 6h ago

Declarative Skills for AI Agents in Knowledge-Grounded Tool-Use Workflows

arXiv:2606.06923v1 Announce Type: new Abstract: We study orchestration mechanisms for tool-using AI agents in realistic customer-service workflows over an unstr

ArXiv cs.AI 📄 Paper 6h ago

Quantum-Inspired Trace-Augmented Evidence Selection for Reasoning over Structured Hypothesis Spaces

arXiv:2606.06941v1 Announce Type: new Abstract: Large language models (LLMs) now solve a wide range of expert-level exams at or above human level, yet remain br

ArXiv cs.AI 📄 Paper 6h ago

Accounting for Context: Shaping Moral Credences for Value Alignment

arXiv:2606.06972v1 Announce Type: new Abstract: Ensuring that agent behaviours are aligned with human moral values inevitably raises the problem of how to accou

ArXiv cs.AI 📄 Paper 6h ago

Exploring Agentic Tool-Calling Decisions via Uncertainty-Aligned Reinforcement Learning

arXiv:2606.06976v1 Announce Type: new Abstract: Large language model (LLM)-based agents often make suboptimal tool-use decisions, including unsupported tool inv

ArXiv cs.AI 📄 Paper 6h ago

Teaching the Way, Not the Answer: Privileged Tutoring Distillation for Multimodal Policy Optimization

arXiv:2606.07000v1 Announce Type: new Abstract: Recent post-training methods, particularly Reinforcement Learning with Verifiable Rewards (RLVR), have significa

ArXiv cs.AI 📄 Paper 6h ago

The Sim-to-Real Gap of Foundation Model Agents: A Unified MDP Perspective

arXiv:2606.07017v1 Announce Type: new Abstract: Foundation model agents are increasingly deployed for real-world decision-making, but suffer from the sim-to-rea

ArXiv cs.AI 📄 Paper 6h ago

StainFlow: Entity-Stain Tracking and Evidence Linking for Process Rewards in GUI Agents

arXiv:2606.07027v1 Announce Type: new Abstract: Reinforcement Learning (RL) has become a promising approach for improving GUI Agents in long-horizon, stochastic

ArXiv cs.AI 📄 Paper 6h ago

Hierarchical Semantic-Constrained Heterogeneous Graph for Audio-Visual Event Localization

arXiv:2606.07033v1 Announce Type: new Abstract: Open-vocabulary audio-visual event localization (OV-AVEL) jointly models audio-visual cues to recognize and temp