Quantile Geometry Regularization for Distributional Reinforcement Learning
📰 ArXiv cs.AI
arXiv:2605.08182v1 Announce Type: cross Abstract: Quantile-based distributional reinforcement learning methods learn return distributions through sampled quantile regression, but their bootstrapped target quantiles may induce distorted or degenerate distribution estimates. We propose Robust Quantile-based Implicit Quantile Networks (RQIQN), a lightweight Wasserstein distributionally robust enhancement boosted from a quantile estimation perspective. We first reinterpret a snapshot of IQN loss as
DeepCamp AI