Continual Model-Based Reinforcement Learning with Hypernetworks

📰 ArXiv cs.AI

arXiv:2009.11997v3 Announce Type: replace-cross

Abstract: Effective planning in model-based reinforcement learning (MBRL) and model-predictive control (MPC) relies on the accuracy of the learned dynamics model. In many instances of MBRL and MPC, this model is assumed to be stationary and is periodically re-trained from scratch on state transition experience collected from the beginning of environment interactions. This implies that the time required to train the dynamics model - and the pause re

Published 27 May 2026
