EfficientTDMPC: Improved MPC Objectives for Sample-Efficient Continuous Control
📰 ArXiv cs.AI
arXiv:2605.16692v2 Announce Type: cross Abstract: We introduce EfficientTDMPC, a sample-efficient model-based reinforcement learning method for continuous control built on the TD-MPC family of algorithms. Central to this family is a planner that aims to find an action sequence that maximizes the estimated return. The return is estimated using a learned model and value networks, each of which can introduce error. EfficientTDMPC proposes to reduce this error in two ways. First, it introduces an en
DeepCamp AI