EfficientTDMPC: Improved MPC Objectives for Sample-Efficient Continuous Control

📰 ArXiv cs.AI

arXiv:2605.16692v2 Announce Type: cross Abstract: We introduce EfficientTDMPC, a sample-efficient model-based reinforcement learning method for continuous control built on the TD-MPC family of algorithms. Central to this family is a planner that aims to find an action sequence that maximizes the estimated return. The return is estimated using a learned model and value networks, each of which can introduce error. EfficientTDMPC proposes to reduce this error in two ways. First, it introduces an en

Published 19 May 2026
Read full paper → ← Back to Reads