Grounded World Model for Semantically Generalizable Planning
📰 ArXiv cs.AI
arXiv:2604.11751v1 Announce Type: cross Abstract: In Model Predictive Control (MPC), world models predict the future outcomes of various action proposals, which are then scored to guide the selection of the optimal action. For visuomotor MPC, the score function is a distance metric between a predicted image and a goal image, measured in the latent space of a pretrained vision encoder like DINO and JEPA. However, it is challenging to obtain the goal image in advance of the task execution, particu
DeepCamp AI