Policy-Guided World Model Planning for Language-Conditioned Visual Navigation

📰 ArXiv cs.AI

PiJEPA framework combines navigation policies with latent world model planning for language-conditioned visual navigation

advanced Published 30 Mar 2026

Action Steps

Learned navigation policies are used to initialize actions in a high-dimensional space
Latent world model planning is employed to plan long-horizon trajectories
The two-stage framework combines the strengths of both approaches to improve navigation performance
Evaluation of the framework is done on language-conditioned visual navigation tasks to demonstrate its effectiveness

Who Needs to Know This

AI engineers and researchers on a team working on embodied AI and navigation tasks can benefit from this framework as it addresses long-horizon planning challenges and poor action initialization in high-dimensional spaces

Key Insight

💡 Combining learned navigation policies with latent world model planning can effectively address challenges in long-horizon planning and action initialization