Towards Robust Sequential Decomposition for Complex Image Editing

📰 ArXiv cs.AI

arXiv:2605.09233v1 Announce Type: cross Abstract: Recent advances in visual generative models have enabled high-fidelity image editing guided by human instructions. However, these models often struggle with complex instructions involving combinatorial editing operations or inter-step dependencies. This difficulty stems from the limitations of two canonical paradigms: (1) single-turn editing, which attempts to apply all instructed edits in one pass, often fails to parse the complex instruction ac

Published 12 May 2026
Read full paper → ← Back to Reads