Coding with Eyes: Visual Feedback Unlocks Reliable GUI Code Generating and Debugging

📰 ArXiv cs.AI

arXiv:2604.19750v1 Announce Type: cross Abstract: Recent advances in Large Language Model (LLM)-based agents have shown remarkable progress in code generation. However, current agent methods mainly rely on text-output-based feedback (e.g. command-line outputs) for multi-round debugging and struggle in graphical user interface (GUI) that involve visual information. This is mainly due to two limitations: 1) GUI programs are event-driven, yet existing methods cannot simulate user interactions to tr

Published 23 Apr 2026
Read full paper → ← Back to Reads