Orthogonal Quadratic Complements for Vision Transformer Feed-Forward Networks

📰 ArXiv cs.AI

arXiv:2604.09709v1 Announce Type: cross Abstract: Recent bilinear feed-forward replacements for vision transformers can substantially improve accuracy, but they often conflate two effects: stronger second-order interactions and increased redundancy relative to the main branch. We study a complementary design principle in which auxiliary quadratic features contribute only information not already captured by the dominant hidden representation. To this end, we propose Orthogonal Quadratic Complemen

Published 14 Apr 2026
Read full paper → ← Back to Reads