BiCLIP: Domain Canonicalization via Structured Geometric Transformation
📰 ArXiv cs.AI
arXiv:2603.08942v2 Announce Type: replace-cross Abstract: Recent advances in vision-language models (VLMs) have demonstrated remarkable zero-shot capabilities, yet adapting these models to specialized domains remains a significant challenge. Building on recent theoretical insights suggesting that independently trained VLMs are related by a canonical transformation, we extend this understanding to the concept of domains. We hypothesize that image features across disparate domains are related by a
DeepCamp AI