BiCLIP: Domain Canonicalization via Structured Geometric Transformation

📰 ArXiv cs.AI

arXiv:2603.08942v2 Announce Type: replace-cross Abstract: Recent advances in vision-language models (VLMs) have demonstrated remarkable zero-shot capabilities, yet adapting these models to specialized domains remains a significant challenge. Building on recent theoretical insights suggesting that independently trained VLMs are related by a canonical transformation, we extend this understanding to the concept of domains. We hypothesize that image features across disparate domains are related by a

Published 14 Apr 2026

Read full paper → ← Back to Reads