NEO-unify — A 2B multimodal model with no Vision Encoder, no VAE. Open source coming "hopefully not too long"
📰 Reddit r/LocalLLaMA
<img src="https://preview.redd.it/23t1c330b2vg1.jpg?width=140&height=122&auto=webp&s=a3f3a14cc2a5cfcf11a5db37cb7094b9aade7d37" alt="NEO-unify — A 2B multimodal model with no Vision Encoder, no VAE. Open source coming "hopefully not too long"" title="NEO-unify — A 2B multimodal model with no Vision Encoder, no VAE. Open source coming "hopefully not too long""
DeepCamp AI