From one model to seven — what it took to make TurboQuant model-portable

📰 Dev.to · Alberto Nieto

turboquant-vllm started out supporting only Molmo2. v1.3.0 validates seven model families, which required rewriting five Triton kernels and teaching the cache about sliding window attention.

Published 1 Apr 2026